Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
what is Memtable in Cassandra?
As part of optimizing the queries in hive, what should be the order of table size in a join query?
In cloudera there is already a cluster, but if I want to form a cluster on ubuntu can we do it?
What are the types of tables in Hive?
What is the difference between spark and apache spark?
What is the difference between spark and python?
What is the Repository?
What is Hadoop Custom partitioner ?
Explain what you understand by speculative execution
Does google use hadoop?
What are the four modules that make up the Apache Hadoop framework?
Explain the run-time architecture of Spark?
What are the features of RDD, that makes RDD an important abstraction of Spark?
Can I do trforms or add new functionality?
Explain data flow in Flume?