Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How the HDFS Blocks are replicated?
Can you tell us more about ssh?
What is TextInputFormat in Hadoop?
What is spark executor cores?
Explain pipe() operation. How it writes the result to the standard output?
How Cassandra provide High availability feature?
What is DistributedCache and its purpose?
Mention how can you stop a partition form being queried?
Explain Catalyst framework?
What is the problem in having lots of small files in hdfs?
Can a table be renamed in Hive?
Differentiate between describe and describe extended?
What is tungsten engine in spark?
What is a bookie in bookkeeper?
How to set up local repository manually?