Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does HDFS provide high throughput?
Can you explain clustering in Mahout?
What is the use of the truncate command?
How does the Apache Spark engine work?
What is ZooKeeper?
How often do you need to reformat the NameNode?
Why can't aggregation be done in the Mapper?
What is Pig Latin?
mapper or reducer?
Where is Kafka used?
Explain the pipe() operation in Apache Spark.
What are the basic steps to writing a UDF in Pig?
Which file controls reporting in Hadoop?
Can Hadoop handle streaming data?
Explain the AvroStorage function.