Does Pig give any warning when there is a type mismatch or missing field?
What does sorting do?
What are the basic parameters of a Mapper?
What is the disadvantage of Spark SQL?
Name some components of Cassandra.
State some command-line options.
Explain how indexing is done in HDFS.
Why Hadoop MapReduce?
What is metastore?
What is Sqoop import? Explain its purpose.
If the number of DataNodes increases, do we need to upgrade the NameNode?
What is the function of NodeManager?
Define TaskTracker.
What is the difference between Hive and HDFS?
What is skewed data?