Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is hdfs block size?
What do you know about sequencefileinputformat?
What combiners is and when you should use a combiner in a MapReduce Job?
What is lazy evaluation and how is it useful?
What is the use of foreach operation in Pig scripts?
Give a brief overview of Hadoop history?
When should you not use Cassandra? OR When to use RDBMS instead of Cassandra?
why use hcolumndescriptor class?
How does NameNode tackle DataNode failures?
What kinds of impala queries or data are best suited for hbase?
Define Spark-SQL?
Explain how input and output data format of the hadoop framework?
What will apache driver do?
Which language is best for spark?
How are joins performed in impala?