Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the advantages of DataSets?
What is bucketing ?
State some DDL Command with brief Description?
When should you use sequencefileinputformat?
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
What do you understand by standalone (or local) mode?
What is spark vs scala?
What are the features of Spark?
Does rdd have schema?
What are the steps involved in MapReduce framework?
Is multiline comment supported in Hive Script ?
What do you know about nlineinputformat?
The partition of hive table has been modified to point to a new directory location. Do I have to move the data to the new location or the data will be moved automatically to the new location?
What is struct and explain its purpose?
What are pig scripts?