Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How do I load a big csv file into a partitioned table?
What are the roles of the file system in any framework?
Name a few commonly used spark ecosystems?
Why is hadoop faster?
Where is rdd stored?
Does the archiving of hive tables give any space saving in hdfs?
What are distinct operators in impala?
What does the Spark Engine do?
How do I start a spark cluster?
Define tasktracker.
What is cluster in Cassandra?
State syntax of the command to drop an index?
What are the different ways of executing Pig script?
how can we access the sub directories recursively?
Can you define serde in hive?