Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the various data sources available in SparkSQL?
What do you mean by a bag in Pig?
Virtual Box & Ubuntu Installation?
How multi-hop agent can be setup in Flume?
Where can I find impala documentation?
What is hadoop framework?
How to save RDD?
Where is apache spark used?
How to change from su to cloudera?
What are the various advantages of DataFrame over RDD in Apache Spark?
Name the operating system(s) which are supported for production hadoop deployment?
Explain SHOW and DESCRIBE Commands in Hive?
What are some of the apache pig use cases you can think of?
Which technique can you use in hbase to access hfile directly without the help of hbase?
Explain what is wal and hlog in hbase?