Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are all stats classes in the org.apache.pig.tools.pigstats package?
What is Flatten?
Mention what is the difference between hdfs and nas?
What is wal and hlog in hbase?
When to use hadoop, hbase, hive and pig?
Who is a 'user' in HDFS?
Which method is used to access HFile directly without using HBase?
How hive can improve performance with orc format tables?
What is graph db? Explain with an example.
Name different types of the data model?
what happens when Hadoop spawned 50 tasks for a job and one of the task failed?
How does gossip protocol help in failure detection?
How can you prevent a large job from running for a long time? What do u think is more popular among the developers - Pig or Hive?
How does pig work?
How is security achieved in Hadoop?