Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What happens if one hadoop client renames a file or a directory containing this file while another client is still writing into it?
Which command is used to show the current hbase user?
Can hadoop handle streaming data?
Explain apache spark streaming? How is the processing of streaming data achieved in apache spark?
Can you briefly explain the apache mahout?
Map reduce jobs are failing on a cluster that was just restarted. They worked before restart. What could be wrong?
Explain the term 'Log Anatomy'?
What are the most commonly defined input formats in Hadoop?
Define ttl in hbase?
What are the names of daemons in impala?
What do you mean by ss table and explain how it is different from the other original tables?
What is the meaning of the term "non-DFS used" in Hadoop web-console?
Can hive queries be executed from script files? How?
What are the ways to launch Apache Spark over YARN?
Define Apache Pig?