Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Kundera in Cassandra?
What is the role of the TaskTracker in a Hadoop cluster?
Can Flume provide 100% reliability for the data flow?
What is the Sqoop command to show the contents of the job named myjob?
What is one of Kafka's best features?
What are the key components of the Hive architecture?
Can Flume distribute data to multiple destinations?
Is Spark part of Hadoop?
Can you give more details about SSH communication between the masters and the slaves?
Can Hadoop replace a relational database?
Why is Sqoop used?
If you run Hive as a server, what mechanisms are available for connecting to it from an application?
What do slaves consist of?
What is the importance of dfs.namenode.name.dir in HDFS?
What does the repartition() operation do in Spark?