Is Kafka open source?
What is the difference between Spark ML and Spark MLlib?
What are the main components of Hadoop?
What is the use of the create-hive-table command in Sqoop?
What are the different parts of Hive?
What languages does Spark support?
Why is Pig used in Hadoop?
What tools are available to send streaming data to HDFS?
What is a Block Scanner in HDFS?
What is an InputSplit in MapReduce?
What are the best features of Apache Avro?
How can Apache Spark be used alongside Hadoop?
What are the roles and responsibilities of worker nodes in an Apache Spark cluster? Is a worker node in Spark the same as a slave node?
Explain how ordering in HDFS is done.
Is map like a pointer?