Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does hdfs give great throughput?
What is closing out ledgers?
List some use cases where classification machine learning algorithms can be used.
What is the role of Connector API?
What are the different CQL data manipulation commands in Cassandra?
What is the difference between spark ml and spark mllib?
What is anti-entropy?
How namenode handles data node failures?
How can we check whether namenode is working or not?
Explain HCatWriter?
Explain cap theorem?
Explain REVERSE function in Hive with example?
How much space will the split occupy in Mapreduce?
List the diagnostic operators in pig.
Explain data flow in Flume?