Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is spark a language?
How data or file is read in Hadoop HDFS?
Can you explain apache kafka?
What is presto?
What is the best way to copy files between HDFS clusters?
Why is output file name in Hadoop MapReduce part-r-00000?
What is the importance of driver in hive?
Can you list some useful zookeeper tools?
What is the use of BloomMapFile?
What is the throughput? How does hdfs give great throughput?
What is difference between spark and scala?
What are the filters are available in apache hbase?
How can you connect an application, if you run hive as a server?
What is the difference between kafka and flume?
When is it not recommended to use MapReduce paradigm for large