Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is the hdfs block size reduced to achieve faster query results?
What is the use of paging cqlsh command in Cassandra?
How to create RDD?
what is SPF?
What are the log files of the presto server?
What do you mean by Stream Processing in Kafka?
explain Metadata in Namenode?
Is a job split into maps?
What is Zookeeper Cluster?
How will you consume CSV file into the Hive warehouse using built SerDe?
How can you debug a pig script?
What is flatten in pig?
What is the function of Cluster.Builder class in Cassandra?
What is MapReduce? What are the syntax you use to run a MapReduce program?
What is the utility of using Writable Comparable Custom Class in Map Reduce code?