Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is kafka open source?
Suppose hadoop spawned 100 tasks for a job and one of the tasks failed. What will hadoop do?
What is the difference between hbase and hdfs?
How Hive distributes the rows into buckets?
What are partitions and tokens in cassandra?
Is the hdfs block size reduced to achieve faster query results?
What is distinct clause in apache tajo?
Explain Spark SQL caching and uncaching?
How do you stop a spark?
What is kafka Producer?
Why should we use ‘orderby’ keyword in pig scripts?
How the Client communicates with HDFS?
How do I stop flume agent?
What is the role of Connector API?
What is pig latin statements?