Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to write a custom partitioner for a Hadoop MapReduce job?
What is cluster in Cassandra data model?
What are the Optimizations a developer can use during joins?
How to use 'foreach' operation in pig scripts?
What is the procedure of data storage in cassandra?
How does cassandra perform write operations?
What is the work of hive/hcatalog?
How Hadoop’s CLASSPATH plays a vital role in starting or stopping in Hadoop daemons?
How can you manually partition the rdd?
Can you list few commonly used hive services?
What are the languages supported by apache spark and which is the most popular one?
Where is the Mapper Output intermediate kay-value data stored ?
Explain transformation and action in RDD in Apache Spark?
Explain when using field grouping in storm, is there any time-out or limit to known field values?
How many compaction types are in HBase?