Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Where is kafka used?
Different running modes for running Pig?
What is difference between dataset and dataframe?
How can you stop a partition form being queried?
When should you use a reducer?
Explain why to use hbase?
Define a metadata?
What is sparkContext?
How do you parse data in xml? Which kind of class do you use with java to pass data?
Does 'ILLUSTRATE' run a MapReduce job?
How does inputsplit in mapreduce determines the record boundaries correctly?
Use of list-databases command in hadoop sqoop?
What are the most common InputFormats in Hadoop?
What is the use of “resultset execute” method?
How do I download spark?