Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are mapreduce new and old apis while writing map reduce program?. Explain how it works
Explain about the different complex data types in Pig?
Explain Reliability and Failure Handling in Apache Flume?
In a very huge text file, you want to just check if a particular keyword exists. How would you do this using Spark?
Define the consistency levels for read operations in Cassandra?
What is the core of the job in MapReduce framework?
Define data centre?
Can multiple clients write into an HDFS file concurrently in hadoop?
What is the row key?
How to specify more than one directory as input to the MapReduce Job?
Does google use hadoop?
What do sorting and shuffling do?
List the advantage of Parquet files?
What do you mean by metadata in HDFS?
What is graph db?