Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the main purpose of HDFS fsck command?
What is UDF in Pig?
How does gossip protocol work?
What is spark certification?
What is pagerank?
Can you modify the file present in hdfs?
What are the Optimizations a developer can use during joins?
How to explain Bigdatadeveloper projects
how is a file of the size 1 GB uncompressed
Compare hbase vs hdfs?
What is the function of ApplicationMaster?
What are ‘maps’ and ‘reduces’?
What is Client API?
When is it not recommended to use MapReduce paradigm for large scale data processing?
Why is BlinkDB used?