Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is JobTracker in Hadoop? What are the actions followed by Hadoop?
What are input format, input split, and record reader, and what do they do?
How do you configure Hadoop to reuse JVMs for mappers?
What are the roles and responsibilities of worker nodes in an Apache Spark cluster? Is a Worker Node in Spark the same as a Slave Node?
How to containerize ZooKeeper with Docker?
Explain the data model of HBase.
How can we ensure that the values for a particular key go to the same reducer?
What is the Pregel API?
Can Ambari manage multiple clusters?
Is Apache Spark a tool?
Explain a common use case for Flume.
What are the major features/characteristics of RDDs (Resilient Distributed Datasets)?
What is Cassandra used for, and why use it?
Are the JobTracker and TaskTrackers present on separate machines?
How to explain Big Data developer projects?