Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Describe how hbase uses zookeeper?
How a task is scheduled by a jobtracker?
What is the difference between Input Split and an HDFS Block?
When is it not recommended to use MapReduce paradigm for large scale data processing?
State some command line options?
For using hadoop list the network requirements?
What are the Applications of Apache Pig?
What are the presto applications?
How can you implement machine learning in Spark?
How to show up details in pig ?
What are the primitive data types in Pig?
When should we use SORT BY instead of ORDER BY?
Define fault tolerance?
Explain the Methods Of ZooKeeper class?
What are the basic commands in Apache Sqoop and its uses?