Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
State about ZooKeeper WebUI?
What all tasks you can perform for managing services using Ambari service tab?
How do you stop a running job gracefully?
What is Rack Awareness? What is its need in Hadoop?
Is there a date data type in Hive?
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
Mention some instances where zookeeper is using?
Is hadoop obsolete?
What is a namenode? How many instances of namenode run on a hadoop cluster?
What is fluming?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
How can we remove a znode?
How is mapreduce related to cloud computing?
How to enable buckets in Hive?
What is executor cores in spark?