Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the core benefits for hadoop users by using the apache ambari?
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
How to set the number of mappers to be created in MapReduce?
Whether Pig Latin language is case-sensitive or not?
How is indexing done in HDFS?
Why is transformation lazy operation in Apache Spark RDD? How is it useful?
What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?
How can we create rdds in apache spark?
What is Safemode in Apache Hadoop?
What is meant by rdd lazy evaluation?
Which one is the master node in HDFS? Can it be commodity hardware?
Tell me about the execution modes of Apache Pig?
Explain Spark Executor
Mention how can you stop a partition form being queried?
Which data storage components are used by hadoop?