Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is spark in big data?
Where is hadoop-env.sh file present?
Explain about the execution plans of a Pig Script? Or Differentiate between the logical and physical plan of an Apache Pig script?
Explain the Job OutputFormat?
What is the use of Bloom Filter in Cassandra?
What is apache spark and what is it used for?
What are the components of Apache Spark Ecosystem?
Mention Hadoop core components?
What is Spark SQL?
Explain Hadoop streaming?
What is a RecordReader in Hadoop MapReduce?
Why can we not create directory /user/dataflair/inpdata001 when name node is in safe mode?
What happens if there is an error in impala?
Why do we need MapReduce during Pig programming?
How does gossip protocol work?