Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain about the execution plans of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
what daemons run on a master node and slave nodes?
What are different String functions available in PIG?
What is SequenceFileInputFormat in Hadoop MapReduce?
Name some companies that are already using Spark Streaming?
What is a bloom filter?
What is the role of recordreader in hadoop mapreduce?
What are the different modes in which we can configure/install Hadoop?
When to use hadoop, hbase, hive and pig?
What is sqoop in Hadoop ?
What are the features of Pseudo mode?
Explain totuple function?
What happens when the data set exceeds available memory?
Which command do we use to show the version?
Explain JobConf in MapReduce.