Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?
42Explain the sequence of execution of all the components of MapReduce like a map, reduce, recordReader, split, combiner, partitioner, sort, shuffle.
708
what Hive query processor does?
Define the run-time architecture of Spark?
What is different table structure available in the hive?
Which command do we use to run HBase Shell?
What is the difference between SQL and NoSQL?
Can a partition be archived? What are the advantages and Disadvantages?
What are the features of presto?
When would you use hbase?
What do you mean by Schema Declaration?
Write a Pig UDF Example ?
Explain how can spark be connected to apache mesos?
Explain caching in spark streaming.
What is the use of map transformation?
What do we mean by Paraquet?
What is the definition of Hive?