Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?
Explain the sequence of execution of all the components of MapReduce, such as map, reduce, RecordReader, split, combiner, partitioner, sort, and shuffle.
Explain data type conversion in Hive.
What is the function of Cluster.Builder class in Cassandra?
What is Spark slang for?
How is a DAG created in Spark?
Why does a Mapper run in a heavyweight process rather than in a thread in MapReduce?
What does the file hadoop-metrics.properties do?
What is the purpose of sqoop-merge?
Name a few commonly used Spark ecosystem components.
What is Ganglia used for in Ambari?
What is the use of the PAGING cqlsh command in Cassandra?
What is meant by RDD lazy evaluation?
Is it possible to add a parameter while running a saved job?
How is data or a file written into Hadoop HDFS?
Mention some machine learning algorithms exposed by Mahout.