Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a distributed cache in the MapReduce framework?
Define the term Thrift.
What are the data components used by Hadoop?
What is Apache Hive?
Compare transformations and actions in Apache Spark.
What are the various components in Kafka?
Explain how Spark can be connected to Apache Mesos.
What are Impala built-in functions?
Define a composite type in Cassandra.
What is a relation in Pig?
What are DataFrames?
How do you configure Hadoop to reuse JVMs for mappers?
Explain the difference between Gen1 and Gen2 Hadoop with regard to the NameNode.
What is a block in HDFS? What is the default size in Hadoop 1 and Hadoop 2? Can we change the block size?
What is the importance of the driver in Hive?