Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain pig architecture?
What is the work of hive/hcatalog?
How multi-hop agent can be setup in Flume?
What do you mean by meta information in hdfs? List the documents related to metadata.
State some impala hadoop benefits?
When the reducers are are started in a mapreduce job?
What is sparkContext?
What is a generic UDF in the hive?
Does hdfs enable a customer to peruse a record, which is already opened for writing?
What is the data storage component used by Hadoop?
What is Pig Storage?
What is the problem in having lots of small files in hdfs?
What do sorting do?
What is a block in HDFS, why block size 64MB?
How can you transfer data from hive to hdfs?