Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What do you know about nlineinputformat?
How to start a kafka server?
While reading data from hbase, from which three places data will be reconciled before returning the value?
How can one check whether NameNode is working or not?
What is Apache Spark and what are the benefits of Spark over MapReduce?
Define Actions.
Explain the maximum size of a message that can be received by the Kafka?
Would you be able to change the block size of hdfs files?
What must we know to work on Zookeeper well?
Clarify how hive de-serialize and serialize the information?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
What is the use of binstorage?
Define catalog tables in HBase?
Is it possible to add or delete column families in a working group?
How does spark rdd work?