Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How Mapper is instantiated in a running job?
What is the role of the kafka producer api.
Illustrate a simple example of the working of MapReduce.
Is there another way to check whether Namenode is working?
How is it different from doing machine learning in r or sas?
What is the Reducer used for?
Define Spark Streaming.
Is it legal to set the number of reducer task to zero? Where the output will be stored in this case?
Explain what does hbase consists of?
What is a keyspace in Cassandra?
What is the use of shutdown command?
What is the significance of using –compress-codec parameter?
What are the three types of tombstone markers in hbase?
what is difference between pig and sql?
If a Replica stays out of the ISR for a long time, what does it signify?