Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which components are used for stream flow of data?
Explain deletion in hbase?
Explain a scenario where you will be using spark streaming.
when do reducers play their role in a mapreduce task?
What a task tracker is in hadoop?
What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
Is there any API available for implementing graphs in Spark?
Who created spark?
How is jmx useful in cassandra?
What is the Hadoop MapReduce API contract for a key and value Class?
Explain the features of stand alone (local) mode?
Explain what is sqoop in Hadoop ?
What is the abstraction of Spark Streaming?
How many job tracker processes can run on a single Hadoop cluster?
what is difference between int and intwritable?