Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
when do reducers play their role in a mapreduce task?
What is Chain Mapper?
What is Distributed Cache?
Explain briefly what is Action in Apache Spark? How is final result generated using an action?
Explain the core benefits for hadoop users by using the apache ambari?
When do you have to avoid secondary indexes?
Explain what happens in textinformat ?
How can I import large objects (BLOB and CLOB objects) in Apache Sqoop?
How do you overwrite replication factor?
What is Hector in Cassandra?
What are the different methods to set up local repositories?
What is spark reducebykey?
Give some points of hive for hadoop ?
What is sc parallelize in spark?
Is it mandatory to set input and output type/format in MapReduce?