Why does HDFS perform replication, even though it results in data redundancy in Hadoop?
What are some commonly used machine learning algorithms?
How do you change the replication factor for the cases below?
Why is reading done in parallel in HDFS while writing is not?
What is the default extension of the files produced by a Sqoop import using the --compress parameter?
What are the key features of HDFS?
Can I set the number of reducers to zero?
Which code is used to open a connection in HBase?
How do you set which framework is used to run a MapReduce program?
How can one write a custom RecordReader?
How does Cassandra perform a write operation?
How will you convert the string '51.2' to a float value in the price column?
What is an RDD lineage graph? How is it useful in achieving fault tolerance?
What is the use of the import command in Sqoop?
What are the different tasks we can perform when managing hosts using the Ambari Hosts tab?