Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What do you understand by the partitions in spark?
How to create a Sparse vector from a dense vector?
how Hadoop is different from other data processing tools?
How much is flume worth?
Can you list some useful zookeeper tools?
What is the command to change the replication factor ?
What happens if rdd partition is lost due to worker node failure?
How to set the number of mappers for a MapReduce job?
What is node?
Is it possible to rename the output file, and if so, how?
What are the different types of nosql databases?
What is bag data type in Pig?
What is Spark Driver?
What is column store db? Explain with an example.
Give any two features of flume?