Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to specify more than one path for storage in Hadoop?
When do you call the cleanup method?
What are the network requirements for Hadoop?
What is Safemode in Apache Hadoop?
Is Hadoop required for data science?
What is distributed cache in Spark?
What is Kafka?
What is the difference between persist() and cache()?
What problem does Apache Pig solve?
What is the disadvantage of Spark SQL?
Explain DStream with reference to Apache Spark.
Define the term ‘sparse vector.’
What is DStream in Apache Spark Streaming?
Where can the metastore database be hosted?
Explain the different channel types in Flume.