Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the process to trigger automatic clean-up in Spark to manage accumulated metadata.
How does Cassandra write data?
What is an accumulator in Spark?
How does Spark use Akka?
Explain the difference between an input split and an HDFS block.
Explain Apache Kafka.
In which language is the Ambari Shell developed?
What is a UDF in Pig?
What are the most common InputFormats in Hadoop?
How many partitions are created by default in an Apache Spark RDD?
Is Spark part of the Hadoop ecosystem?
How is Hadoop different from Spark?
Which file systems does Spark support?
Explain the difference between Gen1 and Gen2 Hadoop with regard to the NameNode.
What do you mean by schema on read?