Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Distinguish HDFS Block and Input Unit?
What is active and passive NameNode in Hadoop?
What is the default database provided by Apache Hive for metastore?
What is spark dynamic allocation?
How does one create RDDs in Spark?
What is the syntax of describe Command?
How are sparks created?
Why would nosql be better than using a sql database? And how much better is it?
How do you stop a spark?
Explain the lookup() operation in Spark?
Can you define parquet file?
How tasks are created in spark?
Is databricks an etl tool?
How does inputsplit in mapreduce determines the record boundaries correctly?
What is Spark Core?