Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
what are the different modes of Hive?
What are different Hive commands available for hive and beeline CLI?
What is bloom filter?
What is wal and hlog in hbase?
How tasks are created in spark?
How to create hadoop archive?
What is the basic difference between traditional RDBMS and Hadoop?
What is hadoop? Name the main components of a hadoop application?
Why do I have to use refresh and invalidate metadata, what do they do?
Tell something about the query language used in Cassandra Database?
Can we run spark on windows?
What is a block in HDFS? what is the default size in Hadoop 1 and Hadoop 2? Can we change the block size?
How do you process big data with spark?
What is flume interceptor?
What is the difference between python and spark?