Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
Explain use cases where SequenceFile class can be a good fit?
Explain about the core components of Flume?
Can you explain indexing?
Define Cassandra?
How do I clear my spark cache?
What are the different ways of executing Pig script?
Why do we use spark?
Explain first() operation in Apache Spark?
How to create and manage a view in HCatalog?
What do you mean by metadata in Hadoop?
Explain the term Cluster?
Explain some Disadvantages of Avro?
What is commodity hardware?
Explain what is a sequence file in hadoop?