Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are complex data types in pig?
In Hive, how can you enable buckets?
When to use secondary indexes?
What are components of Cassandra Data Model?
Can you explain the common input formats in hadoop?
How can you execute a free-form SQL query in Sqoop to import the rows in a sequential manner?
Define fault tolerance?
Specify the different methods of hive?
What is the key difference between textfile and wholetextfile method?
What is azure spark?
How to optimize Hive Performance?
What are mapreduce new and old apis while writing map reduce program?. Explain how it works
What do you understand by an inner bag and outer bag in Pig?
What is decorating filters?
Which spark library allows reliable file sharing at memory speed across different cluster frameworks?