Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a tuple?
How did you debug your Hadoop code ?
What is a primary key? And what are it’s different types?
If the hadoop administrator needs to make a change, which configuration file does he need to change?
How many instances of JobTracker can run on a Hadoop Cluser?
What do you understand by the parquet file?
What is the role of the offset.
What are the collection data types provided by CQL?
What is the purpose of context object?
Clarify what webdav is in hadoop?
What is the primary objective of NoSQL databases?
What is spark pipeline?
What is job tracker role in hadoop?
How would an hadoop administrator deploy various components of hadoop in production?
What is Counter in MapReduce?