Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain any 3 features of HBase.
How many types of NoSQL databases are there?
What is a skewed join?
State the usage of the 'filter', 'group', 'order by', and 'distinct' keywords in Pig scripts.
What are the Pig execution modes?
Describe the different consistency levels for read operations in Cassandra.
Explain the architecture of Apache Pig.
What do you understand by Unit and () in Scala?
What is a Combiner?
What are the issues associated with the map and reduce slot-based mechanism in MapReduce?
How many filters are available in HBase?
Explain the Parquet file format in Apache Spark. When is it best to choose it?
What is a node in Cassandra?
Can you tell us how many daemon processes run on a Hadoop system?
Which is better for Spark: Scala or Python?