Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is standalone mode in Spark?
What is a row key?
What are the main features and characteristics of Hadoop that make it the most popular and powerful Big Data tool?
Can you define a UDF?
What is Spark SQL?
What is an HFile?
What do you understand by composite type?
What is the difference between an RDBMS and Hadoop?
What is the sequence file input format?
What is the Derby database?
How do you administer Hadoop?
What is a broadcast variable?
What does hadoop-env.sh do?
How does Hive serialize and deserialize data?
What is the role of the offset in Kafka?