Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is spark vectorization?
What are the different tasks we can perform managing host using ambari host tab?
Define compaction in HBase?
Why is block size set to 128 MB in Hadoop HDFS?
What is difference between spark and scala?
Can you define serde in hive?
How should 'store' keyword is useful in pig scripts?
What is a column family?
Main Components of Hadoop?
When to use Cassandra?
What is metastore?
What is configured in /etc/hosts and what is its role in setting Hadoop cluster?
What is the role of JDBC driver in Sqoop?
How will you explain COGROUP in Pig?
Explain the usage of Context Object?