Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the term SSTable?
Explain Reliability and Failure Handling in Apache Flume?
Explain Apache Kafka?
What does the map transformation do? Provide an example.
What is a DStream?
What are the common mistakes developers make while using Apache Spark?
Write a query to insert a new column (new_col int) into a Hive table (htab) at a position before an existing column (x_col).
What tools are needed to build Ambari?
What is a ZooKeeper cluster?
Explain Alter Table Statement in HCatalog?
What is the relationship between Job and Task in Hadoop?
Is Hadoop the future?
Why is MapReduce output written to local disk?
What is HBase HMaster?
What is the distributed cache in the MapReduce framework?