Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is replication in Kafka?
How can you use Akka with Spark?
How is data represented in Spark?
What is the purpose of button groups?
What is a composite key?
What are the benefits of Spark lazy evaluation?
How do you write your own custom SerDe?
What are DataFrames?
What are the different relational operators available in Pig Latin?
Explain the replicating and multiplexing channel selectors in Flume.
What are the components of Spark?
What is Kafka?
How many Reducers run for a MapReduce job?
How much space does an input split occupy in MapReduce?
What is a broker?