Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the small file problem in Hadoop?
Please explain the sparse vector in Spark.
Explain the HCatLoader APIs.
What is the difference between Cassandra's schema and an RDBMS schema?
Discuss the role of the Spark driver in a Spark application.
What is vectorized query execution?
What is a JMX connector?
Can we use Kafka without ZooKeeper?
What is Spark accreditation?
What are the benefits of Apache Kafka over traditional techniques?
How will you connect Apache Spark with Apache Mesos?
What is the maximum size of a message that a Kafka server can receive?
List the various options available with the Hive command.
Can we submit a MapReduce job from a slave node?
What is the Tungsten engine in Spark?