Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are Scala and Spark?
What is a Mapper? How can we compress Mapper output in Hadoop?
How is 0xdata's H2O different from Apache Mahout?
What is the best hardware configuration to run Hadoop?
What is the difference between the NameNode and a DataNode in Hadoop?
Is Java required for Spark?
What is the next step after Mapper or MapTask?
Describe the coalesce() operation. When can you coalesce to a larger number of partitions? Explain.
How does Cassandra write changed data into the commit log?
What is one of the best features of Kafka?
What are some of the complex data types that Avro supports?
Is Hadoop open source?
What are the main components of the Cassandra data model?
What do you understand by compute and storage nodes?
How is a message consumed by a consumer in Kafka?