Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Why we cannot do aggregation (addition) in a mapper? Why we require reducer for that?
Can you define parquet file?
Can you list down the limitations of using Apache Spark?
Explain about the indexing process in hdfs?
When a large data set is maintained?
Define various running modes of apache spark?
How is jmx useful in cassandra?
What are the names of daemons in impala?
How does executor work in spark?
What is Hadoop Custom partitioner ?
In cloudera there is already a cluster, but if I want to form a cluster on ubuntu can we do it?
State the limitations of Apache Pig?
What is the relationship between Hadoop, HBase, Hive and Cassandra ?
What is the command to start and stop the Spark in an interactive shell?
How much Metadata will be created on NameNode in Hadoop?