Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Can you explain indexing?
What is HBase HMaster?
Is there any API available for implementing graphs in Spark?
what are the most common input formats defined in Hadoop?
What are the types of hive ddl commands?
Specify some uses of HBase?
How indexing is done in HDFS?
What is the difference between an inputsplit and a block?
What is Cassandra Data Modelling ?
Why is spark so fast?
What is HDFS block size and what did you chose in your project?
What is cqlsh?
What is distinct clause in apache tajo?
How to change the replication factor of data which is already stored in HDFS?
What is the Reducer used for?