Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is dataframe in spark?
What is the latest version of Ambari that is available in the market?
What is data replication in Cassandra?
What is the use of MasterServer?
When creating an RDD, what goes on internally?
What is the roadmap for apache mahout version 1.0?
Is there any benefit of learning MapReduce, then?
Clarify how ordering in hdfs is finished?
Which is the best hadoop certification?
Give the difference between Drop and Truncate in CQLSH?
What are the limitations of importing RDBMS tables into Hcatalog directly?
How do I change hive execution engine to spark?
How tables are managed in apache tajo?
Explain bagtotuple?
What is the maximum size of string data type supported by Hive?