Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Differentiate between drop and truncate in cqlsh
How to perform the inter-cluster data copying work in HDFS?
What do you mean by Speculative execution in Apache Spark?
What are the file formats supported by spark?
Is it possible to use Apache Spark for accessing and analyzing data stored in Cassandra databases?
What is the concept of SuperColumn in Cassandra?
Why is Hive not suitable for OLTP systems?
Explain about tajo worker configuration?
How to change the replication factor of data which is already stored in HDFS?
What is master node in spark?
What are the usage of different consistency levels for write operations ?
How does apache spark work?
What is shuffling and sorting in Hadoop MapReduce?
Who is a 'user' in HDFS?
Explain what is the purpose of RecordReader in Hadoop?