Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How is impala metadata managed?
What is HBase HMaster?
Which one is the master node in HDFS? Can it be commodity hardware?
Is there any difference between HBase datamodel and RDBMS datamodel?
Give the data storage units in Cassandra?
Knox and Hadoop Development Tools?
What is the syntax of describe Command?
Explain Spark SQL caching and uncaching?
How to process data using Transformation operation in Spark?
Explain Spark leftOuterJoin() and rightOuterJoin() operation?
What is the concept of SuperColumn in Cassandra?
What is off heap memory in spark?
What do you mean by ss table and explain how it is different from the other original tables?
How to change a column data type in Hive?
Is it possible to rename the output file?