Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What main configuration parameters are specified in mapreduce?
What are the components of a Hive query processor?
How to write 'foreach' statement for map datatype in pig scripts?
What is zookeeper in hadoop?
What is Spark SQL?
How is spark sql different from hql and sql?
What is difference between split and block in hadoop?
How to write a query in Cassandra?
Define streaming access?
Ideally what should be the replication factor in hadoop?
What are the different operational commands in HBase at record level and table level?
What is the function of UNION and SPLIT operators? Give examples?
Name some sources from where Spark streaming component can process real-time data?
What is Cassandra Data Modelling ?
Difference Between Apache Sqoop vs Flume?