Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the hadoop configuration files at present?
Is there any API available for implementing graphs in Spark?
What is the latest version of sqoop?
What is a pipelinedrdd?
Why is transformation lazy operation in Apache Spark RDD? How is it useful?
Define fold() operation in Apache Spark?
What are ‘reduces’?
Can you use Spark to access and analyse data stored in Cassandra databases?
Explain Alter Table Statement in HCatalog?
Is apache flume real time processing framework?
Why is cqlsh used?
What are the four essential parameters of a mapper?
Why do we need Hadoop Archives? How is it created?
How we can check hadoop sqoop installed or not in a system?
How can we change the split size if our commodity hardware has less storage space?