Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can free-form SQL queries be used with Sqoop import command? If yes, then how can they be used?
What is Hadoop HDFS – Hadoop Distributed File System?
What is spark tool in big data?
Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
What is serialization in spark?
Compare hbase vs hdfs?
How to keep HDFS cluster balanced?
What is a block and block scanner in HDFS?
What are the various levels of persistence in Apache Spark?
What is Reducer in MapReduce?
What are different logging levels in cassandra?
Give the sqoop command to see the content of the job named myjob?
How can I speed up my spark?
What is the meaning of speculative execution in Hadoop? Why is it important?
What do you understand by the super column in cassandra?