Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is spark tool in big data?
What are the features of RDD, that makes RDD an important abstraction of Spark?
Explain keys() operation in Apache spark?
What is spark table?
Can you define udf?
Can you tell us more about ssh?
What do you understand by Commit log in Cassandra?
What are the components of Apache Spark Ecosystem?
How to use Apache Zookeeper command line interface?
What is the advantage of using –password-file rather than -P option while preventing the display of password in the sqoop import statement?
What is the use of combiners in the hadoop framework?
Explain the difference between mapreduce engine and hdfs cluster?
What is lambda in spark?
What are the components of Hive architecture?
What are the main features of hdfssite.xml?