Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is CQLSH in Cassandra?
How do you create a custom key and custom value in a MapReduce job?
What are the key features of a NoSQL database?
What are broadcast variables in Apache Spark? Why do we need them?
How do you use the HDFS put command to transfer data from Flume to HDFS?
What is the difference between an RDD and a DataFrame in Spark?
What is ColumnFamily?
What are the types of tunable consistency?
What is indexing and why do we need it?
What is unstructured data?
What is the Cassandra database used for?
Can we do real-time processing using Spark SQL?
Can you define Sqoop in Hadoop?
Have you ever used counters in Hadoop?
Can you explain Spark MLlib?