Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
List out the difference between textFile and wholeTextFile in Apache Spark?
What are the different Data Types available in Hive?
Explain what do you understand by cassandra- cql collections?
Explain jsonloader, jsonstorage functions in pig?
Why slaves limited to 4000 in hadoop version 1?
Do we need to install spark in all nodes?
What do you mean by Schema Declaration?
What is the process to change the files at arbitrary locations in HDFS?
how indexing in HDFS is done?
How does gossip protocol work?
What is a namenode in hadoop?
Is there an api for implementing graphs in spark?
Do I need scala for spark?
Explain how cassandra writes changed data into commitlog?
Discuss writeahead logging in Apache Spark Streaming?