Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain the different logging levels in cassandra.
What file systems does spark support?
Name three data source available in SparkSQL
What are the operational commands of HBase?
In a given spark program, how will you identify whether a given operation is Transformation or Action ?
What will you do when NameNode is down?
How do I get apache spark on windows 10?
Mention what is the use of Context Object?
What is the default database provided by Apache Hive for metastore?
What do you mean by Thrift in Cassandra?
What is difference between spark and mapreduce?
Explain the concept of Leader and Follower?
What are the design goals of zookeeper?
how would you modify that solution to only count the number of unique words in all the documents?
Explain how indexing in hdfs is done?