Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the different modes in which PIG can run and explain those?
How do I use spark with big data?
What is a spark standalone cluster?
Discuss how you can use filters in apache hbase
Explain how indexing is done in hdfs?
Explain what is a keyspace in Cassandra?
Define Nodetool Utility?
What are the features of spark rdd?
Can you define the process of creating ambari client?
What is CQL?
What is the default replication factor and how will you change it?
Explain the role of the offset?
Does spark require hdfs?
How can we create a hadoop cluster from scratch?
What are the key features of any nosql database?