Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the operational commands of HBase?
What do you know about schemardd?
how can we change Replication Factor?
what Hive is composed of ?
What is RDD in Apache Spark? How are they computed in Spark? what are the various ways in which it can create?
What are the side effects of not running a secondary name node?
How can we create znodes?
How the read operation is performed on Cassandra node ?
List some use cases where classification machine learning algorithms can be used.
What do you understand by unit and ()in scala?
Can you explain rack awareness?
How to create hadoop archive?
Explain how cassandra writes data?
In which kind of scenarios MapReduce jobs will be more useful than PIG in Hadoop?
What does ‘jps’ command do?