Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the difference between Spark SQL and Hive.
Explain edge nodes in hadoop?
How can you use adminclient api?
Can you explain combiner?
What is cqlsh? And why is it used?
Can you tell us more about ssh?
What is the difference between external table and managed table?
Why HDFS stores data using commodity hardware despite the higher chance of failures?
What is a combiner in hadoop?
Which database the sqoop metastore runs on?
Explain sum(), max(), min() operation in Apache Spark?
Describe DataStaxOpsCenter?
Define Partitions?
What is spark shuffle?
What are the main benefits of using cassandra?