Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
When NameNode enter in Safe Mode?
I have a relation r. How can I get the top 10 tuples from the relation r?
What is mapper in map reduce?
Can I do transforms or add new functionality?
Explain partitions?
Elaborate kafka architecture?
Explain Spark SQL caching and uncaching?
What is the difference between local and remote metastore?
How Sqoop can be used in a Java program?
What are Guarantees provided by Kafka?
is it necessary to install Spark on all nodes while running Spark application on Yarn?
What are ‘maps’ and ‘reduces’?
Why are spark transformations lazy?
Describe HDFS Federation?
Define replication strategy?