Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How many partitions are created by default in Apache Spark RDD?
Can you define pagerank?
What is spark sqlcontext?
what is Speculative Execution?
What operations does the "RDD" support?
How can you stop a partition form being queried?
Compare MapReduce and Spark?
Mention what is the hadoop mapreduce apis contract for a key and value class?
How to create a custom key and custom value in MapReduce Job?
How many Daemon processes run on a Hadoop system?
Name the most common input formats defined in hadoop?
Is spark better than mapreduce?
How will you write a custom partitioner for a Hadoop job?
What is 'jps'?
Can you explain how it is different from doing machine learning in r or sas?