Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between mahout and graphlab ?
What is Hadoop Distributed File System- HDFS?
What does /var/hadoop/pids do?
Does HDFS allow a client to read a file which is already opened for writing?
What are the different functions available in pig latin language?
What is Pig Storage?
Mention what is the data storage component used by hadoop?
Does 'ILLUSTRATE' run a MapReduce job?
What is decorating filters?
Explain the main difference between kafka and flume?
How do you integrate spark and hive?
If the source data gets updated every now and then, how will you synchronize the data in hdfs that is imported by sqoop?
How can you check all the tables present in a single database using Sqoop?
what are Task Tracker and Job Tracker?
What is the sequence of execution of map, reduce, recordreader, split, combiner, partitioner?