Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is spark a programming language?
What is the difference between piglatin and hiveql?
Why the output of map tasks are stored (spilled ) into local disc and not in hdfs?
In a very huge text file, you want to just check if a particular keyword exists. How would you do this using Spark?
What is dag – directed acyclic graph?
What are the machine learning algorithms supports in apache mahout?
How data or file is read in Hadoop HDFS?
What do you understand by node in cassandra?
How to add column in apache tajo?
Tell something about the query language used in Cassandra Database?
Explain the basic difference between traditional rdbms and hadoop?
How are file systems checked in hdfs?
Define replication strategy?
What is a block in HDFS, why block size 64MB?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?