Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain HCatLoader and HCatStorer APIs?
how would you modify that solution to only count the number of unique words in all the documents?
What is throughput? How does HDFS provide good throughput?
what is Cassandra- CQL collections?
How do you run pig scripts on kerberos secured cluster?
List the benefits of using Cassandra.
What is the difference between client mode and cluster mode in spark?
What are the benefits of apache tajo?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
What is the advantage of hadoop over java serialization?
Write the command to start and stop the spark in an interactive shell?
How to Write a UDF function in Hive?
Specify Cassandra’s importance on Facebook?
What is Hadoop serialization?
In which language Cassandra is written?