Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can you improve the throughput of a remote consumer?
What is Kafka?
Does Spark SQL use Hive?
When does the NameNode enter Safe Mode?
Why is Hive not suitable for OLTP systems?
How does NameNode tackle DataNode failures?
What are some advantages of Cassandra?
What is a composite type in Cassandra?
What is the traditional method of message transfer?
What is a UDF?
What are the port numbers of the NameNode?
What is a map in Pig?
What is the use of checkpoints in Spark?
What is the difference between persist() and cache()?
What are shuffling and sorting in MapReduce?