Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
how can we access the sub directories recursively?
Is piglatin a strongly typed language? If yes, then how did you come to the conclusion?
What is big data spark?
Mention what is the number of default partitioner in Hadoop?
Can we deploy job tracker other than name node?
How is security achieved in Apache Hadoop?
What is the key difference between NameNode and DataNode in Hadoop?
How does a client read/write data in HDFS?
Map reduce jobs take too long. What can be done to improve the performance of the cluster?
What is the difference between Reducer and Combiner in Hadoop MapReduce?
How Big is ‘Big Data’?
What is HBase Shell?
What are the ways in which Apache Spark handles accumulated Metadata?
What is hadoop? Name the main components of a hadoop application?
How do I check my spark status?