Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the default maximum dynamic partition that can be created by a mapper reducer? How can you change it?
727The partition of hive table has been modified to point to a new directory location. Do I have to move the data to the new location or the data will be moved automatically to the new location?
1005
Define a column family?
What is the use of checkpoints in spark?
Can we run spark on windows?
How one can format Hadoop HDFS?
What is the difference between nas (network attached storage) and hdfs?
What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop Cluster?
How to specify more than one directory as input to the MapReduce Job?
Define fold() operation in Apache Spark?
Name the languages which are supported by apache spark and which one is most popular?
Define "Action" in Spark
What is the difference between Hadoop and RDBMS?
How namenode handles data node failures?
Explain the role of the offset?
What is the difference between sort by and order by in hive?
What does connector api in kafka?