Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the default replication factor in Hadoop and how will you change it?
What are file permissions in HDFS and how HDFS check permissions for files or directory?
Tell any two feature Flume?
What is spark vs scala?
The partition of hive table has been modified to point to a new directory location. Do I have to move the data to the new location or the data will be moved automatically to the new location?
What are the functionalities of jobtracer?
What is key-value store db?
Why aggregation cannot be done in Mapper?
List the advantage of Parquet files?
Can Hadoop be compared to NOSQL database like Cassandra?
What are transformations in spark?
Explain the use of .mecia class?
What are Paired RDD?
Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?
What does conf.setmapper class do?