A Hive table partition has been modified to point to a new directory location. Do I have to move the data to the new location myself, or will it be moved automatically?
What do you understand by a cluster in Cassandra?
What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)
What are the features of pseudo-distributed mode?
Is it possible to search for files using wildcards?
What is the maximum size of string data type supported by Hive?
Name some components of Cassandra.
Name the Spark Library which allows reliable file sharing at memory speed across different cluster frameworks.
What is the HMaster in HBase?
What do you understand by a worker node?
Does Hadoop always require digital data for processing?
How can you add arbitrary key-value pairs in your mapper?
How does Cassandra perform a read operation?
How many layers of Hadoop components are supported by Apache Ambari and what are they?
Explain the concept of a Bloom filter.