Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Are job tracker and task trackers present in separate machines?
What is the role of recordreader in hadoop mapreduce?
What are configuration files in Hadoop?
Is it possible to use Apache Spark for accessing and analyzing data stored in Cassandra databases?
Explain the process to trigger automatic clean-up in Spark to manage accumulated metadata.
You have a file employee.txt in the hdfs directory with 100 records. You want to see only the first 10 records from the employee.txt file. How will you do this?
Explain in which directory hadoop is installed?
What the information segments utilized by hadoop are?
What are the main components of spark?’
Define the purpose of the partition function in mapreduce framework
How would you restart NameNode?
How can we change the split size if our commodity hardware has less storage space?
Explain the rudimentary difference between Cassandra and HBase?
What are partitions in cassandra?
Define standalone mode in hbase?