What is Hadoop?
Answer / sai krishna kesanapalli
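Hadoop is an open-source distributed processing framework that manages data storage and processing for big data applications running on clusters of machines. Its core components are HDFS for distributed storage, YARN for resource management, and MapReduce for batch processing.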
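To make "distributed processing" a bit more concrete, here is a rough sketch of the map, shuffle, and reduce steps behind Hadoop's classic MapReduce model, simulated in plain Python on a single machine. It does not use any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration, and a real cluster would run the map and reduce steps in parallel across many nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, roughly what happens between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence (hypothetical single-process driver)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data big cluster", "data node"]))
```

Each map call only sees one line and each reduce call only sees one key, which is why the work can be split across machines.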
What is the purpose of RecordReader in Hadoop?
What is the use of the Context object?
On what basis does the NameNode decide which DataNode to write to?
Do we need to give a password even if the key has been added in SSH?
Is Hadoop required for data science?
What problems can be addressed by using ZooKeeper?
Explain the features of standalone (local) mode.
Can we refer to VMs as pseudos?
What mechanism does the Hadoop framework provide to synchronize changes made in the distributed cache during the runtime of the application?
Did you maintain the Hadoop cluster in-house or use Hadoop in the cloud?
What will you do when the NameNode is down?
What is the HDFS block size, and what did you choose in your project?