What is Hadoop?
Answer / sai krishna kesanapalli
Hadoop is an open source distributed processing framework that manages data processing and storing for big data applications running in clustered systems.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is a namenode? How many instances of namenode run on a hadoop cluster?
what factors the block size takes before creation?
How client application interacts with the NameNode?
Why we cannot do aggregation (addition) in a mapper? Why we require reducer for that?
Define streaming?
Explain InputFormat?
What is the default replication factor?
Which one is default InputFormat in Hadoop ?
What is Rack Awareness in Apache Hadoop?
Explain the features of pseudo mode?
On which port does ssh work?
What do masters consist of?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)