Mention what is data cleansing?
No Answer is Posted For this Question
Be the First to Post Answer
Is it possible to rename the output file, and if so, how?
What is tasktracker in hadoop?
List of the some best tools that can be useful for data-analysis?
What is meant by streaming access?
What is the NameNode port number?
Can you explain record reader?
Mention what is the data storage component used by hadoop?
What happens if you get a ‘connection refused java exception’ when you type hadoop fsck /?
Hadoop achieves parallelism by dividing the tasks across many nodes, it is possible for a few slow nodes to rate-limit the rest of the program and slow down the program. What mechanism Hadoop provides to combat this?
Explain how jobtracker schedules a task?
Why Hadoop performs replication, although it results in data redundancy?
How would you check whether your NameNode is working or not?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)