On what basis name node distribute blocks across the data nodes?
No Answer is Posted For this Question
Be the First to Post Answer
How can one increase replication factor to a desired value in Hadoop?
What happens in a textinputformat?
What is Disk Balancer in Apache Hadoop?
How to come out of the insert mode?
Explain what is the purpose of RecordReader in Hadoop?
Command to format the NameNode?
Virtual Box & Ubuntu Installation?
What is HDFS - Hadoop Distributed File System?
What are the two main components of ResourceManager?
What is crontab? Explain with suitable example?
Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
What is the communication channel between client and namenode/datanode?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)