What roles do Replicas and the ISR play?
What are the essential Hadoop tools that improve Big Data performance?
What is the difference between an RDBMS and Hadoop?
Does Pig differ from MapReduce? If yes, how?
What problems have you faced while working on Hadoop code?
Whenever we run a Hive query, a new metastore_db is created. Why?
You have a file personal_data.txt in an HDFS directory with 100 records. You want to see only the first 5 records from this file. How will you do this?
How is it different from doing machine learning in R or SAS?
What is the purpose of validation in Sqoop?
In MapReduce, why does the map task write its output to local disk instead of HDFS?
What is a DataNode?
Why does Hadoop perform replication even though it results in data redundancy?
What is the difference between an input split and an HDFS block?
On what basis can you differentiate RDD, DataFrame, and Dataset?
Have you ever used counters in Hadoop?