Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the difference between Reducer and Combiner in Hadoop MapReduce?
What are the storage supported by tajo?
What do you know about nlineinputformat?
How to load data in pig?
Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?
Explain bucketing in Hive?
What exactly kafka does?
What are the components of Apache Pig platform?
Is big data unstructured?
Is apache spark a database?
Why Flume?
Describe Replication Factor?
How many types of NoSQL databases are there?
What is the need of key-value pair to process the data in MapReduce?
explain Metadata in Namenode?