Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Why do we need buckets?
What is the difference between the external table and managed table?
How much space will the split occupy in Mapreduce?
What are the design goals of zookeeper?
How to create RDD?
Explain the core methods of the reducer?
What is the use of context object?
What does conf.setmapper class do?
What is the use of cassandra and why to use cassandra?
What is the default extension of the files produced from a sqoop import using the –compress parameter?
Name some companies that are already using Spark Streaming?
How do I install spark?
RLIKE in Hive?
What is the difference between apache mahout and cloudera oryx ?
Big data is a good life?