Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How many compaction types are in HBase?
Explain how do ‘map’ and ‘reduce’ work?
How can you avoid importing tables one-by-one when importing a large number of tables from a database?
Can you explain about the cluster manager of apache spark?
What is a Seed Node in Cassandra ?
How much space will the split occupy in Mapreduce?
What language is apache kafka written in?
What is Rack awareness?
How is Ambari different from ZooKeeper?
How does cassandra perform read operation?
What does spark do during speculative execution?
How is the option in Hadoop to skip the bad records?
Name some Big Data products?
how to proceed to write your first mapreducer program?
Which is the reliable channel in Flume to ensure that there is no data loss?