How will you implement joins in HBase?
What do you know about partitions in Kafka?
In which language is Apache Kafka written?
Explain the architecture of Hadoop Pig.
What are the driver and executor in Spark?
How is an RDD in Spark different from distributed storage management?
When should you use --target-dir and when should you use --warehouse-dir while importing data?
How is machine learning implemented in Spark?
What is the Grunt shell?
Can we write a MapReduce program in a programming language other than Java? If so, how?
Why is Cassandra popular? Clarify.
Can the NameNode and DataNode run on commodity hardware?
What is the default input type/format in MapReduce?
Explain the Spark map() transformation.
What is Cloudera and why is it used?