Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain what is the row key?
How can hive avoid mapreduce?
Can you explain hadoop streaming?
Which language is best for spark?
What are shared variables?
Why do we need Hadoop Archives? How is it created?
What is pregel api?
How does hadoop achieve fault tolerance?
What are the different UDF’s in Pig?
Explain the difference between mahout & mllib?
How to resolve ioexception: cannot create directory, while formatting namenode in hadoop?
How does impala compare to hive and pig?
What are the three components of Cassandra write?
Explain some Advantages of Avro?
Is it possible to have hadoop job output in multiple directories?