Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Say what the views are in hive?
Define a worker node?
What are some alternatives to apache kafka?
What is rack awareness in hadoop?
What happens to a namenode, when job tracker is down?
How is streaming implemented in spark? Explain with examples.
Explain the Parquet File format in Apache Spark. When is it the best to choose this?
Why is Apache Spark faster than Apache Hadoop?
What is Apache Spark Machine learning library?
Pig Features ?
What happens when we submit a spark job?
What are problems with small files and hdfs?
What do you understand by the term snitch in cassandra? Give some example.
Can we install spark on windows?
How many instances of tasktracker run on a hadoop cluster?