Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is difference between memory channel and file channel in flume?
What is Pig Latin?
Name different types of primary keys in Cassandra?
Are spark dataframes immutable?
What is a Block Scanner in HDFS?
What is rdd in spark with example?
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?
What do you understand by standalone (or local) mode?
What is the difference between Reducer and Combiner in Hadoop MapReduce?
In a very huge text file, you want to just check if a particular keyword exists. How would you do this using Spark?
Can we run unix shell commands from hive? Can hive queries be executed from script files? How? Give an example.
Explain various cluster manager in Apache Spark?
What are some of the characteristics of Hadoop framework?
Differentiate between FileSink and FileRollSink?
How is the option in Hadoop to skip the bad records?