Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
how can you identify whether a given operation is transformation or action?
What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?
What is the difference between rdd and dataframe in spark?
What is spark shuffle service?
Can you explain textinformat?
What is difference between reducer and combiner?
How to Delete file from HDFS?
How does Mappers run method works?
How does rdd work in spark?
How to call impala built-in functions?
Explain combiners.
What are sink processors?
What is the purpose of Hive Driver?
What is spark client?
Explain jsonloader, jsonstorage functions in pig?