Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Why HDFS stores data using commodity hardware despite the higher chance of failures in hadoop?
How are sparks created?
what is the difference between order by and sort by in Hive?
What is the major difference between local and remote meta-store?
What is spark deploy mode?
Where is the Mapper Output intermediate kay-value data stored ?
Where is spark used?
Mention what needs to be taken care while adding a column?
How to set property in apache tajo?
How to specify more than one directory as input in the Hadoop MapReduce Program?
After the Map phase finishes, the Hadoop framework does 'Partitioning, Shuffle and sort'. Explain what happens in this phase?
What are the window functions provided by apache tajo?
Mention what job does the conf class do?
What are consumers in kafka?
What is spark driver application?