Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the Cassandra Coefficient ?
Is it possible to leverage real-time analysis of the big data collected by Flume directly? If yes, then explain how?
What are the limitations of Spark?
How Hive organize the data?
Explain how jobtracker schedules a task?
How do you stop a spark?
What is a bookie in bookkeeper?
what are the most common input formats defined in Hadoop?
Define data lake?
Is hive an impala requirement?
Does the HDFS go wrong? If so, how?
How to restart NameNode or all the daemons in Hadoop HDFS?
What is BloomMapFile used for?
How hdfs is different from traditional file systems?
Mention what needs to be taken care while adding a column?