Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What does apache spark do?
What do you understand from Node redundancy and is it exist in hadoop cluster?
Give the data storage units in Cassandra?
Why can we not create directory /user/dataflair/inpdata001 when name node is in safe mode?
Difference between cassandra and mongodb?
What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?
How to format the HDFS? How frequently it will be done?
What do you understand by the term snitch in cassandra?
How can we see all the clusters that are available in Ambari?
In which kind of scenarios MapReduce jobs will be more useful than PIG in Hadoop?
What do you understand by hive?
What services run after running hbase job?
What is Flatten?
HCatalog helps to Integrate Hadoop with everything. Explain?
How can you use streams api?