Explain the ZooKeeper workflow.
Explain the steps to follow when deploying a big data solution.
What checks should you perform before deploying a Hadoop instance?
What is a Distributed Cache in Hadoop?
What is /var/hadoop/pids used for?
What do you know about SequenceFileInputFormat?
Why is lazy evaluation good in Spark?
Can the NameNode and DataNode run on commodity hardware?
Explain the coalesce operation in Apache Spark.
How would you import data from MySQL into HDFS?
How does Impala compare to Hive and Pig?
What is data processing in big data?
Can you mention some features of Spark?
What is the use of the "void close()" method?
Define Partition and Partitioner in Apache Spark.