Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Can you explain how you can use Apache Spark along with Hadoop?
What are the options-process for upgrading zookeeper?
How to change a number of mappers running on a slave in MapReduce?
What is big data in dbms?
Highlight the difference between group and Cogroup operators in Pig?
When is it not recommended to use MapReduce paradigm for large scale data processing?
Explain the various table design approaches in HBase?
Describe Accumulator in detail in Apache Spark?
What is the advantage of hadoop over java serialization?
What are the different String functions available in pig?
Define the level of parallelism and its need in spark streaming?
Can we rename the output file?
What do you mean by ZNode?
Can you define a block and block scanner in hdfs?
How message is consumed by consumer in kafka?