Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are the basic steps to writing a UDF Function in Pig?
How to debug Hadoop code?
What makes Apache Spark good at low-latency workloads like graph processing and machine learning?
Write a short note on the disadvantages of mapreduce
What is mandatory while creating a table in cassandra?
Define the level of parallelism and its need in spark streaming?
What is Fault Tolerance?
What are the ways in which Apache Spark handles accumulated Metadata?
Explain what are the tools used in Big Data?
What is the key- value pair in Hadoop MapReduce?
Explain what is “map” and what is "reducer" in hadoop?
Can you explain how you can use Apache Spark along with Hadoop?
What is the functionality of jobtracker in hadoop?
Mention how many operational commands in hbase?
Can you change the block size of hdfs files?