Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is spark machine learning?
What does rdd mean?
What are the tools used in big data?
What are the benefits of apache kafka over the traditional technique?
What is the default replication factor and how will you change it?
How to set mappers and reducers for Hadoop jobs?
What is heap memory in spark?
How will you explain COGROUP in Pig?
Illustrate some demerits of using Spark.
What are the different Primitive Data Types available in Hive?
Explain the difference between an inputsplit and a block?
How will format the HDFS ?
Why password is needed in ssh localhost?
What is hadoop framework?
Which java class handles the Input record encoding into files which store the tables in Hive?