What is the use of the help command in Hadoop Sqoop?
What is the abstraction of Spark Streaming?
In hadoop_pid_dir, what does pid stand for?
What is the difference between Hadoop and HDFS?
Can we broadcast an RDD?
Explain the process for starting a Kafka server.
What operations does an RDD support?
What is the difference between a node, a cluster, and a data centre?
What is Safemode in Apache Hadoop?
Is it legal to set the number of reducer tasks to zero? Where will the output be stored in this case?
What are the four essential parameters of a mapper?
While processing data from HDFS, does Hadoop execute code near the data?
What is JavaRDD?
Why should we use the 'group' keyword in Pig scripts?
Differentiate between FileSink and FileRollSink.