Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How do I start a spark cluster?
Why is the spark so fast?
Explain different transformation on DStream?
What is the use of shutdown command?
How will you perform the inter cluster data copying work in hdfs?
What are different hdfs dfs shell commands to perform copy operation?
In which scenario Pig is better fit than MapReduce?
What are Flume core components?
Explain SparkContext in Apache Spark?
What is a TaskInstance?
Comparison between Secondary NameNode and Checkpoint Node in Hadoop?
what is Memtable in Cassandra?
why should we use 'group' keyword in pig scripts?
Explain how can you check whether namenode is working beside using the jps command?
What are the features of kafka?