Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain how HDFS communicates with Linux native file system?
Define fault tolerance?
Explain what is the role of the zookeeper?
Why is rdd immutable?
Explain Cqlsh?
Features of Kafka Stream?
What is the use of tracing cqlsh command in Cassandra?
Define ttl in hbase?
What are the main configuration parameters in a MapReduce program?
What is the difference between HDFS block and input split?
How will you merge the contents of two or more relations and divide a single relation into two or more relations?
In which kind of scenarios MapReduce jobs will be more useful than PIG in Hadoop?
What is identity mapper and chain mapper?
What is flatmap in apache spark?
Explain Hadoop Archives?