Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Does impala performance improve as it is deployed to more hosts in a cluster in much the same way that hadoop performance does?
164What is RDD in Apache Spark? How are they computed in Spark? what are the various ways in which it can create?
320What role does worker node play in Apache Spark Cluster? And what is the need to register a worker node with the driver program?
346What is the reason behind Transformation being a lazy operation in Apache Spark RDD? How is it useful?
480
Explain the various types of partitioners in cassandra?
What is a keyspace in Cassandra?
how can you identify whether a given operation is transformation or action?
How we can take Hadoop out of Safe Mode?
How the SSTable is different from other relational tables?
Elaborate on CQL?
What is row in hbase?
Explain the fundamental difference between Cassandra and Hadoop?
Define sparksession in apache spark? Why is it needed?
What is the difference between an input split and hdfs block?
Explain first() operation in Spark?
Establish the difference between a node, cluster & data centres in Cassandra.
What is difference between cache and persist in spark?
Name different elements of JConsole?
What are the advantages of DataFrame?