Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Why do we use HDFS for applications having large data sets and not when there are lot of small files?
1 2842
Define the consistency levels for read operations in Cassandra?
What is the difference between persist() and cache()?
What do you understand by logging in cassandra?
What is the next step after Mapper or MapTask?
Why we use intwritable instead of int? Why we use longwritable instead of long?
What is difference between cache and persist in spark?
Is it necessary to know java to learn hadoop?
Can NameNode and DataNode be a commodity hardware?
Name the examples of some companies that are using hadoop structure?
What is a heartbeat in HDFS?
What are the ways to create RDDs in Apache Spark? Explain.
What are the different math functions available in Pig?
What are the DDL commands used in hbase?
How many partitions are created by default in Apache Spark RDD?
What is difference between spark and hadoop?