Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) what are the languages supported by apache spark for developing big data applications?
What is the role of a MapReduce partitioner?
What is Derby database?
What do you mean by meta information in hdfs? List the documents related to metadata.
When to use spark sql?
What is python spark?
Explain how indexing in hdfs is done?
What do you mean by replication strategy?
What is Cassandra?
Whenever we run hive query, new metastore_db is created. Why?
what is Bloom Filter is used for in Cassandra?
What are the different clustering in mahout?
What is lineage graph?
What Avro offers?
Explain what is a Hive variable. What do we use it for?