Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Can you define rdd lineage?
What is distinct clause in apache tajo?
How should you handle session_expired?
What is the use of spark sql?
What are the file formats supported by spark?
What is dynamic partitioning and when is it used?
Explain how message is consumed by consumer in Kafka?
Where is the output of Mapper written in Hadoop?
What do you mean by Schema Declaration?
What is the characteristic of streaming API that makes it flexible run MapReduce jobs in languages like Perl, Ruby, Awk etc.?
What is the biggest shortcoming of Spark?
what do you mean by data processing?
Define partitions in apache spark.
Give some points of hive for hadoop ?
Explain parquet file?