Explain the write-ahead log (journaling) in Spark.
What are the three characteristics of big data according to IBM?
Why should we use Presto?
What is a transformation in Spark?
What is Hive composed of?
Can you tell us how many daemon processes run on a Hadoop system?
How is HCatalog different from Hive?
What is an accumulator in Spark?
Is Hive a requirement for Impala?
If there is certain data that we want to use again and again in different transformations, what should we do to improve performance?
Is it necessary to know Java to learn Hadoop?
Compare Apache Hadoop and Apache Spark.
What is cqlsh, and why is it used?
What is distributed copy (DistCp)?
What is a partition in Hive?