Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Does google use spark?
Explain how big is ‘big data’?
What bit version that ambari needs and also list out the operating systems that are compatible?
What is closing out ledgers?
What is the purpose of textinputformat?
What are the file formats supported by spark?
Can you define udf?
I want to see the present working directory in UNIX from hive. Is it possible to run this command from hive?
What are the 2 modes used to run pig scripts?
What is cluster in apache spark?
What is the Internal Architecture of the Cassandra Database ?
Suppose that your data is stored in collections, for instance, some binary data, message data or metadata is all keyed on the same value. Will you use HBase for this?
Which spark library allows reliable file sharing at memory speed across different cluster frameworks?
What is the difference between DAG and Lineage?
Where are Hadoop’s configuration files located?