Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Different Running Modes of Apache Spark
Where can I get sample data to try?
What is Federation?
what is Speculative Execution?
What does heartbeat in hdfs means?
how would you modify that solution to only count the number of unique words in all the documents?
Is hadoop mandatory for spark?
What is non-dfs used in hdfs web console
What is the purpose of retention period in Kafka cluster?
How do sparks work?
What do you mean by shuffling and sorting in MapReduce?
Specify the different methods of hive?
did you maintain the hadoop cluster in-house or used hadoop in the cloud?
Name a few import control commands. How can Sqoop handle large objects?
How to set which framework would be used to run mapreduce program?