What are the most widely recognized input formats defined in Hadoop?
How can you use Akka with Spark?
What is the benefit of the distributed cache? Why can't we just keep the file in HDFS and have the application read it?
How is a keyspace created in Cassandra?
What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)
Explain the term 'Topic Replication Factor'.
What are the NameNode and DataNode in HDFS?
What are the various libraries available on top of Apache Spark?
Is Apache Spark a programming language?
Can we submit a MapReduce job from a slave node?
What are the different parts of Hive?
What is the difference between Hadoop and other data processing tools?
What are the various uses of explode in Hive?
Discuss the different tombstone markers used for deletion in HBase.
Can you explain ingestion in big data?