Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are keywords?
Explain what is big data?
Is a distributed machine learning framework on top of spark?
Mention what is the difference between apache kafka and apache storm?
Which code do we use to open the connection in Hbase?
Mention some machine learning algorithms exposed by mahout?
Explain Hive Thrift server?
Can you explain indexing?
How does HDFS ensure Data Integrity of data blocks stored in HDFS?
What is application master in spark?
Can you explain rack awareness?
Explain cassandra.
Explain the RDD properties?
What do you understand by composite type?
Does Partitioner run in its own JVM or shares with another process?