Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain how Hive Deserialize and serialize the data?
What is mllib?
Is it possible to provide multiple input to Hadoop? If yes then how?
Why do we need a password-less ssh in fully distributed environment?
What is the key difference between NameNode and DataNode in Hadoop?
Explain how HDFS communicates with Linux native file system?
What is the difference between an inputsplit and a block?
Can you explain logistic regression?
What are the relational operators available related to loading and storing in pig language?
Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Define the consistency levels for read operations in Cassandra?
What problem does Apache Flume solve?
What are the uses and applications of mahout ?
What is the use of coordinator node in read?
What are Paired RDD?