Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
442If we want to copy 10 blocks from one machine to another, but another machine can copy only 8.5 blocks, can the blocks be broken at the time of replication?
665
Whats the default port that jobtrackers listens ?
How does HDFS Index Data blocks? Explain.
Is Namenode machine same as DataNode machine as in terms of hardware?
What is the use of spark in big data?
What are the features of RDD, that makes RDD an important abstraction of Spark?
Define data integrity? How does hdfs ensure data integrity of data blocks stored in hdfs?
Explain the difference between an hdfs block and input split?
What is meant by rdd in spark?
What are the different clustering in mahout?
Is there an update statement?
Write command to copy a file from HDFS to linux(local).
What is Flume event?
Why spark is faster than hive?
What do you mean by the NameNode High Availability in hadoop?
what is (HS2) HiveServer2?