Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
730Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then a review of a restaurant states "Good food, bad service," your score might be 1 - 1 = 0.
734Suppose that your data is stored in collections, for instance, some binary data, message data or metadata is all keyed on the same value. Will you use HBase for this?
162
What is hadoop sqoop?
What is Spark SQL?
While starting hadoop services, datanode service is not running?
What is project tungsten in spark?
Where can I find impala documentation?
Explain Working of MapReduce?
Explain the features of pseudo mode?
How blocks are distributed among all data nodes for a particular chunk of data?
Is avro supported?
How is Flume-NG different from Flume 0.9?
How many instances of a jobtracker run on hadoop cluster?
What is a DStream?
When should you use sequencefileinputformat?
Explain NameNode and DataNode in HDFS?
How many types of ambari repositories are available?