Why is the rack awareness algorithm used in Hadoop?
How do I stop a Flume agent?
How does HDFS achieve good throughput?
Which language is not supported by Spark?
Is it possible to provide multiple inputs to Hadoop? If so, how can you give multiple directories as input to a Hadoop job?
Name various types of Cluster Managers in Spark.
What are the features of Standalone (local) mode?
What is lambda in Spark?
Which would you choose for a project: Hadoop MapReduce or Apache Spark?
State some DDL commands with brief descriptions.
What are the differences between relational databases and Impala?
Explain Thrift and Protocol Buffers vs. Avro.
Which channel type is faster in Flume?
What is the role of data transfer API in HCatalog?
What are ‘reduces’?