Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Explain Machine Learning library in Spark?
Explain what is wal and hlog in hbase?
What is a pipelinedrdd?
In which scenario Pig is better fit than MapReduce?
How can data transfer be minimized when working with Apache Spark?
What is the reason for creating a new metastore_db whenever Hive query is run from a different directory?
name few other popular column oriented databases like hbase.
Explain some Disadvantages of Avro?
What is secondary namenode?
Where can I get sample data to try?
How is indexing done in HDFS?
What are the core components of Hadoop?
What is a hive in big data?
If map reduce is inferior to spark then is there any benefit of learning it?
What does a Spark Engine do?