What is the connection_loss error?
What is the use of the import-all-tables command in Hadoop Sqoop?
What are some advantages of Cassandra?
What is YARN in Hadoop?
What are the possible job roles?
How do you get a single file as the output of a MapReduce job?
What is the use of a DataFrame in Spark?
Differentiate between DROP and TRUNCATE in cqlsh.
What are the disadvantages of Apache Kafka?
What is Spark in Python?
What is used to store data generally?
What are some applications of HBase?
What are accumulators and broadcast variables in Spark?
Explain the HCatStorer APIs.
What is the difference between Caching and Persistence in Apache Spark?