Define actions in Spark.
What is the Pregel API?
Explain the basic difference between Cassandra and HBase.
What are the limitations of Hive?
Does Pig give any warning when there is a type mismatch or missing field?
Can you explain Spark GraphX?
Compare HBase and RDBMS.
What are the limitations of importing RDBMS tables into HCatalog directly?
Explain the difference between the COUNT_STAR and COUNT functions in Apache Pig.
What is a Kafka cluster?
What are the key features of Apache Spark that you like?
How is streaming implemented in Spark?
What is setMaster in Spark?
What is the difference between an RDBMS and Hadoop?
State the usage of the 'FILTER', 'GROUP', 'ORDER BY', and 'DISTINCT' keywords in Pig scripts.