Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are the default configuration files that are used in hadoop?
What are the key points of data model of Cassandra?
Explain the core benefits for hadoop users by using the apache ambari?
Explain the role of the offset?
Explain the repartition() operation in Spark?
What is RDD lineage graph? How does it enable fault-tolerance in Spark?
What are the various InputFormats in Hadoop?
Name the most common input formats defined in hadoop?
How can you use producer api code?
Explain when to use explode in Hive?
List the various types of "Cluster Managers" in Spark.
When to choose "External Table" in Hive?
explain the key features of Apache Spark?
How to read file in HDFS?
Compare hive, hbase, and impala?