Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What does the following query do? Insert overwrite table employees partition (country, state) select ..., Se.cnty, se.st from staged_employees se;
921While loading data into a hive table using the load data clause, how do you specify it is a hdfs file and not a local file ?
730As part of optimizing the queries in hive, what should be the order of table size in a join query?
697
How to invoke Command Line Interface?
Can the region server will be located on all datanodes?
What is the difference between Primary, Partition and Cassandra ?
Kafka can be used for which kind of applications?
Which files are used by the startup and shutdown commands?
what are the different modes of Hive?
What does the “USE” command in the hive do?
Is it possible to run Apache Spark on Apache Mesos?
What are advantages of Spark over MapReduce?
What MapReduce framework consists of?
explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.
How would you import data from MYSQL into HDFS ?
What are the different ways of representing data in Spark?
Why MapReduce uses the key-value pair to process the data?
What are producers in kafka?