Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can an application connect to Hive when it runs as a server?
Explain the reduceByKey() operation in Spark.
What are the main configuration parameters in a MapReduce program?
How does Apache Spark work?
What needs to be taken care of when adding a column?
What is HDFS Federation?
How do I know if a Flume agent is running?
Can you define InputSplit in Hadoop?
How is reporting controlled in Hadoop?
What are the various data sources available in Spark SQL?
Explain the different types of repairs.
What is unstructured data?
Why does MapReduce use key-value pairs to process data?
Which relational operators in Pig Latin are used for combining and splitting data?
Is it necessary to write a MapReduce job in Java?