Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the differences between PIG and HIVE?
What is the difference between an hdfs block and input split?
Explain the use of tasktracker in the hadoop cluster?
How can we drop a table in HCatalog?
Mention what is distributed cache in hadoop?
Can the region server will be located on all datanodes?
Is it possible to split 100 lines of input as a single split in MapReduce?
How to call impala built-in functions?
How would an hadoop administrator deploy various components of hadoop in production?
Explain the level of parallelism in Spark Streaming? Also, describe its need.
How to load data into table created in hive ?
Explain first() operation in Apache Spark RDD?
Can you define rdd lineage?
What is spark database?
What is skew data?