What does the following query do? INSERT OVERWRITE TABLE employees PARTITION (country, state) SELECT ..., se.cnty, se.st FROM staged_employees se;
What does Impala use for authentication?
Can you explain the PigStorage function?
How do you stop a running job gracefully?
What alternative way does HDFS provide to recover data if a NameNode without a backup fails and cannot be recovered?
Why do we need Apache Spark?
How do you use the UPPER or UCASE function in Hive? Give an example.
Why does Hive not store metadata information in HDFS?
Can you explain the heartbeat in HDFS?
What is Kundera in Cassandra?
What happens if you alter the block size of a column family on an already occupied database?
What is salting in Spark?
Is Spark Streaming real-time?
Do we need to install Spark on all nodes?
What is a table-generating function in Hive?