Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the default file format to import data using Apache Sqoop?
Map reduce jobs are failing on a cluster that was just restarted. They worked before restart. What could be wrong?
What are the components of presto architecture?
what daemons run on a master node and slave nodes?
What types of costs are associated with creating the index on hive tables?
How to create custom key and custom value in MapReduce Job?
Is it necessary to kill the topology while updating the running topology?
Explain pigdump function?
How to load data into table created in hive ?
Why should we use presto?
Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?
What is spark tool?
Explain InputSplit in Hadoop?
Define the term thrift
What is a dstream in apache spark?