Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) How do you run pig scripts on kerberos secured cluster?
What does ambari shell can provide?
What do you know about sequencefileinputformat?
Is spark built on top of hadoop?
What is the use of ycsb?
Explain Accumulator in Spark?
Where sorting is done on mapper node or reducer node in MapReduce?
How to change the replication factor of data which is already stored in HDFS?
What is tungsten in spark?
Can we run unix shell commands from the hive? Give example?
What is the use of MasterServer?
What is CTE Table in Hive?
explaine wal in hbase?
What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)
What is the prerequisite for Apache Hive installation?