Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Is kafka an etl tool?
What are the parameters of mappers and reducers?
Name three data source available in SparkSQL
What are the main classes of Data Transfer API?
What is apache presto?
Explain write ahead log(journaling) in spark?
How many maximum jvm can run on a slave node?
What is the FlatMap Transformation in Apache Spark RDD?
How many types of rdd are there in spark?
What is hdfs in big data?
How do you overwrite replication factor?
How does lazy evaluation work in spark?
What is the role of the namenode?
How hbase uses zookeeper?
Can you join multiple fields in apache pig scripts?