Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is a commodity hardware? Does commodity hardware include RAM?
Explain HCatalog Architecture in Brief?
How many types of nosql databases?
Is it possible to run Apache Spark without Hadoop?
What is hbase fsck?
How ordering in hdfs is finished?
What is an Agent?
How can you set an arbitrary number of mappers to be created for a job in Hadoop?
While writing evaluate UDF, which method has to be overridden?
What are the various libraries available on top of Apache Spark?
What is lineage graph in Apache Spark?
What is the difference between logical and physical plans?
Define commit log?
What is a Hive variable? What for we use it?
Explain the role of Streams API?