Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is namenode?
Which java class handles the output record encoding into files which result from Hive queries?
What is action, how it process data in apache spark
Is kafka a amqp?
What are the major differences between Hadoop 2 and Hadoop 3?
What do you mean by Persistence?
Where is apache spark used?
What is the relationship between apache hadoop, hbase, hive and cassandra?
Explain the composite key?
What are file permissions in HDFS and how HDFS check permissions for files or directory?
Define a record reader?
Can you explain logistic regression?
What is Spark Streaming?
List the advantage of Parquet file in Apache Spark?
What are the common faults of the developer while using Apache Spark?