Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Give some points of pig for hadoop ?
Explain join() operation in Apache Spark?
What are configuration files in Hadoop?
Describe how hbase uses zookeeper?
Which components are used for stream flow of data?
How does hdfs provides good throughput?
What is the difference between a hadoop database and relational database?
What is hbase in hadoop?
What are common spark ecosystems?
How we can change Replication factor when Data is on the fly?
What is a block in HDFS? what is the default size in Hadoop 1 and Hadoop 2? Can we change the block size?
What is troubleshooting for impala?
What problem does Apache Pig solve?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
What is the need of key-value pair to process the data in MapReduce?