Name three data sources available in Spark SQL.
What are the components of the Impala architecture?
What are the basics of the ZooKeeper API?
Suppose a 514 MB file is stored in HDFS (Hadoop 2.x) using the default block size and the default replication factor. How many blocks will be created in total, and what will be the size of each block?
Explain the Reducer's sort phase.
What are the main features of SPM in Cassandra?
What are the best features of Apache Sqoop?
What does RDD stand for?
What do you understand by compaction?
What is a sequence file in Hadoop?
What is an RDD?
Why do we use Apache Kafka?
What is a cluster in Apache Spark?
Why is HBase a schema-less database?
Explain the different channel types in Flume.
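
For the 514 MB HDFS question above, the arithmetic can be sketched as follows. This is only an illustration, assuming the Hadoop 2.x defaults of a 128 MB block size (`dfs.blocksize`) and a replication factor of 3 (`dfs.replication`). The helper name `hdfs_blocks` is hypothetical and not part of any Hadoop API.

```python
def hdfs_blocks(file_mb, block_mb=128, replication=3):
    """Split a file size (in MB) into HDFS-style blocks.

    Assumes Hadoop 2.x defaults: 128 MB blocks, replication factor 3.
    Returns the list of logical block sizes and the total number of
    block replicas stored across the cluster.
    """
    full_blocks, remainder = divmod(file_mb, block_mb)
    # The last block only occupies as much space as the data left over.
    sizes = [block_mb] * full_blocks + ([remainder] if remainder else [])
    return sizes, len(sizes) * replication


sizes, replicas = hdfs_blocks(514)
print(sizes)     # [128, 128, 128, 128, 2]
print(replicas)  # 15
```

Under these assumptions the file is split into 5 blocks (four of 128 MB and one of 2 MB), and with three copies of each block the cluster stores 15 block replicas in total.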