Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What can skew the mean?
What is the number of default partitioner in hadoop?
Is it possible to iterate through the rows of HBase table in reverse order?
Is it possible to leverage real-time analysis of the big data collected by Flume directly? If yes, then explain how?
Explain Apache Ambari architecture?
Why is flume used?
What is heartbeat in hadoop?
What is the block size in Hadoop?
What do you understand by bloom filter in cassandra?
Which command is available to show the current HBase user?
Do I need to know scala to learn spark?
How to process data using Transformation operation in Spark?
What is Cassandra Data Modelling ?
What is spark context spark session?
What is difference between split and block in hadoop?