Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?
Explain the Use of Hive?
Explain the different types of repairs.
What is the spark driver?
When we write a= load …, what does 'a' called?
can you explain about configuration files?
What do you mean by the high availability of a namenode? How is it achieved?
Compare Hadoop 2 and Hadoop 3?
What do you understand by Kundera?
Explain what is heartbeat in hdfs?
How is streaming implemented in spark?
Explain how to write the output into a file using storm?
Explain the concept of bloom filter?
What is the difference between external table and managed table?
Name three companies which is used Spark Streaming services