Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can an RDD be created in Spark?
Give examples of some companies that use Hadoop.
Explain how you can reduce churn in the ISR. When does a broker leave the ISR?
Explain the term "cluster".
What are ‘reduces’?
How does Cassandra delete data?
What are the limitations of Pig?
Explain about transformations and actions in the context of RDDs.
What do you understand by storage node and compute node?
When to use Hive?
How is Mahout used with Python?
What is the difference between the TextInputFormat and KeyValueTextInputFormat classes?
What is the meaning of "broker" in Kafka?
Why might the DataNode service fail to run when starting Hadoop services?
What is the default maximum number of dynamic partitions that can be created by a mapper or reducer? How can you change it?