Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Doesn’t google have its very own version of dfs?
What is inputformat in hadoop?
What language is apache spark?
What size is recommended for each node?
Define Partitions?
Describe DataStaxOpsCenter?
Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Establish the difference between a node, cluster & data centres in Cassandra.
Explain the Features of HBase?
How does one create RDDs in Spark?
How do I download adobe spark?
How job tracker schedules an assignment?
Why do we use spark?
Explain the general mapreduce algorithm
What are the different modes in which we can configure/install Hadoop?