Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is meant by spark in big data?
Which language is best for spark?
Define Nodetool Utility?
What is spark deploy mode?
When would you use hbase?
How many InputSplits is made by a Hadoop Framework?
What are transformations in spark?
Does cloudera offer a vm for demonstrating impala?
If datanodes increase, then do we need to upgrade namenode?
Can you explain the common input formats in hadoop?
Does spark use hive?
What is compute and Storage nodes?
What does impala do for fast access?
What is a distributed cache in mapreduce framework?
Explain the common input formats in hadoop?