Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to change a number of mappers running on a slave in MapReduce?
Explain Spark map() transformation?
What is the difference between spark and scala?
How does an hadoop application look like or their basic components?
Explain the default level of parallelism in Apache Spark
What is inputsplit in hadoop? Explain.
How you can contact your client everyday ?
What is the key- value pair in Hadoop MapReduce?
Name the filter which accepts the page size as the parameter in hbase?
How can we create RDD in Apache Spark?
What happens when two users try to access to the same file in HDFS?
Why do the nodes are removed and added frequently in a hadoop cluster?
Explain transformation in rdd. How is lazy evaluation helpful in reducing the complexity of the system?
Explain the level of parallelism in Spark Streaming? Also, describe its need.
Hbase blocksize is configured on which level?