Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain a simple Map/Reduce problem.
What are the various uses of explode hive?
What if a namenode has no data?
What do you mean by column family in Cassandra?
Explain jsonloader, jsonstorage functions in pig?
how can we access the sub directories recursively?
List the advantage of Parquet file in Apache Spark?
What is driver memory and executor memory in spark?
Is apache spark a tool?
What is the communication channel between client and namenode/datanode?
Define Nodetool Utility?
Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?
What's rdd?
Why HCatalog?
Explain the default level of parallelism in Apache Spark