Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
In hadoop_pid_dir, what does pid stand for?
Explain the shuffle.
Can you explain sequence files in Hadoop?
Name three data sources available in Spark SQL.
What is HDFS High Availability?
Which machine learning algorithms does Apache Mahout support?
Which relational operators are available for loading and storing data in Pig Latin?
What are the functions of "Spark Core"?
What are the problems with small files in HDFS?
List a few benefits of Spark over MapReduce.
What is the NameNode in Hadoop?
What are accumulators and broadcast variables in Spark?
Explain what the master class and the output class do.
Explain the properties of RDDs.
Give some key points about Hive for Hadoop.