Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is structured and unstructured data?
Name a few import control commands. How can Sqoop handle large objects?
when hadoop enter in safe mode?
What is difference between coalesce and repartition?
How can you see the list of stored jobs in sqoop metastore?
What is a rack?
How to add/delete a Node to the existing cluster?
Which among the two is preferable for the project- Hadoop MapReduce or Apache Spark?
Do I need scala for spark?
What the information segments utilized by hadoop are?
What are the different types of tables available in Hive?
how will you implement SQL in Spark?
List few differences between apache kafka and rabbitmq?
Can we have multiple entries in the master files?
In the Producer, when does QueueFullException occur?