Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain how the Hadoop classpath plays a vital role in starting or stopping Hadoop daemons.
What is the difference between client mode and cluster mode in Spark?
Which distributed machine learning framework runs on top of Spark?
What is the use of the help command in Apache Sqoop?
Where can I get sample data to try?
How can you reduce churn in the ISR? When does a broker leave the ISR?
What is Apache Pig used for?
Can we set a different replication factor for existing files in HDFS?
Define streaming access.
How will you list all the columns of a table using Apache Sqoop?
What needs to be taken care of while adding a column?
What are the three modes in which Hadoop can run?
Define a pair RDD in Apache Spark.
What are the different types of filesystems?
Is Apache Spark a framework?