Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Is spark good for machine learning?
Explain what happens in text format?
How does master slave architecture in the hadoop?
How to perform the inter-cluster data copying work in HDFS?
Name the languages which are supported by apache spark and which one is most popular?
Is it possible to use Apache Spark for accessing and analyzing data stored in Cassandra databases?
What is the reason of using hbase?
What are the main hdfs-site.xml properties?
When to avoid secondary indexes?
What is Buckets in Hive?
What are the benefits/ advantages of Cassandra?
Why hive does not store metadata information in hdfs?
What is the functionality of jobtracker in hadoop? How many instances of a jobtracker run on hadoop cluster?
Do we need to give a password, even if the key is added in ssh?
Does HBase support SQL like syntax?