Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the advantages of using mapreduce with hadoop?
Wherever (Different Directory) I run the hive query, it creates new metastore_db, please explain the reason for it?
Explain MemStore?
What are the various configuration parameters required to run a mapreduce job?
How is the splitting of file invoked in Hadoop ?
How to configure hadoop to reuse JVM for mappers?
Define standalone mode in hbase?
What daemons run on master nodes?
Explain the features of stand alone (local) mode?
Define data cleansing?
What are the port numbers of namenode?
Define Nodetool Utility?
What is the logistic regression?
what is composite type in cassandra?
What are Actions? Give some examples.