Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?
77What does the following query do? Insert overwrite table employees partition (country, state) select ..., Se.cnty, se.st from staged_employees se;
964While loading data into a hive table using the load data clause, how do you specify it is a hdfs file and not a local file ?
770
What is hadoop framework?
Can you explain logistic regression?
What is NameNode and DataNode in HDFS?
Hadoop Libraries and Utilities and Miscellaneous Hadoop Applications?
What are the benefits of using Spark with Apache Mesos?
Explain how does hadoop classpath plays a vital role in stopping or starting in hadoop daemons?
explain the use of blinkdb?
What is the difference between a node, a cluster, and data centre?
How do I know how many impala nodes are in my cluster?
Mention what are the most common input formats defined in hadoop?
What is Cassandra Data Modelling ?
Does the hdfs client decide the input split or namenode?
Talk about the concept of tunable consistency in Cassandra.
How does lazy evaluation work in spark?
What is a relation in Pig?