How many JobTracker processes can run on a single Hadoop cluster?
Can the Ambari Python client be used to work with the Ambari APIs?
What are the consistency levels for write operations in Cassandra?
What is Spark SQL?
According to IBM, what are the three characteristics of Big Data?
How does Hive serialize and deserialize data?
How can you reduce churn in the ISR? When does a broker leave the ISR?
What is JavaRDD?
Does Google use Spark?
How is the Mapper instantiated in a running job?
Does Impala support user-defined functions (UDFs)?
In which modes can Hadoop run?
What operations does an RDD support?
In which major areas does Ambari help system administrators?
What are the functions of the JobTracker?