Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the difference between mahout & mllib?
Explain various Apache Spark ecosystem components. In which scenarios can we use these components?
Why is spark good?
How can you connect an application
How to perform the inter-cluster data copying work in HDFS?
Why HDFS?
What is the use of illustrate in pig?
Is ambari python client can be used to make good use of ambari api’s?
Can spark work without hadoop?
Define parquet file format? How to convert data to parquet format?
What do you understand by Lazy Evaluation?
How tasks are created in spark?
How to open a connection in hbase?
What are different String functions available in PIG?
When do you have to avoid secondary indexes?