Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain about ACID transactions in Hive?
How is a keyspace created in cassandra?
Can you define yarn?
Why can we not create directory /user/dataflair/inpdata001 when name node is in safe mode?
Mention what is the number of default partitioner in Hadoop?
Where is Mapper output stored?
How should 'store' keyword is useful in pig scripts?
Define parquet file format? How to convert data to parquet format?
What are the file formats that Hive supports and can use be used for storage?
What are ‘reduces’?
What is spark repartition?
Tell something about the query language used in Cassandra Database?
Define consistency?
Why we use parallelize in spark?
What is Flume?