Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does inputsplit in mapreduce determines the record boundaries correctly?
List out the different stream grouping in apache storm?
In how many ways RDDs can be created? Explain.
Can you execute Hadoop dfs Commands from Hive CLI? How?
How can you schedule a sqoop job using Oozie?
Does spark need hdfs?
Is apache spark a tool?
Can we do online transactions(oltp) using hadoop?
Is reduce-only job possible in Hadoop MapReduce?
Is map like a pointer?
Can free-form SQL queries be used with Sqoop import command? If yes, then how can they be used?
What is the difference between Spark Transform in DStream and map ?
Why are Replications critical in Kafka?
Name different types of the data model?
What does producer api in kafka?