Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a connection_loss error?
What are the collection data types provided by CQL?
Explain what happens in TextInputFormat.
How are tables managed in Apache Tajo?
Why is Kafka a significant technology to use?
What are the key differences between Cassandra and a traditional RDBMS?
Why is Spark RDD immutable?
What is the difference between Spark and Hive?
How can you remove the elements with a key present in another RDD in Apache Spark?
What is the purpose of the ‘dump’ keyword in Pig?
What alternate way does HDFS provide to recover data in case a NameNode
How often does a DataNode send a heartbeat to the NameNode in Hadoop?
What is a key-value pair in MapReduce?
Explain the limitations of HBase.
In a very large text file, you want to check only whether a particular keyword exists. How would you do this using Spark?