What are the types of RDD transformations in Apache Spark?
What is the maximum message size that a Kafka server can receive?
What is a tuple?
What is the default database provided by Apache Hive for metastore?
Is cache an action in Spark?
Explain the Hadoop file system and processing framework.
Explain how RDDs work with Scala in Spark.
How do you write a UDF in Hive?
How is Hadoop different from other data processing tools?
Can you explain Apache Kafka?
Does HBase support a SQL-like syntax? Yes or no?
What are some alternatives to Apache Kafka?
What alternative way does HDFS provide to recover data in case a NameNode
How can you use Akka with Spark?
Explain the Spark coalesce() operation.