Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
what is a cluster in cassandra?
What are spark stages?
Can you explain edge nodes in hadoop?
Explain InputFormat?
why should we use 'group' keyword in pig scripts?
What is a topic in kafka?
What is apache spark for beginners?
What are the advantages of pig language?
What do you understand by sstabl in cassandra?
What is the core of the job in MapReduce framework?
How hdfs is different from traditional file systems?
What are the Optimizations a developer can use during joins?
what is pig?
What are the relational operators available related to combining and splitting in pig language?
what are the main configuration parameters that user need to specify to run Mapreduce Job ?