Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Name the scalar data type and complex data types in Pig?
Can we create a hadoop cluster from scratch?
Is hadoop based on google mapreduce?
What are the different zkclientbindings?
What is the fundamental difference between a MapReduce InputSplit and HDFS block?
Is spark an etl?
What is the use of “resultset execute” method?
Why should I use spark?
Specify what the information segments utilized by hadoop are?
Why password is needed in ssh localhost?
Specify the different methods of hive?
Explain about the core components of a distributed Spark application?
What is Apache Avro?
How can we see all the hosts that are available in Ambari?
Can you change the block size of hdfs files?