Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Is impala intended to handle real time queries in low-latency applications or is it for ad hoc queries for the purpose of data exploration?
43
What problem does Apache Flume solve?
What are the different types of partitioners in cassandra?
How is streaming implemented in spark?
What is lambda in spark?
Does the archiving of hive tables give any space saving in hdfs?
Do we require two servers for the namenode and the datanodes?
Can you explain apache kafka?
What is the use of truncate command?
What are ‘reduces’?
What are the different functions available in pig latin language?
How can you add a new partition for the month December in the above partitioned table?
How to create and manage a view in HCatalog?
Explain the uses of Map Reduce in Pig?
What is keyspace in Cassandra?
How can we create rdds in apache spark?