Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Is impala intended to handle real time queries in low-latency applications or is it for ad hoc queries for the purpose of data exploration?
43
What is rack awareness in hadoop?
How can I restart namenode?
Can you join multiple fields in Apache
What do you know about the speculative execution?
Explain Spark map() transformation?
What daemons run on master nodes?
Explain cassandra.
Why do people use spark?
Is secondary namenode a substitute to the namenode?
Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in hadoop?
How to enable recycle bin or trash in hadoop?
How to restrict the number of lines to be printed in pig ?
How to Containerizing ZooKeeper With Docker?
What do you mean by Speculative execution in Apache Spark?
How do you run pig scripts on kerberos secured cluster?