Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What file systems does spark support?
What do you know about keyvaluetextinputformat?
Explain hive udf function?
What is the task of Spark Engine
Explain data flow in Flume?
Where do you specify the Mapper Implementation?
Define catalog tables in HBase?
What is the difference between piglatin and hiveql?
Explain bagtotuple?
What is action, how it process data in apache spark
What is metastore?
You have a file personal_data.txt in the HDFS directory with 100 records. You want to see only the first 5 records from the employee.txt file. How will you do this?
What happens when an action is executed in spark?
How is recovery achieved in Ambari?
Is map like a pointer?