Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) List the benefits of Spark over MapReduce.
What is used to store data generally?
What other technologies have you used in hadoop sta ck?
Explain HCatStorer APIs?
You have a file personal_data.txt in the HDFS directory with 100 records. You want to see only the first 5 records from the employee.txt file. How will you do this?
What is executor memory and driver memory in spark?
Explain HCatOutputFormat?
While writing evaluate UDF, which method has to be overridden?
Can a partition be archived? What are the advantages and Disadvantages?
Do streamers make money from sparks?
How many compaction types are in HBase?
Virtual Box & Ubuntu Installation?
Explain api create or replace tempview()?
Explain first() operation in Apache Spark?
What are the relational operators available related to loading and storing in pig language?