Explain the execution plans of a Pig script, or differentiate between the logical and physical plans of an Apache Pig script?
You have a file, employee.txt, with 100 records in an HDFS directory. You want to see only the first 10 records from this file. How will you do this?
How is data or a file read in Hadoop HDFS?
Explain Spark Streaming with a socket source?
What is a message broker?
How many daemon processes run on a Hadoop system?
Can you list the limitations of using Apache Spark?
What do you know about SequenceFileInputFormat?
If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?
How can you send large messages with Kafka (over 15 MB)?
Cassandra is written in which language?
Does Cassandra support ACID transactions?
Explain the concept of compaction in Cassandra?
What do you understand by the super column in Cassandra?
What is the next step after the mapper or map task?
How much space will the split occupy in MapReduce?
What are the benefits of block transfer?