Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Write a query to insert a new column (new_col int) into a Hive table (htab) at a position before an existing column (x_col).
Name some of the complex data types that Avro supports.
How can you check whether the NameNode is working, besides using the jps command?
What do you understand about YARN?
Name the three layers that Ambari supports.
What are some typical functions of the JobTracker in Hadoop?
Do we need to give a password even if the key is added in SSH?
Is Spark better than Hadoop?
How can you compare Hadoop and Spark in terms of ease of use?
Explain the different types of transformations on DStreams.
What are the features of Spark?
Define memtable in Cassandra.
What is the importance of the split-by clause in running parallel import tasks in Sqoop?
What is the JobTracker, and what does it do in a Hadoop cluster?
Explain what a memtable is in Cassandra.