Can Ambari manage multiple clusters?
If you omit the OVERWRITE clause while creating a Hive table, what happens to new files and to files that already exist?
Which language is more suitable for text analytics: R or Python?
What is the external shuffle service in Spark?
What is the default port of Presto?
Is Kafka a big data technology?
What are the benefits/advantages of Cassandra?
What is the use of the export command in Hadoop Sqoop?
What do you know about NLineInputFormat?
What are the components of a Hive query processor?
What are the tools used in big data?
What is a combiner, and where should you use it?
What are combiners, and when should you use a combiner in a MapReduce job?
Explain accumulators in Spark.
Compare RDBMS with Hadoop MapReduce.