Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the key points of the Cassandra data model?
What are the benefits of lazy evaluation?
What is the Spark tool in big data?
What are channel selectors?
Does Apache Sqoop have a default database?
For a Hadoop job, how will you write a custom partitioner?
Explain accumulators in Apache Spark.
What is data skew and how do you fix it?
What are the MapReduce types and formats, and how do you set up a Hadoop cluster?
Can HBase run without Hadoop?
What is a task instance in Hadoop? Where does it run?
What do you know about partitions in Kafka?
Does Spark use MapReduce?
What is Kafka technology?
Can you explain the indexing process in HDFS?