Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can we use Ambari Python Client to use of Ambari API’s?
What is spark reducebykey?
Define Partitions?
Can NameNode and DataNode be a commodity hardware?
Can sqoop use spark?
What are the restriction to the key and value class ?
What is the Internal Architecture of the Cassandra Database ?
What are different types of filesystem?
what is the default replication factor in HDFS?
What is a row in cassandra? And what are the different elements of it?
Can a spark cause a fire?
How can you minimize data transfers when working with Spark?
How do I know how many impala nodes are in my cluster?
Can we run spark on windows?
In hbase what is column families?