Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Is spark built on top of hadoop?
What does consumer api in kafka?
Whether the output of mapper or output of partitioner written on local disk?
What is a Column family in hbase?
Which command do we use to run HBase Shell?
Can you explain record reader?
Why we need compression and what are the different compression format supported?
What is the unit of data that flows through a flume agent?
What is Apache Spark Machine learning library?
What is CTAS Table in Hive?
For a Hadoop job, how will you write a custom partitioner?
Are spark dataframes distributed?
Does Apache Sqoop have a default database?
Who invented hadoop?
In which directory hadoop is installed?