Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What are the functions of presto?
What is dataframe api?
What is shuffle spill in spark?
Is it possible to share data files between different components?
What is the roadmap for apache driver version one.0?
Explain the term paired RDD in Apache Spark?
Can you give a detailed overview about the Big Data being generated by Facebook?
How you can remove the element with a critical present in any other Rdd is Apache spark?
What is write ahead log(journaling) in Spark?
Mention if we can name view same as the name of a Hive table?
Which command is used to list all the tables in a database or list all the columns in a table?
What is dag – directed acyclic graph?
how can you check whether Namenode is working beside using the jps command?
Define Actions.
On which port does ssh work?