Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?
Can you explain spark core?
State the usage of 'filters', 'group' , 'orderBy', 'distinct' keywords in pig scripts?
What does the file hadoop-metrics.properties do?
Explain how cassandra writes changed data into commitlog?
Does spark use zookeeper?
Explain pigdump function?
What are the different life cycle commands in ambari?
What is the heartbeat used for?
What is Reducer in Hadoop?
What is Cassandra-Cqlsh?
How do I get better performance with spark?
Give the command to see the indexes on a table?
Explain tokenize?