What are accumulators and broadcast variables in Spark?
List the various types of "Cluster Managers" in Spark.
What are the different types of partitioners in Cassandra?
What is the difference between the TextInputFormat and KeyValueTextInputFormat classes?
What are the different hdfs dfs shell commands for performing a copy operation?
What is TTL in HBase?
What are the other components of Cassandra?
Can we do real-time processing using Spark SQL?
There seem to be certain management tools in Cassandra. What are they?
How does HDFS achieve high throughput?
Can we have multiple entries in the master files?
Is it possible to use Apache Spark for accessing and analyzing data stored in Cassandra databases?
How do you use the CREATE DATABASE statement in Apache Tajo?
What type of data should we put in the distributed cache? When should we put data there, and how much data should we put in?
What is the disadvantage of Spark SQL?