Big Data Interview Questions
Questions Answers Views Company eMail

Why is BlinkDB used?

315

What is the advantage of a Parquet file?

340

What are the key features of Apache Spark that you like?

408

What do you understand by SchemaRDD?

330

How can you achieve high availability in Apache Spark?

455

What is a worker node?

471

Which companies use Apache Spark in production?

345

What is the difference between persist() and cache()?

335

Which Spark library allows reliable file sharing at memory speed across different cluster frameworks?

275

What does the Spark Engine do?

333

How does Spark use Akka?

332

How does Spark handle monitoring and logging in Standalone mode?

347

What is Hadoop serialization?

Capital One

750

Explain a simple Map/Reduce problem.

Capital One

786

Data Engineer: Given a list of followers in the format:
123, 345
234, 678
345, 123
…
where column one is the ID of the follower and column two is the ID of the followee, find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?

Twitter

787
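
One possible way to approach the mutual-follow question is sketched below. It is only an illustration, not an answer taken from this page: the function names (map_edge, reduce_pair, run_job) are hypothetical, and the shuffle step is simulated with an in-memory dictionary. On a real cluster the framework would perform the grouping on disk across machines, which is what lets the approach scale past memory. The idea is that the mapper emits each edge under an order-independent key (smaller ID, larger ID) together with its direction, and the reducer reports a key only when both directions were seen.

```python
from collections import defaultdict


def map_edge(line):
    """Map step: emit ((low_id, high_id), direction) for one 'follower, followee' line."""
    follower, followee = (int(part) for part in line.split(","))
    key = (min(follower, followee), max(follower, followee))
    # "fwd" means the smaller ID follows the larger one; "rev" is the opposite.
    direction = "fwd" if follower < followee else "rev"
    yield key, direction


def reduce_pair(key, directions):
    """Reduce step: a pair is mutual only if both directions were observed."""
    if {"fwd", "rev"} <= set(directions):
        yield key


def run_job(lines):
    """Local stand-in for the framework: map, group by key (shuffle), then reduce."""
    groups = defaultdict(list)
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        for key, value in map_edge(line):
            groups[key].append(value)
    mutual = []
    for key in sorted(groups):
        mutual.extend(reduce_pair(key, groups[key]))
    return mutual


if __name__ == "__main__":
    edges = ["123, 345", "234, 678", "345, 123"]
    print(run_job(edges))  # prints [(123, 345)]
```

Keying on the sorted pair means both directions of a relationship land on the same reducer, so each reducer only ever needs the handful of values for one pair rather than the whole list.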


Unanswered Questions { Big Data }

What job does the conf class do?

847


Who invented Spark?

423


How do I use Spark with big data?

313


What are the different methods to set up the local repository?

44


Can I install Spark on Windows?

297


Why does HDFS store data on commodity hardware despite the higher chance of failures?

44


What is a column family?

85


What is Sqoop?

5


Why do we use Flume?

85


What is the best practice to deploy the secondary name node?

495


What are the common input formats in Hadoop?

520


What is the difference between Pig Latin and Hive?

741


What is KeyValueTextInputFormat in Hadoop?

586


What do you understand by logging in Cassandra?

89


What is Spark SQLContext?

312