Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...



Big Data Interview Questions
Questions Answers Views Company eMail

Explain pipe() operation. How it writes the result to the standard output?

304

Explain transformation in rdd. How is lazy evaluation helpful in reducing the complexity of the system?

360

How to identify that given operation is transformation/action in your program?

303

explain the use of blinkdb?

317

How do you parse data in xml? Which kind of class do you use with java to parse data?

357

Explain parquet file?

307

What is lazy evaluation and how is it useful?

316

How is transformation on rdd different from action?

358

What is a dataset? What are its advantages over dataframe and rdd?

313

What is pagerank?

302

What is dag – directed acyclic graph?

320

Explain schemardd?

373

Describe coalesce() operation. When can you coalesce to a larger number of partitions? Explain.

349

When we create an rdd, does it bring the data and load it into the memory?

360

What does reduce action do?

293


Un-Answered Questions { Big Data }

How hbase uses zookeeper?

1


What is the importance of — the split-by clause in running parallel import tasks in sqoop?

5


What is flume used for?

85


Why do we need MapReduce during Pig programming?

718


Explain what is logging in Cassandra?

87


Give the sqoop command to see the content of the job named myjob?

5


What does serdes mean in apache kafka?

660


What is the default replication factor in Hadoop and how will you change it?

656


Why do we use persist () on links rdd?

316


List of the some best tools that can be useful for data-analysis?

483


Can you explain broadcast variables?

318


How does gossip protocol work?

81


What is shuffle read and shuffle write in spark?

326


Explain what do you understand by cassandra- cql collections?

85


What are 3 core dimension of big data?

604