Home
About Us
Contact Us
Bookmark
Saved Bookmarks
Current Affairs
General Knowledge
Chemical Engineering
UPSEE
BSNL
ISRO
BITSAT
Amazon
ORACLE
Verbal Ability
→
Programming
→
ASP Dot net Interview Questions in Programming
→
What is the difference between coalesce and repart...
1.
What is the difference between coalesce and repartition in Spark?
Answer»
Coalesce
Repartition
It is used for definitely decreasing the number of partitions used in a Dataframe.
This method can decrease or increase the number of partitions used in a Dataframe.
It
USES
the existing partitions to minimize the
AMOUNT
of data being shuffled in a Dataframe.
It just creates
NEW
partitions and while doing a full shuffle.
The partitions through this method are of variable sizes.
The partitions in this method are
ROUGHLY
the same sizes.
Show Answer
Discussion
No Comment Found
Post Comment
Related InterviewSolutions
What is memoryview object in Python?
What are Closures in Python?
What do you mean by duck typing? How does Python implement duck typing?
Give a brief overview of various data compression and archiving APIs in Python’s standard library.
What are the characteristics of Counter object in Python?
How are warnings generated in Python?
What is the difference between assertion and exception? Explain with the help of suitable Python examples.
Python script written using Python 2.x syntax is not compatible with Python 3.x interpreter. How can one automatically port Python 2.x code to Python 3.x?
Give a brief comparison of vars() and dir() built-in functions in Python’s standard library.
What are hashing algorithms? How can we obtain md5 hash value of a string in Python?
Reply to Comment
×
Name
*
Email
*
Comment
*
Submit Reply
Your experience on this site will be improved by allowing cookies. Read
Cookie Policy
Reject
Allow cookies