-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-14874][SQL][Streaming] Remove the obsolete Batch representation #12638
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #56798 has finished for PR 12638 at commit
|
|
Test build #56824 has finished for PR 12638 at commit
|
| private val fs = basePath.getFileSystem(sqlContext.sparkContext.hadoopConfiguration) | ||
|
|
||
| override def addBatch(batchId: Long, data: DataFrame): Unit = { | ||
| override def addData(batchId: Long, data: DataFrame): Unit = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't need to change this name. Its still a batch of data and one of the parameters is still named batch.
|
It's fine to remove the class, but lets avoid unneeded renaming. |
|
Sure, so I'm closing this PR since the removal itself is not worthy for committers to process. |
|
To be clear, if there's a completely unused class, I think it's worth the
|
|
@marmbrus thanks for the reminder! Since I've reverted the renaming, and I've checked there's no other completely unused class in package |
|
Test build #56957 has finished for PR 12638 at commit
|
|
some build issues unrelated to this PR |
|
Jenkins retest this please |
|
Test build #56996 has finished for PR 12638 at commit
|
| } | ||
|
|
||
| /** | ||
| * Returns the next batch of data that is available after `start`, if any is available. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This doc should be updated, following Source.getBatch()'s doc change from Returns the next batch of data that is available after start, if any is available. to Returns the data that is between the offsets (start, end].
|
just rebased to master to resolve some conflicts |
|
Test build #57073 has finished for PR 12638 at commit
|
|
Thanks, merging to master. |
What changes were proposed in this pull request?
The
Batchclass, which had been used to indicate progress in a stream, was abandoned by [SPARK-13985][SQL] Deterministic batches with ids and then became useless.This patch:
Batchclassdoes some related renaming(update: this has been reverted)How was this patch tested?
N/A