[SPARK-14874][SQL][Streaming] Remove the obsolete Batch representation #12638

lw-lin · 2016-04-23T11:51:10Z

What changes were proposed in this pull request?

The Batch class, which had been used to indicate progress in a stream, was abandoned by [SPARK-13985][SQL] Deterministic batches with ids and then became useless.

This patch:

removes the Batch class
~~does some related renaming~~ (update: this has been reverted)
fixes some related comments

How was this patch tested?

N/A

SparkQA · 2016-04-23T13:14:20Z

Test build #56798 has finished for PR 12638 at commit c79cba9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

lw-lin · 2016-04-24T02:34:59Z

@marmbrus @tdas would you mind taking a look? Thanks! :-)

SparkQA · 2016-04-24T03:57:25Z

Test build #56824 has finished for PR 12638 at commit c79cba9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2016-04-25T17:52:25Z

sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSink.scala

  private val fs = basePath.getFileSystem(sqlContext.sparkContext.hadoopConfiguration)

-  override def addBatch(batchId: Long, data: DataFrame): Unit = {
+  override def addData(batchId: Long, data: DataFrame): Unit = {


We don't need to change this name. Its still a batch of data and one of the parameters is still named batch.

marmbrus · 2016-04-25T17:52:56Z

It's fine to remove the class, but lets avoid unneeded renaming.

lw-lin · 2016-04-26T02:18:53Z

Sure, so I'm closing this PR since the removal itself is not worthy for committers to process.
@marmbrus thanks for the review!

marmbrus · 2016-04-26T03:38:48Z

To be clear, if there's a completely unused class, I think it's worth the
time to delete it (dead code is confusing for people trying to learn the
code base).
On Apr 25, 2016 7:20 PM, "Liwei Lin" [email protected] wrote:

Closed #12638 #12638.

—
You are receiving this because you were mentioned.
Reply to this email directly or view it on GitHub
#12638 (comment)

lw-lin · 2016-04-26T04:08:05Z

@marmbrus thanks for the reminder!

Since I've reverted the renaming, and I've checked there's no other completely unused class in package o.a.s.sql.execution.streaming, seems this is ready to go (pending tests). So would you mind take another look? Thanks!

SparkQA · 2016-04-26T04:12:36Z

Test build #56957 has finished for PR 12638 at commit 14e6900.

This patch fails to build.
This patch merges cleanly.
This patch adds no public classes.

lw-lin · 2016-04-26T04:27:18Z

some build issues unrelated to this PR

lw-lin · 2016-04-26T11:46:21Z

Jenkins retest this please

SparkQA · 2016-04-26T13:58:10Z

Test build #56996 has finished for PR 12638 at commit 14e6900.

This patch fails Spark unit tests.
This patch does not merge cleanly.
This patch adds no public classes.

…-batch

lw-lin · 2016-04-27T01:56:30Z

sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala

  }

  /**
-   * Returns the next batch of data that is available after `start`, if any is available.


This doc should be updated, following Source.getBatch()'s doc change from Returns the next batch of data that is available after start, if any is available. to Returns the data that is between the offsets (start, end].

lw-lin · 2016-04-27T01:57:41Z

just rebased to master to resolve some conflicts

SparkQA · 2016-04-27T03:20:40Z

Test build #57073 has finished for PR 12638 at commit 653fa52.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2016-04-27T17:24:40Z

Thanks, merging to master.

remove the useless Batch class

c79cba9

lw-lin closed this Apr 23, 2016

lw-lin changed the title ~~[SPARK-14874][SQL][Streaming] Cleanup the useless Batch class~~ [SPARK-14874][SQL][Streaming] Remove the obsolete Batch representation Apr 24, 2016

lw-lin reopened this Apr 24, 2016

marmbrus reviewed Apr 25, 2016
View reviewed changes

lw-lin closed this Apr 26, 2016

lw-lin added 2 commits April 26, 2016 12:03

revert renaming

60aaf97

revert renaming

14e6900

lw-lin reopened this Apr 26, 2016

Merge remote-tracking branch 'refs/remotes/origin/master' into remove…

653fa52

…-batch

lw-lin reviewed Apr 27, 2016
View reviewed changes

asfgit closed this in a234cc6 Apr 27, 2016

lw-lin deleted the remove-batch branch April 28, 2016 09:46

[SPARK-14874][SQL][Streaming] Remove the obsolete Batch representation #12638

[SPARK-14874][SQL][Streaming] Remove the obsolete Batch representation #12638

Uh oh!

Conversation

lw-lin commented Apr 23, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

SparkQA commented Apr 23, 2016

Uh oh!

lw-lin commented Apr 24, 2016

Uh oh!

SparkQA commented Apr 24, 2016

Uh oh!

marmbrus Apr 25, 2016

Choose a reason for hiding this comment

Uh oh!

marmbrus commented Apr 25, 2016

Uh oh!

lw-lin commented Apr 26, 2016

Uh oh!

marmbrus commented Apr 26, 2016

Uh oh!

lw-lin commented Apr 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Apr 26, 2016

Uh oh!

lw-lin commented Apr 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lw-lin commented Apr 26, 2016

Uh oh!

SparkQA commented Apr 26, 2016

Uh oh!

lw-lin Apr 27, 2016

Choose a reason for hiding this comment

Uh oh!

lw-lin commented Apr 27, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Apr 27, 2016

Uh oh!

marmbrus commented Apr 27, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lw-lin commented Apr 23, 2016 •

edited

Loading

lw-lin commented Apr 26, 2016 •

edited

Loading

lw-lin commented Apr 26, 2016 •

edited

Loading

lw-lin commented Apr 27, 2016 •

edited

Loading