Remove more legacy Runner v1 cruft. #27512

robertwb · 2023-07-14T22:39:34Z

Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
Update CHANGES.md with noteworthy changes.
If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

See CI.md for more information about GitHub Actions CI.

robertwb · 2023-07-14T22:39:47Z

R: @tvalentyn

github-actions · 2023-07-14T22:40:54Z

Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control

codecov · 2023-07-14T22:54:12Z

Codecov Report

Merging #27512 (c65c703) into master (41e6628) will decrease coverage by 0.56%.
Report is 304 commits behind head on master.
The diff coverage is 76.00%.

@@            Coverage Diff             @@
##           master   #27512      +/-   ##
==========================================
- Coverage   71.16%   70.61%   -0.56%     
==========================================
  Files         861      860       -1     
  Lines      104547   103875     -672     
==========================================
- Hits        74401    73350    -1051     
- Misses      28597    28976     +379     
  Partials     1549     1549

Flag	Coverage Δ
python	`79.61% <76.00%> (-0.76%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Changed	Coverage Δ
...on/apache_beam/runners/portability/flink_runner.py	`59.61% <50.00%> (ø)`
...on/apache_beam/runners/portability/spark_runner.py	`67.34% <50.00%> (ø)`
...apache_beam/runners/portability/portable_runner.py	`74.74% <71.42%> (-1.11%)`	⬇️
sdks/python/apache_beam/transforms/environments.py	`87.70% <72.00%> (-0.67%)`	⬇️
sdks/python/apache_beam/runners/runner.py	`85.41% <93.75%> (+30.81%)`	⬆️
sdks/python/apache_beam/portability/python_urns.py	`100.00% <100.00%> (ø)`
...ache_beam/runners/portability/expansion_service.py	`91.83% <100.00%> (ø)`

... and 49 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

robertwb · 2023-07-18T21:01:00Z

Run Python_PVR_Flink PreCommit

tvalentyn

noting that some tests & lint failed on latest snapshot.

tvalentyn · 2023-07-18T21:49:30Z

sdks/python/apache_beam/transforms/environments.py

+@Environment.register_urn(python_urns.EMBEDDED_PYTHON_LOOPBACK, None)
+class PythonLoopbackEnvironment(EmbeddedPythonEnvironment):
+  """Used as a stub when the loopback worker has not yet been started."""
+  def to_runner_api_parameter(self, context):


Should we add the typehint? I think it might be: # type: (PipelineContext) -> typing.Tuple[str, message.Message]

tvalentyn · 2023-07-21T17:18:59Z

sdks/python/apache_beam/runners/runner.py

+    )
+
+    # TODO: https://github.com/apache/beam/issues/19168
+    # portable runner specific default


we can and plan to make this a default for dataflow as well: #26996

to follow up, Dataflow runner no longer stages SDK from pypi and expects containers to have it.

tvalentyn · 2023-07-21T17:22:34Z

sdks/python/apache_beam/runners/runner.py

+    if options.view_as(SetupOptions).sdk_location == 'default':
+      options.view_as(SetupOptions).sdk_location = 'container'
+
+    return self.run_full_pipeline(


What is the semantic distinction between run_pipeline vs run_full_pipeline? It sounds like run_pipeline could run exectute subgraphs, but it calls into run_full_pipeline, which is supposed to run the entire graph.

Mostly it's just a type distinction, but for backwards compatibility and the fact that only names (not type signatures) are used to resolve methods in Python I needed to call it something different. (IIRC, the old version could execute subgraphs at some point, I don't know if anyone uses that capability anymore.)

i see, thanks. the only alternative that comes to mind is run_portable_pipeline(), but not sure if that would be a better name.

Sorry for crashing, but I got tripped up by this already when writing code on top of these changes. I think any of run_portable_pipeline / run_pipeline_proto / run_pipeline_from_proto would be a bit clearer

cc: @robertwb

Legacy runners can still override the Pipeline-object-taking run_pipeline method, but this now has a default implementation. As part of this it was necessary to refactor environments to make loopback less of a special case.

robertwb · 2023-08-04T16:20:22Z

Ping on this @tvalentyn

tvalentyn · 2023-08-09T18:21:04Z

Run Python_Integration PreCommit 3.11

github-actions bot added the python label Jul 14, 2023

Remove more legacy Runner v1 cruft.

b333268

robertwb force-pushed the no-v1-runner branch 4 times, most recently from 2587133 to d59749f Compare July 18, 2023 20:23

tvalentyn reviewed Jul 21, 2023

View reviewed changes

Make runner entrypoint more portable.

27c7cb3

Legacy runners can still override the Pipeline-object-taking run_pipeline method, but this now has a default implementation. As part of this it was necessary to refactor environments to make loopback less of a special case.

robertwb force-pushed the no-v1-runner branch from c4624dc to 27c7cb3 Compare July 28, 2023 17:08

tvalentyn approved these changes Aug 9, 2023

View reviewed changes

Rename run_full_pipeline to run_portable_pipeline.

c65c703

robertwb merged commit 1755dd5 into apache:master Aug 9, 2023
76 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove more legacy Runner v1 cruft. #27512

Remove more legacy Runner v1 cruft. #27512

robertwb commented Jul 14, 2023

robertwb commented Jul 14, 2023

github-actions bot commented Jul 14, 2023

codecov bot commented Jul 14, 2023 •

edited

Loading

robertwb commented Jul 18, 2023

tvalentyn left a comment

tvalentyn Jul 18, 2023

robertwb Jul 21, 2023

tvalentyn Jul 21, 2023

robertwb Jul 21, 2023

tvalentyn Aug 9, 2023 •

edited

Loading

tvalentyn Jul 21, 2023

robertwb Jul 21, 2023

tvalentyn Jul 21, 2023

hjtran Jul 28, 2023

tvalentyn Aug 9, 2023

robertwb commented Aug 4, 2023

tvalentyn commented Aug 9, 2023

Remove more legacy Runner v1 cruft. #27512

Remove more legacy Runner v1 cruft. #27512

Conversation

robertwb commented Jul 14, 2023

GitHub Actions Tests Status (on master branch)

robertwb commented Jul 14, 2023

github-actions bot commented Jul 14, 2023

codecov bot commented Jul 14, 2023 • edited Loading

Codecov Report

robertwb commented Jul 18, 2023

tvalentyn left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tvalentyn Aug 9, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

robertwb commented Aug 4, 2023

tvalentyn commented Aug 9, 2023

codecov bot commented Jul 14, 2023 •

edited

Loading

tvalentyn Aug 9, 2023 •

edited

Loading