Unify operator logging on self.log by pankajastro · Pull Request #2681 · astronomer/astronomer-cosmos

pankajastro · 2026-05-14T15:53:39Z

PR #1108 established the convention that operator code uses self.log (Airflow's LoggingMixin) so messages land in the per-task-instance log shown in the Airflow UI, while library code uses cosmos.log.get_logger for branding and scoped log levels. PR #1079 deviated from that convention in three call sites because pytest's caplog could not capture self.log at the time; #1157 was filed to track cleaning up the workaround.

Current pytest/Airflow does capture self.log -- the neighbouring assertions in tests/operators/test_virtualenv.py already match self.log calls in the same file. Convert the three holdover sites to self.log so that:

- aws_ecs.py and gcp_cloud_run_job.py: the ECS/Cloud Run task result is emitted into the same task log as the "Running command" line above it instead of disappearing to worker stdout.
- virtualenv.py: the "Waiting for virtualenv lock" message routes to the task log alongside the other lock messages in the same method.

With the conversion the module-level logger = get_logger(__name__) and its import become dead in all three files; drop them.
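
A minimal sketch of the routing difference described above, using only the standard library. `LoggingMixin` here is a simplified stand-in for Airflow's mixin (the real mixin and task handler wiring are more involved), and `ExampleOperator`, `capture_task_log`, and the logger name `cosmos.operators.example` are hypothetical names for illustration, not code from this PR. The handler attached to the operator's logger plays the role of the per-task-instance log.

```python
import io
import logging


class LoggingMixin:
    """Simplified stand-in for Airflow's LoggingMixin (assumption, not the real class)."""

    @property
    def log(self) -> logging.Logger:
        # One logger per class, named after the class's module and name.
        return logging.getLogger(f"{type(self).__module__}.{type(self).__name__}")


# Module-level logger, similar in spirit to what the three call sites used before.
logger = logging.getLogger("cosmos.operators.example")


class ExampleOperator(LoggingMixin):
    """Hypothetical operator used only to illustrate the two logging sinks."""

    def run_before(self, result: str) -> None:
        self.log.info("Running command")
        logger.info("Result: %s", result)  # goes to the module logger, not the task log

    def run_after(self, result: str) -> None:
        self.log.info("Running command")
        self.log.info("Result: %s", result)  # same sink as the line above


def capture_task_log(method_name: str, result: str) -> list[str]:
    """Run one method with a handler on the operator's logger only."""
    op = ExampleOperator()
    stream = io.StringIO()
    handler = logging.StreamHandler(stream)  # stands in for the task log handler
    op.log.addHandler(handler)
    op.log.setLevel(logging.INFO)
    try:
        getattr(op, method_name)(result)
    finally:
        op.log.removeHandler(handler)
    return stream.getvalue().splitlines()
```

Under these assumptions, `capture_task_log("run_before", "ok")` returns only `["Running command"]`, while `capture_task_log("run_after", "ok")` returns both lines, which mirrors the result line joining the "Running command" line in the same log.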

test_run_command_with_virtualenv_dir, which asserts caplog.text.count("Waiting for virtualenv lock to be released") == 2, still passes after the change, confirming caplog captures self.log here.
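
The capture behaviour can be approximated without pytest. The sketch below assumes that caplog works by attaching a handler to the root logger, so any record emitted through `self.log` is seen as long as the logger propagates. `LoggingMixin` is again a simplified stand-in, and `LockWaiter`, `RecordingHandler`, and `captured_text` are hypothetical helpers, not the actual test or operator code.

```python
import logging


class LoggingMixin:
    """Simplified stand-in for Airflow's LoggingMixin (assumption, not the real class)."""

    @property
    def log(self) -> logging.Logger:
        return logging.getLogger(f"{type(self).__module__}.{type(self).__name__}")


class RecordingHandler(logging.Handler):
    """Collects rendered messages, roughly what a capture handler would do."""

    def __init__(self) -> None:
        super().__init__(level=logging.DEBUG)
        self.messages: list[str] = []

    def emit(self, record: logging.LogRecord) -> None:
        self.messages.append(record.getMessage())


class LockWaiter(LoggingMixin):
    """Hypothetical class that logs once per poll while a lock is held."""

    def wait_for_lock(self, polls: int) -> None:
        for _ in range(polls):
            self.log.info("Waiting for virtualenv lock to be released")


def captured_text(polls: int) -> str:
    """Capture everything that propagates to the root logger during the wait."""
    root = logging.getLogger()
    handler = RecordingHandler()
    previous_level = root.level
    root.addHandler(handler)
    root.setLevel(logging.INFO)
    try:
        LockWaiter().wait_for_lock(polls)
    finally:
        # Restore the root logger so the sketch has no lasting side effects.
        root.removeHandler(handler)
        root.setLevel(previous_level)
    return "\n".join(handler.messages)
```

With two polls, `captured_text(2).count("Waiting for virtualenv lock to be released")` evaluates to 2, the same shape of assertion the real test makes against `caplog.text`.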

closes #1157

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

This PR aligns remaining operator logging call sites with the established convention of using Airflow task logging via self.log, so messages appear in task instance logs rather than module-level Cosmos logs.

Changes:

- Removed unused get_logger imports and module-level loggers from affected operators.
- Replaced three remaining module-level logger.info(...) calls with self.log.info(...).

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| `cosmos/operators/virtualenv.py` | Routes virtualenv lock wait logging through the operator task logger. |
| `cosmos/operators/gcp_cloud_run_job.py` | Logs Cloud Run execution results through the operator task logger. |
| `cosmos/operators/aws_ecs.py` | Logs ECS task execution results through the operator task logger. |

codecov · 2026-05-15T07:20:21Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 98.07%. Comparing base (4867faf) to head (4380538).
⚠️ Report is 3 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2681      +/-   ##
==========================================
+ Coverage   98.03%   98.07%   +0.03%     
==========================================
  Files         105      105              
  Lines        7843     7837       -6     
==========================================
- Hits         7689     7686       -3     
+ Misses        154      151       -3

pankajkoti

Do we feel confident enough about the changes in aws_ecs.py and gcp_cloud_run_job.py? I guess we don't have dedicated tests for those, right? Happy to merge if you feel confident/were able to test it.

pankajastro · 2026-06-02T19:44:49Z

Confident, yes. No log-asserting tests like the virtualenv change has, but test_dbt_aws_ecs_build_and_run_cmd and test_dbt_gcp_cloud_run_job_build_and_run_cmd (+ interceptor tests) do exercise the changed build_and_run_cmd path — re-ran both files, 16 passed. It's just swapping the logging sink (logger → self.log, the standard LoggingMixin already used a line above), so only the routing changes. Good to merge.

The Logging section told contributors to "get loggers via `cosmos.log.get_logger`" unconditionally, which would lead someone working in an operator to add a module-level `get_logger` — the exact inconsistency #1157 set out to fix. Scope that guidance to library/module-level code and state the operator rule: inside operators and hooks (anything with `LoggingMixin`), log via `self.log` so messages land in the per-task-instance log shown in the Airflow UI. Examples updated to show both cases. This locks in the outcome of the logging cleanup (#2670, #2680, #2681) so it doesn't silently regress.

pankajastro requested a review from jbandoro as a code owner May 14, 2026 15:53

Copilot AI review requested due to automatic review settings May 14, 2026 15:53

pankajastro requested review from a team, corsettigyg and dwreeves as code owners May 14, 2026 15:53

pankajastro requested review from pankajkoti and tatiana May 14, 2026 15:53

pankajastro temporarily deployed to internal May 14, 2026 15:53 — with GitHub Actions Inactive

Copilot started reviewing on behalf of pankajastro May 14, 2026 15:54

Copilot AI reviewed May 14, 2026

tatiana assigned pankajkoti May 27, 2026

pankajkoti approved these changes Jun 2, 2026

pankajastro merged commit 320cfa3 into main Jun 2, 2026
244 of 245 checks passed

pankajastro deleted the issue-1157-self-log-consistency branch June 2, 2026 19:45

pankajastro mentioned this pull request Jun 3, 2026

docs: document self.log vs get_logger logging convention #2764

Open
