[Agent] [Ops] Implement formatting settings contract in output writers (#303) by stranske · Pull Request #308 · stranske/Counter_Risk

stranske · 2026-03-01T01:44:25Z

Source: Issue #303

Automated Status Summary

Scope

Runner settings currently capture formatting_profile, but output writers do not consistently apply operator-configurable formatting behavior. This blocks a key operator requirement (decimal/currency/accounting controls) for finicky presentation standards.

Tasks

Define and document the formatting settings schema (profile + explicit knobs where needed).
Parse formatting settings from runner/CLI settings payload into a typed runtime object.
Apply formatting rules in primary output writers (at minimum generated workbook writes and static table render outputs).
Ensure default behavior preserves template formatting when no override is selected.
Add tests that verify currency/decimal/accounting behavior for representative output cells/tables.
Update operator documentation with supported formatting options and defaults.

Acceptance criteria

Selecting a non-default formatting profile changes output formatting in documented surfaces.
Default profile preserves current template-driven formatting behavior.
Automated tests verify at least decimal precision, currency symbol handling, and negative/accounting style behavior.
Runner/CLI docs describe exactly what formatting settings are honored today.

Head SHA: 2369c02
Latest Runs: ✅ success — Gate
Required: gate: ✅ success

Workflow / Job	Result	Logs
.github/workflows/autofix.yml	❌ failure	View run
Agents PR Event Hub	⏭️ skipped	View run
Agents PR Meta	❔ in progress	View run
Dependabot Auto-merge	⏭️ skipped	View run
Gate	✅ success	View run
Health 45 Agents Guard	✅ success	View run

Copilot

Pull request overview

Adds an initial “formatting profile” contract layer to the Counter Risk codebase by defining a small, typed formatting-policy resolver and validating its behavior with unit tests.

Changes:

Introduce counter_risk.formatting with FormattingPolicy, profile normalization, and profile→policy resolution.
Add unit tests covering profile normalization and expected Excel number-format strings for supported profiles.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
`src/counter_risk/formatting.py`	Defines supported formatting profiles and resolves them into a typed `FormattingPolicy` (Excel number formats or `None` to preserve template formatting).
`tests/test_formatting.py`	Adds baseline unit tests for normalization and policy resolution outputs.

src/counter_risk/formatting.py

agents-workflows-bot · 2026-03-01T01:48:54Z

🤖 Keepalive Loop Status

PR #308 | Agent: Codex | Iteration 0/5

Current State

Metric	Value
Iteration progress	[----------] 0/5
Action	wait (missing-agent-label)
Disposition	skipped (transient)
Gate	success
Tasks	0/10 complete
Timeout	45 min (default)
Timeout usage	6m elapsed (14%, 39m remaining)
Keepalive	❌ disabled
Autofix	❌ disabled

🔍 Failure Classification

agents-workflows-bot · 2026-03-01T01:48:54Z

Keepalive Work Log (click to expand)

Time (UTC)	Agent	Action	Result	Files	Progress	Commit	Gate
2026-03-01 01:48:54	Codex	wait (missing-agent-label-transient)	skipped	—	0/10	—	cancelled
2026-03-01 01:53:49	Codex	wait (missing-agent-label-transient)	skipped	—	0/10	—	failure
2026-03-01 02:00:32	Codex	wait (missing-agent-label-transient)	skipped	—	0/10	—	success
2026-03-01 02:01:43	Codex	wait (missing-agent-label-transient)	skipped	—	0/10	—	—
2026-03-01 02:07:16	Codex	wait (missing-agent-label-transient)	skipped	—	0/10	—	success

stranske · 2026-03-01T01:53:25Z

Reviewed inline note: this branch now includes runtime wiring (CLI -> pipeline -> historical output writer) for ; scope is no longer helpers-only. No further changes needed for that comment.

stranske · 2026-03-01T01:53:33Z

Reviewed inline note: this branch includes runtime wiring (CLI -> pipeline -> historical output writer) for formatting_profile; scope is no longer helpers-only. No further code changes were needed for that comment.

github-actions · 2026-03-01T02:08:53Z

Provider Comparison Report

Provider Summary

Provider	Model	Verdict	Confidence	Summary
openai	gpt-5.2	CONCERNS	74%	Implements a typed formatting profile/policy resolver and threads a formatting_profile from CLI/settings through the pipeline OutputContext to the historical workbook writer, applying Excel number_...
anthropic	claude-sonnet-4-5-20250929	PASS	92%	The implementation successfully addresses the core acceptance criteria for formatting settings contract. Key achievements: (1) Formatting schema is defined with four profiles (default, currency, ac...

📋 Full Provider Details (click to expand)

openai

Model: gpt-5.2
Verdict: CONCERNS
Confidence: 74%
Scores:
- Correctness: 8.0/10
- Completeness: 7.0/10
- Quality: 8.0/10
- Testing: 8.0/10
- Risks: 4.0/10
Summary: Implements a typed formatting profile/policy resolver and threads a formatting_profile from CLI/settings through the pipeline OutputContext to the historical workbook writer, applying Excel number_format overrides for notional and counterparties cells for currency/accounting/plain profiles while leaving default as template-driven (no override). Adds automated tests covering decimal precision via format strings, currency symbol handling, and accounting-style behavior, plus docs describing honored settings and current scope. However, acceptance criteria implying broader output-writer coverage (e.g., static table renders) is not met in code, and default preservation is not directly asserted in tests. Additional unrelated GUI/chat/logging dependency changes expand risk/scope.
Concerns:
- Acceptance scope called for applying formatting rules in "primary output writers" including "static table render outputs"; this PR wires formatting into the historical workbook append path only (pipeline -> OutputContext -> _merge_historical_workbook). README explicitly states current scope is limited to historical workbook. If there are other table/static outputs expected by the criterion, they are not updated here.
- Formatting settings schema is implemented only as a profile selector -> resolved policy (FormattingPolicy with two number-format fields). The task list mentions "explicit knobs where needed"; no additional explicit knobs beyond the profile are supported/parsed.
- Default behavior preservation is implemented by not overriding number_format when policy fields are None, but tests do not explicitly assert that template formatting is preserved under the default profile (they only validate currency/accounting overrides).
- Non-formatting changes are bundled (GUI runner, chat provider dependency guidance, chat logging modes, new hard dependencies on langchain-openai/langchain-anthropic). This increases change surface area and operational risk relative to the PR’s stated formatting focus (though not necessarily incorrect).

anthropic

Model: claude-sonnet-4-5-20250929
Verdict: PASS
Confidence: 92%
Scores:
- Correctness: 9.0/10
- Completeness: 9.0/10
- Quality: 9.0/10
- Testing: 8.0/10
- Risks: 8.0/10
Summary: The implementation successfully addresses the core acceptance criteria for formatting settings contract. Key achievements: (1) Formatting schema is defined with four profiles (default, currency, accounting, plain) in src/counter_risk/formatting.py with clear resolution logic. (2) Settings are parsed from runner/CLI via --formatting-profile flag and propagated through the pipeline to output writers. (3) Historical workbook writer applies formatting rules to notional and counterparty columns based on selected profile. (4) Default profile preserves template formatting (None values). (5) Comprehensive tests verify currency/accounting/plain formatting behavior in tests/test_formatting.py and tests/pipeline/test_run_pipeline.py. (6) Documentation updated in README.md with supported profiles and scope.

The implementation is well-structured with proper separation of concerns (formatting.py module, OutputContext extension, pipeline threading). Code quality is high with type hints, clear naming, and defensive validation. Testing coverage includes unit tests for formatting resolution and integration tests for workbook cell formatting.

Limitations: Formatting currently only applies to historical workbook append operations, not to all output surfaces mentioned in the original scope (static table renders). The GUI runner addition (while valuable) is tangential to the core formatting requirement and introduces additional complexity. Chat logging changes are well-tested but represent behavioral changes that operators should be aware of.

Overall, the core formatting contract is correctly implemented and tested, meeting the primary acceptance criteria despite some scope limitations.

Concerns:
- Formatting profile application is limited to historical workbook append rows only - other output surfaces mentioned in scope (static table renders) are not yet implemented
- GUI runner implementation adds significant new surface area with Tkinter dependency that may have platform-specific behavior not fully covered by tests
- Chat logging mode changes alter default behavior (transcript always on) which could impact existing deployments expecting no logging overhead
- Missing dependency detection relies on importlib.util.find_spec which may not catch all edge cases in frozen/packaged environments

Agreement

Correctness: scores within 1 point (avg 8.5/10, range 8.0-9.0)
Quality: scores within 1 point (avg 8.5/10, range 8.0-9.0)
Testing: scores within 1 point (avg 8.0/10, range 8.0-8.0)

Disagreement

Dimension	openai	anthropic
Verdict	CONCERNS	PASS
Completeness	7.0/10	9.0/10
Risks	4.0/10	8.0/10

Unique Insights

openai: Acceptance scope called for applying formatting rules in "primary output writers" including "static table render outputs"; this PR wires formatting into the historical workbook append path only (pipeline -> OutputContext -> _merge_historical_workbook). README explicitly states current scope is limited to historical workbook. If there are other table/static outputs expected by the criterion, they are not updated here.; Formatting settings schema is implemented only as a profile selector -> resolved policy (FormattingPolicy with two number-format fields). The task list mentions "explicit knobs where needed"; no additional explicit knobs beyond the profile are supported/parsed.; Default behavior preservation is implemented by not overriding number_format when policy fields are None, but tests do not explicitly assert that template formatting is preserved under the default profile (they only validate currency/accounting overrides).; Non-formatting changes are bundled (GUI runner, chat provider dependency guidance, chat logging modes, new hard dependencies on langchain-openai/langchain-anthropic). This increases change surface area and operational risk relative to the PR’s stated formatting focus (though not necessarily incorrect).
anthropic: Formatting profile application is limited to historical workbook append rows only - other output surfaces mentioned in scope (static table renders) are not yet implemented; GUI runner implementation adds significant new surface area with Tkinter dependency that may have platform-specific behavior not fully covered by tests; Chat logging mode changes alter default behavior (transcript always on) which could impact existing deployments expecting no logging overhead; Missing dependency detection relies on importlib.util.find_spec which may not catch all edge cases in frozen/packaged environments

🔍 LangSmith Traces

feat(formatting): add profile resolution module and baseline tests

3e780e6

Copilot AI review requested due to automatic review settings March 1, 2026 01:44

Copilot started reviewing on behalf of stranske March 1, 2026 01:44 View session

Copilot AI reviewed Mar 1, 2026

View reviewed changes

src/counter_risk/formatting.py Show resolved Hide resolved

feat(formatting): apply runtime profiles to historical output writers

ac754d8

Fix pipeline tests for formatting profile wiring

2369c02

stranske merged commit 391281b into main Mar 1, 2026
276 checks passed

stranske added the verify:compare Runs verifier comparison mode after merge label Mar 1, 2026

stranske temporarily deployed to agent-standard March 1, 2026 02:01 — with GitHub Actions Inactive

stranske mentioned this pull request Mar 1, 2026

[Agent] Mop-up: harden LangChain runtime imports + pipeline test compatibility #310

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Agent] [Ops] Implement formatting settings contract in output writers (#303)#308

[Agent] [Ops] Implement formatting settings contract in output writers (#303)#308
stranske merged 3 commits intomainfrom
codex/issue-303-formatting-contract

stranske commented Mar 1, 2026 •

edited by stranske-keepalive bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

agents-workflows-bot bot commented Mar 1, 2026 •

edited

Loading

Uh oh!

agents-workflows-bot bot commented Mar 1, 2026 •

edited

Loading

Uh oh!

stranske commented Mar 1, 2026

Uh oh!

stranske commented Mar 1, 2026

Uh oh!

Uh oh!

github-actions bot commented Mar 1, 2026

openai

anthropic

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

stranske commented Mar 1, 2026 • edited by stranske-keepalive bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Status Summary

Scope

Tasks

Acceptance criteria

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

agents-workflows-bot bot commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🤖 Keepalive Loop Status

Current State

🔍 Failure Classification

Uh oh!

agents-workflows-bot bot commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stranske commented Mar 1, 2026

Uh oh!

stranske commented Mar 1, 2026

Uh oh!

Uh oh!

github-actions bot commented Mar 1, 2026

Provider Comparison Report

Provider Summary

openai

anthropic

Agreement

Disagreement

Unique Insights

🔍 LangSmith Traces

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stranske commented Mar 1, 2026 •

edited by stranske-keepalive bot

Loading

agents-workflows-bot bot commented Mar 1, 2026 •

edited

Loading

agents-workflows-bot bot commented Mar 1, 2026 •

edited

Loading