Docs fix #2301

pggPL · 2025-10-24T10:30:11Z

Description

Our documentation returned a lot of warnings and it seems that some of them were rational. It turned out that half of our PyTorch API was not rendered. This PR fixes all the warnings and forces Github workflow to error out if any docs warning will appear.

The most problematic error was related to cyclic imports - this resulted in part of our PyTorch API not being rendered. Other were mostly related to wrong formatting and some warnings didn't result in anything wrong.

Our docs for 2.9 contains only part of PyTorch API exposed in 2.8, so this will need urgent fix I will update in separate PR.

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactoring

Checklist:

[ x] I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: Pawel Gadzinski <[email protected]>

greptile-apps

Greptile Overview

Greptile Summary

This PR resolves critical documentation rendering issues where approximately half of the PyTorch API was not being rendered in the generated documentation. The root causes were cyclic import dependencies and reStructuredText formatting violations. The fix involves restructuring imports across JAX and PyTorch modules to break circular dependencies (primarily moving QuantizeLayout and torch_version imports to their source modules), correcting RST section underlines to match header text lengths, converting docstring formatting to proper RST syntax, and enforcing strict documentation builds in CI by adding the -W flag to treat warnings as errors.

Important Files Changed

Filename	Score	Overview
`.github/workflows/docs.yml`	5/5	Added `-W` flag to Sphinx builds to make documentation warnings fatal in CI
`transformer_engine/pytorch/utils.py`	5/5	Converted `torch_version` from module-level import to cached function to break cyclic dependencies
`transformer_engine/pytorch/cross_entropy.py`	5/5	Converted `parallel_cross_entropy` from bare function reference to documented wrapper function
`transformer_engine/jax/quantize/quantizer.py`	5/5	Removed `QuantizeLayout` from `__all__` to break circular dependency chain
`transformer_engine/jax/quantize/hadamard.py`	5/5	Changed `QuantizeLayout` import from delayed local import to top-level C++ extension import
`transformer_engine/jax/cpp_extensions/misc.py`	5/5	Moved `QuantizeLayout` import from relative path to C++ extension to break cycle
`transformer_engine/jax/cpp_extensions/activation.py`	5/5	Moved `QuantizeLayout` import from quantize module to C++ extension module
`transformer_engine/jax/cpp_extensions/quantization.py`	5/5	Moved `QuantizeLayout` import from quantize module to C++ extension module
`transformer_engine/jax/cpp_extensions/gemm.py`	5/5	Moved `QuantizeLayout` import from quantize module to C++ extension module
`transformer_engine/jax/cpp_extensions/normalization.py`	5/5	Moved `QuantizeLayout` import from quantize module to C++ extension module
`transformer_engine/jax/dense.py`	5/5	Moved `QuantizeLayout` import from local module to top-level package
`transformer_engine/pytorch/jit.py`	5/5	Changed `torch_version` import from current module to utils submodule
`transformer_engine/pytorch/distributed.py`	5/5	Changed `torch_version` import from package-level to explicit utils module
`transformer_engine/pytorch/quantization.py`	5/5	Reorganized imports with proper blank line separation for clarity
`transformer_engine/pytorch/module/linear.py`	5/5	Changed `torch_version` import to come from utils submodule
`transformer_engine/pytorch/module/layernorm_linear.py`	5/5	Changed `torch_version` import to come from utils submodule
`transformer_engine/pytorch/module/layernorm_mlp.py`	4/5	Changed `torch_version` import and improved docstring RST formatting
`transformer_engine/pytorch/ops/_common.py`	5/5	Changed `torch_version` import to come from utils submodule
`transformer_engine/pytorch/ops/basic/l2normalization.py`	5/5	Changed `torch_version` import to come from utils submodule
`transformer_engine/pytorch/transformer.py`	5/5	Fixed `torch_version` import and improved docstring RST formatting
`transformer_engine/pytorch/module/base.py`	5/5	Converted docstring code blocks from markdown to RST syntax
`transformer_engine/pytorch/attention/dot_product_attention/dot_product_attention.py`	5/5	Improved RST formatting of docstring with proper inline code and bullet lists
`transformer_engine/pytorch/attention/multi_head_attention.py`	5/5	Improved RST formatting of docstring with proper inline code and bullet lists
`transformer_engine/jax/flax/transformer.py`	5/5	Converted ASCII table in docstring to RST table directive
`docs/conf.py`	4/5	Added custom logging filter to suppress unavoidable duplicate namespace warnings
`docs/api/pytorch.rst`	4/5	Reorganized API documentation with new sections and deprecated functions section
`docs/debug.rst`	5/5	Added blank line after copyright header to fix RST parsing
`docs/debug/api.rst`	5/5	Added blank line after copyright header to fix RST parsing
`docs/debug/3_api_features.rst`	5/5	Extended section underline to match header length
`docs/debug/3_api_debug_setup.rst`	5/5	Extended section underlines to match header lengths
`docs/debug/4_distributed.rst`	5/5	Extended section underlines to match header lengths
`docs/debug/2_config_file_structure.rst`	5/5	Extended section underlines to match header lengths
`docs/debug/1_getting_started.rst`	0/5	Section underlines still don't match header text lengths exactly - will cause RST warnings
`docs/examples/attention/attention.ipynb`	5/5	Updated API reference links to lowercase format for Sphinx compatibility
`docs/examples/te_gemma/tutorial_generation_gemma_with_te.ipynb`	5/5	Fixed broken internal documentation link

Confidence score: 2/5

This PR contains one file (docs/debug/1_getting_started.rst) with incorrect RST section underline lengths that will cause Sphinx warnings, directly contradicting the PR's goal of eliminating all warnings
All other changes are formatting and import refactoring with minimal risk, but the incomplete fix in 1_getting_started.rst will cause the new CI check (which treats warnings as errors) to fail
Pay close attention to docs/debug/1_getting_started.rst - the underlines need to match the exact character length of each section header, not use a fixed-width approach

_{35 files reviewed, 8 comments}

_{Edit Code Review Agent Settings | Greptile}

docs/api/pytorch.rst

greptile-apps · 2025-10-24T10:32:00Z