-
Notifications
You must be signed in to change notification settings - Fork 416
Perform vale spelling checks on notebooks #896
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Perform vale spelling checks on notebooks #896
Conversation
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
|
Warning Rate limit exceeded@dagardner-nv has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 9 minutes and 21 seconds before requesting another review. ⌛ How to resolve this issue?After the wait time has elapsed, a review can be triggered using the We recommend that you space out your commits to avoid hitting the rate limit. 🚦 How do rate limits work?CodeRabbit enforces hourly rate limits for each developer per organization. Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout. Please see our FAQ for further information. 📒 Files selected for processing (2)
WalkthroughAdds notebook-to-Markdown conversion in documentation CI, then runs Vale on docs and converted notebooks. Updates Vale vocabulary. Edits multiple example notebooks’ markdown/text. Adds nbconvert as a dev dependency. Adjusts temp directory handling, error handling, and cleanup behavior in the CI script. Changes
Sequence Diagram(s)sequenceDiagram
autonumber
actor Dev as Developer
participant CI as CI: documentation_checks.sh
participant JN as jupyter nbconvert
participant Vale as Vale
Dev->>CI: Run documentation checks
Note over CI: Discover DOC_FILES and NOTEBOOK_FILES
CI->>JN: Convert *.ipynb -> Markdown (EXPORT_DIR)
alt Conversion fails
JN-->>CI: Non-zero exit
CI->>CI: Cleanup EXPORT_DIR (unless preserved)
CI-->>Dev: Exit non-zero
else Conversion succeeds
JN-->>CI: Converted Markdown paths
CI->>Vale: Lint DOC_FILES + converted notebooks
Vale-->>CI: Exit code (RETVAL)
CI->>CI: Cleanup or preserve EXPORT_DIR
CI-->>Dev: Return RETVAL
end
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~25 minutes Pre-merge checks and finishing touches✅ Passed checks (3 passed)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 8
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
⛔ Files ignored due to path filters (1)
uv.lockis excluded by!**/*.lock
📒 Files selected for processing (7)
ci/scripts/documentation_checks.sh(1 hunks)ci/vale/styles/config/vocabularies/nat/accept.txt(4 hunks)examples/notebooks/1_getting_started.ipynb(6 hunks)examples/notebooks/2_add_tools_and_agents.ipynb(6 hunks)examples/notebooks/3_observability_evaluation_and_profiling.ipynb(2 hunks)examples/notebooks/launchables/GPU_Cluster_Sizing_with_NeMo_Agent_Toolkit.ipynb(2 hunks)pyproject.toml(1 hunks)
🧰 Additional context used
📓 Path-based instructions (3)
**/*
⚙️ CodeRabbit configuration file
**/*: # Code Review Instructions
- Ensure the code follows best practices and coding standards. - For Python code, follow
PEP 20 and
PEP 8 for style guidelines.- Check for security vulnerabilities and potential issues. - Python methods should use type hints for all parameters and return values.
Example:def my_function(param1: int, param2: str) -> bool: pass- For Python exception handling, ensure proper stack trace preservation:
- When re-raising exceptions: use bare
raisestatements to maintain the original stack trace,
and uselogger.error()(notlogger.exception()) to avoid duplicate stack trace output.- When catching and logging exceptions without re-raising: always use
logger.exception()
to capture the full stack trace information.Documentation Review Instructions - Verify that documentation and comments are clear and comprehensive. - Verify that the documentation doesn't contain any TODOs, FIXMEs or placeholder text like "lorem ipsum". - Verify that the documentation doesn't contain any offensive or outdated terms. - Verify that documentation and comments are free of spelling mistakes, ensure the documentation doesn't contain any
words listed in the
ci/vale/styles/config/vocabularies/nat/reject.txtfile, words that might appear to be
spelling mistakes but are listed in theci/vale/styles/config/vocabularies/nat/accept.txtfile are OK.Misc. - All code (except .mdc files that contain Cursor rules) should be licensed under the Apache License 2.0,
and should contain an Apache License 2.0 header comment at the top of each file.
- Confirm that copyright years are up-to date whenever a file is changed.
Files:
pyproject.tomlci/vale/styles/config/vocabularies/nat/accept.txtexamples/notebooks/2_add_tools_and_agents.ipynbexamples/notebooks/3_observability_evaluation_and_profiling.ipynbci/scripts/documentation_checks.shexamples/notebooks/1_getting_started.ipynbexamples/notebooks/launchables/GPU_Cluster_Sizing_with_NeMo_Agent_Toolkit.ipynb
examples/**/*
⚙️ CodeRabbit configuration file
examples/**/*: - This directory contains example code and usage scenarios for the toolkit, at a minimum an example should
contain a README.md or file README.ipynb.
- If an example contains Python code, it should be placed in a subdirectory named
src/and should
contain apyproject.tomlfile. Optionally, it might also contain scripts in ascripts/directory.- If an example contains YAML files, they should be placed in a subdirectory named
configs/. - If an example contains sample data files, they should be placed in a subdirectory nameddata/, and should
be checked into git-lfs.
Files:
examples/notebooks/2_add_tools_and_agents.ipynbexamples/notebooks/3_observability_evaluation_and_profiling.ipynbexamples/notebooks/1_getting_started.ipynbexamples/notebooks/launchables/GPU_Cluster_Sizing_with_NeMo_Agent_Toolkit.ipynb
{scripts/**,ci/scripts/**}
📄 CodeRabbit inference engine (.cursor/rules/general.mdc)
Shell or utility scripts belong in scripts/ or ci/scripts/ and must not be mixed with library code
Files:
ci/scripts/documentation_checks.sh
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
- GitHub Check: CI Pipeline / Check
🔇 Additional comments (14)
examples/notebooks/launchables/GPU_Cluster_Sizing_with_NeMo_Agent_Toolkit.ipynb (2)
9-9: LGTM!The updated description clearly explains the notebook's purpose using proper grammar and spelling. The change from a toolkit-specific description to a generic example-based explanation improves clarity.
423-423: LGTM!The conversion from HTML Bootstrap-styled alert to Markdown note format improves consistency and simplicity. The formatting is correct, and the instruction is clear.
examples/notebooks/1_getting_started.ipynb (5)
63-63: LGTM! Spelling correction applied.The typo "resuability" has been correctly changed to "reusability" in the text.
185-189: LGTM! Formatting improvements for tool references.The formatting changes consistently present FastAPI, WebSockets, and MCP with proper capitalization and inline code styling.
264-264: LGTM! Framework name capitalization standardized.The change properly capitalizes "LlamaIndex" to match the official framework name.
276-276: LGTM! NIM capitalization correction.The text now properly capitalizes "NIM" as an acronym/product name.
289-289: LGTM! Framework reference standardized.The change to "LangChain/LangGraph's Tavily search API" properly capitalizes both framework names and uses correct possessive form.
examples/notebooks/2_add_tools_and_agents.ipynb (6)
34-34: LGTM! Path reference formatted as inline code.The path reference has been appropriately wrapped in inline code formatting for better readability.
71-71: LGTM! Path reference formatted consistently.
119-119: LGTM! Framework name capitalized correctly."LlamaIndex" now matches the official product name capitalization.
130-130: LGTM! Technical section references formatted as inline code.References to configuration sections (
embedderandllms) are now properly formatted with inline code styling.
220-230: LGTM! Technical terms and identifiers formatted consistently.All technical references (file paths, variable names like
user_input_manager,AIMessage,conditional_edge,check_hitl_approval) are now properly wrapped in inline code formatting for consistency and readability.
261-261: LGTM! Variable reference formatted as inline code.The
user_input_managervariable reference is now properly formatted.ci/scripts/documentation_checks.sh (1)
20-21: LGTM! File collection updated correctly.The script now collects both Markdown/RST files and notebook files separately, excluding CHANGELOG and LICENSE as intended.
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Signed-off-by: David Gardner <[email protected]>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Signed-off-by: David Gardner <[email protected]>
Signed-off-by: David Gardner <[email protected]>
|
/merge |
Description
nv_internaldirectoryBy Submitting this PR I confirm:
Summary by CodeRabbit
Documentation
Chores