[Docs] Get Started in 5 Mins#2398
Conversation
There was a problem hiding this comment.
Hi @lzdanski, thanks for putting together this first PR!
I left some inline comments, but here’s some higher-level feedback to consider:
- Should we clearly highlight this as new or recently reviewed content, so it’s distinguishable from older material?
- Should this page be added to the doctree?
- Do we want to recommend that end users install Git and clone a full repository just for a quickstart?
- What are the tradeoffs between recommending Astro CLI versus plain Airflow standalone? Using Astro CLI introduces additional tooling overhead (containers, extra dependencies, etc.) that might not be necessary for a “Get Started” guide. We may want to consider an alternative set of steps that are agnostic to both Git and Astro CLI, and instead rely on Airflow standalone. For example:
$ python3 -m venv venv
$ source venv/bin/activate
$ pip install astronomer-cosmos
$ mkdir dags
$ curl -L https://raw.githubusercontent.com/astronomer/cosmos-demo/main/dags/basic/simple_dag.py -o dags/simple_dag.py
$ export AIRFLOW_HOME=`pwd`
$ airflow standalone
- Do we want to include any screenshots in the quickstart?
|
@lzdanski Do you have any questions about the feedback? I'd love to see the feedback addressed today, so we can move forward with the documentation improvements. |
tatiana
left a comment
There was a problem hiding this comment.
I’d suggest using “DAG” rather than “Dag correct,” as it preserves the original acronym for Direct Acyclic Graph. We’ve been consistent with this capitalisation throughout the Cosmos repository documentation, and it would be great to maintain that consistency here as well.
I’m aware that the Apache Airflow community has had various discussions on this topic, and different conventions are used (e.g., Dag or dag). However, since this is within this repository, aligning with our established convention of “DAG” would help keep the documentation clear and consistent - especially for readers who may not be familiar with Airflow-specific approaches.
Alternatively, I’m equally comfortable using more general terms like “workflow” or “pipeline” if that better suits the context. For people starting, it is likely more intuitive.
Could you please update the references accordingly? I’ve pointed out a few instances, but there may be others.
yetudada
left a comment
There was a problem hiding this comment.
Just left a few comments
yetudada
left a comment
There was a problem hiding this comment.
I'm happy on the changes but just see @pankajastro's last comments.
- Renames *Source Nodes Rendering* page to *Managing Sources* per navigation structure discussion - Updates all relative links so that images and code examples render as expected - Adds redirects for all relocated content - Aligns getting started guide navigation labels to be more consistent ### Things this PR does **not** cover that are addressed in other PRs: - This PR does not align titles and navigation labels for newly relocated Execution Modes docs and overview doc (lzdanski#2) - It does not add additional content to newly created index landing pages for the subfolders of the new TOC categories - It does not add new content to the Getting Started/ Cosmos Fundamentals section - It does not move `Profiles` docs into `References` (#2424) ## Restructuring work - Restructures the Getting Started section to have a dedicated space for core concepts called "Cosmos Fundamentals", and another section for getting started guides. <img width="1459" height="506" alt="Screenshot 2026-03-01 at 10 03 01 PM" src="https://github.com/user-attachments/assets/c3e13648-bcc5-481a-958c-3e48fcb5ed6e" /> - Renames "Configuration" to "Guides", Moves execution modes docs from "Getting started" to "Guides" section, adds new categories to the "Guides" section <img width="1462" height="787" alt="Screenshot 2026-03-01 at 10 03 08 PM" src="https://github.com/user-attachments/assets/0a60a3d1-780c-4bb2-b40e-c643c5abb407" /> - Adds a new category of docs called "Optimize Performance" for advanced configuration topics <img width="1446" height="530" alt="Screenshot 2026-03-01 at 10 07 23 PM" src="https://github.com/user-attachments/assets/8bba6494-4125-4704-9a64-594a66a16e6d" /> - Adds a section called "References" for glossary/configs/and other reference material. <img width="1402" height="368" alt="Screenshot 2026-03-01 at 10 07 35 PM" src="https://github.com/user-attachments/assets/d4cf12d2-c3c4-40ce-9d19-6951514c1bc3" /> ## Related Issue(s) Related to the following open PR's: - #2398 - #2416 - #2424
1.14.0 (2026-04-07) --------------------- Breaking Changes * Drop support for Airflow versions earlier than **2.9** by @jedcunningham in #2288 * Fix inclusion of package models and selection/exclusion behavior by @pankajkoti in #2357 * ``ExecutionMode.WATCHER``: The per-node ``*_status`` XCom value is now a dict (``{"status": "<status>", "outlet_uris": [...]}``) instead of a plain string. Any custom code that reads these internal XCom keys directly will need to be updated by @pankajkoti in #2507 Features * Add cluster policy support for ``ExecutionMode.WATCHER`` sensor retries by @astro-anand in #2293 * Add debug mode to track memory utilization by @tatiana in #2327 * Add FQN selection support for ``LoadMode.DBT_MANIFEST`` by @pankajastro in #2375 * Introduce interceptors for Cosmos tasks by @tatiana in #2419 * Add config to allow disabling dag versioning by @pankajkoti in #2470 * Implement TaskGroups by models folder by @maximilianoarcieri and @tatiana in #1566, #2469, and #2420 * feat: implement DbtTestWatcherOperator by @michal-mrazek in #2447 * Add source freshness aware execution for ``ExecutionMode.WATCHER`` by @pankajastro and @tatiana in #2467 * Note: Like ``ExecutionMode.WATCHER``, this feature is experimental and its interface and implementation can change in the future. * Add Airflow 3.2 support by @pankajastro and @pankajkoti in #2472 Enhancements * Add watcher mode support for dbt test node states by @michal-mrazek in #2318 * Rename watcher-mode sensor retry queue and reuse it for producer tasks by @pankajastro in #2331 * Fix leaked semaphore warnings in Airflow 3 by resetting dbt adapters by @pankajkoti in #2335 * Improve dbt Fusion support and related tests by @tatiana in #2356 * Default Snowflake profile mappings to four threads by @tatiana in #2374 * Attempt to remove Pydantic as a dependency by @tatiana in #2377 * Log dbt-core and adapter versions in watcher consumer tasks by @pankajastro in #2412 * Log model errors in watcher consumer on dbt node failure by @pankajastro in #2431 * Reduce XCom read/write for tracking node state and errors in ConsumerWatcher task by @pankajastro in #2471 * Remove duplicate debug log in watcher subprocess path by @tatiana in #2494 * Simplify and unify WATCHER implementation regardless of InvocationMode by @tatiana in #2498 * Switch to lazy imports in cosmos/__init__.py by @pankajkoti in #2531 Bug Fixes * Handle invalid YAML errors with ``LoadMode.DBT_MANIFEST`` and ``RenderConfig.selector`` by @YourRoyalLinus in #2316 * Populate ``compiled_sql`` for ``InvocationMode.SUBPROCESS`` in ``ExecutionMode.WATCHER`` by @pankajkoti in #2319 * Fix select/exclude type mismatch by @tatiana in #2364 * Set ``emit_datasets=False`` for ``DbtTest*`` operators by @pankajastro in #2365 * Set correct queue priority for watcher producer tasks by @pankajastro in #2372 * Preserve ``extra_context`` for watcher consumer task instances by @pankajkoti in #2381 * Respect ``deferrable=False`` from ``operator_args`` on watcher consumer sensors by @pankajkoti in #2384 * Fix watcher queue precedence and add documentation by @pankajastro in #2391 * Do not set ``compiled_sql`` on ``ExecutionMode.WATCHER`` producers by @pankajkoti in #2440 * Remove const attribute for ``__cosmos_telemetry_metadata__`` dag param by @pankajkoti in #2466 * Remove timeout override from Cosmos watcher sensors by @tatiana and @claude in #2478 * Remove forced ``retries=0`` from watcher producer operators by @tatiana in #2479 * RFC: Add patch for newer versions of amazon provider when running dbt on EKS by @aoelvp94 in #2481 * Fix ``cosmos_debug_max_memory_mb`` XCom not pushed in Watcher sensor tasks by @tatiana in #2503 * Fix ``TestBehavior.NONE`` and ``TestBehavior.AFTER_ALL`` exclude ignored with selectors in ``ExecutionMode.WATCHER`` by @pankajkoti in #2511 * Move dataset emission for ``ExecutionMode.WATCHER`` from producer to consumer sensors by @pankajkoti in #2507 Docs * Document cluster policy configuration for ``ExecutionMode.WATCHER`` sensor tasks by @pankajastro in #2315 * Remove outdated docs for the dbt docs plugin with Airflow 3 by @pankajastro in #2353 * Make Watcher DBT Execution Queue heading clickable by @pankajastro in #2354 * Update ``ExecutionMode.WATCHER`` documentation regarding test node implementation by @jroachgolf84 in #2355 * Fix ``pre_dbt_fusion`` configuration rendering by @pankajastro in #2369 * Add documentation for including/excluding nodes based on FQN by @pankajastro in #2371 * Update watcher execution mode documentation by @tatiana in #2380 * Add documentation for ``DbtSeedLocalOperator`` by @jroachgolf84 in #2383 * Fix miscellaneous Sphinx warnings by @pankajastro in #2395 * Improve contributing documentation by @lzdanski in #2397 * Add **Get Started in 5 Minutes** guide by @lzdanski in #2398 * Add Sphinx redirects package for documentation redirects by @lzdanski in #2407 * Restructure **Getting Started** and **Guides** sections by @lzdanski in #2418 * Add open-source quickstart by @lzdanski in #2439 * Fix documentation redirects by @lzdanski in #2442 * Restructure and refactor reference documentation by @lzdanski in #2443 * Add execution modes decision documentation by @lzdanski in #2444 * Add **Core Concepts** page to Getting Started by @lzdanski in #2448 * Add guide: *How Cosmos Works* by @lzdanski in #2449 * Update **Getting Started** overview and index pages by @lzdanski in #2452 * Add guide: *How Cosmos Runs dbt* by @lzdanski in #2453 * Fix miscellaneous documentation links by @lzdanski in #2454 * Add Mermaid diagrams and execution mode diagrams by @lzdanski and @tatiana in #2459 * Add documentation for memory optimization options by @pankajastro in #2340 * Fix typo in watcher execution mode docs by @evanvolgas in #2485 * Fix minor documentation issues by @evanvolgas in #2489 * Add troubleshooting note for dbt debug logs in ExecutionMode.WATCHER by @tatiana in #2491 * docs: unify RST header styles across documentation by @jigangz in #2473 * docs: fix env var for rich logging by @vricciardulli in #2514 * docs: update dbt project path example for Airflow 3 Astro compatibility by @yeoreums in #2512 * Document missing Cosmos Airflow config settings in cosmos-conf.rst by @tatiana in #2515 * Split security-privacy policy doc and add dependency cooldown by @pankajkoti in #2519 * Add performance optimization and troubleshooting docs by @pankajkoti in #2521 * Update copyright year to 2026 by @tayloramurphy in #2527 * docs: Updating "Project Policies" to "Policies" in menu bar by @jroachgolf84 in #2526 Others * Fix tests after removing support for Airflow versions earlier than 2.9 by @tatiana in #2321 * Enable listener tests for Airflow 3.1 by @pankajastro in #2348 * Accept ``int`` or ``float`` for ``cosmos_debug_max_memory_mb`` in integration tests by @pankajkoti in #2352 * Update ``CODEOWNERS`` to prioritize ``oss-integrations`` by @tatiana in #2359 * Fix automatic reviewer assignment in GitHub by @tatiana and @phanikumv in #2360 * Improve PyPI tagging by @tatiana in #2363 * Add integration tests for dbt Fusion and ``ExecutionMode.WATCHER`` by @tatiana in #2373 * Fix Zizmor check by @tatiana in #2376 * Remove ``methodtools`` dependency by @tatiana in #2378 * Improve comments on #2389 by @evanvolgas in #2394 * Refactor ``load_from_dbt_manifest`` to reduce code complexity by @pankajkoti in #2399 * Refactor ``_handle_no_precursors_or_descendants`` to reduce complexity by @pankajkoti in #2400 * Improve issue templates by @tatiana in #2401 * Avoid running tests when only docs change by @tatiana in #2402 * Add ``no-reload`` target for serving docs locally by @pankajkoti in #2405 * Fix test hash checks on macOS by @tatiana in #2406 * Attempt deterministic dbt project copy in test fixtures by @pankajkoti in #2409 * Pin ``virtualenv <21`` due to hatch incompatibility in CI by @pankajkoti in #2410 * Revert virtualenv pin for hatch installation in CI by @pankajkoti in #2426 * Add version comments for commit SHA pinned GitHub Actions by @pankajkoti in #2436 * Fix ``hatch run docs:build`` issues by @tatiana in #2437 * Minor code improvements by @dnskr in #2446 * Pre-commit autoupdate by @pre-commit-ci in #2367, #2396, #2422, #2451, #2468, #2495, and #2516 * Add file to support Claude understanding the Cosmos repository by @tatiana in #2458 * Dependency updates by @dependabot in #2368, #2425, #2435, #2465, #2475, #2504, #2518, and #2528 * Isolate Scarf telemetry integration test into its own CI job by @pankajkoti and @claude in #2477 * ci: upgrade Airflow version to 3.1 in MyPy type-check job by @yeoreums in #2506 * Add commit message guidelines to CLAUDE.md by @pankajkoti in #2509 * Extend skipping tests in CI for more non-code file changes by @pankajkoti in #2510 * Add Dependabot pre-commit support with 7-day cooldown by @pankajkoti in #2517 * Enforce zero warnings policy for documentation by @dnskr in #2513 Co-authored-by: Pankaj Koti <pankajkoti699@gmail.com> Co-authored-by: Tatiana Al-Chueyr <tatiana.alchueyr@gmail.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Pankaj Koti <pankajkoti699@gmail.com> Co-authored-by: Tatiana Al-Chueyr <tatiana.alchueyr@gmail.com>
Description
Adapts the Cosmos eBook "Run locally in 5 minutes" guide for the docs.
Related Issue(s)
Breaking Change?
None expected.
Checklist