Add cross-process cache for Python dependency checks. by TinyuZhao · Pull Request #438 · pioarduino/platform-espressif32

TinyuZhao · 2026-03-20T05:22:12Z

Description:

Related issue (if applicable): fixes #423

Checklist:

The pull request is done against the latest develop branch
Only relevant files were touched
Only one feature/fix was added per PR, more changes are allowed when changing boards.json
I accept the CLA

Summary by CodeRabbit

Performance
- Dependency installation now uses a shared cache and cross-process coordination to avoid redundant installs and speed up repeated setups.
Improvements
- More robust handling when concurrent processes run setup, with timeouts to avoid unsynchronized installs.
- Expanded logging with detailed timing for connectivity and dependency installation phases.

coderabbitai · 2026-03-20T05:22:19Z

📝 Walkthrough

Added cross-process dependency caching and file-lock coordination to builder/penv_setup.py, with fingerprinting of env/deps, cache state files under platformio_dir/.cache, validation of cached environments, timed logging for connectivity and dependency phases, and lock-based install orchestration with timeout/error behavior.

Changes

Cohort / File(s)	Summary
Dependency caching, locking, and logging `builder/penv_setup.py`	Added shared cache state files and fingerprinting (based on `python_deps`, Python version, and penv markers). Implemented file-based lock acquisition/release with timeout for cross-process coordination, helpers to read/write/validate cache state, and logic to skip/install deps depending on cache validity and `has_network`. Expanded timing/logging for connectivity checks and dependency install steps; updated `_setup_python_environment_core` to avoid duplicate installs in-process and to handle lock contention and error paths.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant Proc as Process (requesting)
  participant LockFS as File Lock (.cache/.lock)
  participant CacheFS as Cache State (.cache/state.json)
  participant Network as Network/Index
  participant Penv as Python virtualenv

  Proc->>LockFS: try acquire lock (timeout)
  alt lock acquired
    LockFS-->>Proc: lock granted
    Proc->>CacheFS: read fingerprint
    alt fingerprint valid
      CacheFS-->>Proc: valid -> skip install
    else fingerprint missing/invalid
      Proc->>Network: check has_internet_connection
      alt network available
        Network-->>Proc: reachable
        Proc->>Penv: install python_deps (uv etc.)
        Penv-->>Proc: install results
        Proc->>CacheFS: write new fingerprint/state
      else network unavailable
        Network-->>Proc: unreachable
        Proc->>CacheFS: if valid fallback else error and exit
      end
    end
    Proc->>LockFS: release lock
  else lock timeout
    LockFS-->>Proc: cannot acquire
    Proc->>CacheFS: if valid use cache else error and exit
  end
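
The branches of the diagram above can be condensed into a small decision function. This is only a sketch of the described flow, not the code in `builder/penv_setup.py`: the names `Action` and `decide_action` are hypothetical, and it assumes that "if valid fallback" in the no-network branch refers to the same cache validity check, so an invalid cache without network ends in an error.

```python
from enum import Enum


class Action(Enum):
    """Hypothetical outcome labels for the dependency setup step."""
    SKIP_INSTALL = "skip"
    INSTALL = "install"
    FAIL = "fail"


def decide_action(lock_acquired: bool, cache_valid: bool, has_network: bool) -> Action:
    """Map the three conditions in the diagram to one outcome."""
    # A valid cache is usable whether or not the lock was obtained.
    if cache_valid:
        return Action.SKIP_INSTALL
    # Without the lock, installing would be unsynchronized, so stop instead.
    if not lock_acquired:
        return Action.FAIL
    # Lock held and cache invalid: install only when the index is reachable.
    if has_network:
        return Action.INSTALL
    return Action.FAIL
```

Keeping the decision separate from the side effects (locking, installing, writing state) makes each branch easy to test in isolation.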

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

set uv cache dir to .platformio/.cache folder for easy clean up #386: Modifies builder/penv_setup.py dependency/install flow and UV cache propagation; closely related to the cache/lock/fingerprint changes here.

Poem

🐰 A lock, a cache, a fingerprint bright,

Processes queue in patient delight.
Once installed, the meadow hums free—
Hops saved time for builds and me! 🥕✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 35.71% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the main change: adding cross-process caching for Python dependency checks to avoid unnecessary repeated checks.
Linked Issues check	✅ Passed	The changes implement local verification/caching of Python dependencies (`#423`), avoiding unnecessary network checks and improving build responsiveness for users in constrained network environments.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to addressing issue `#423` by implementing Python dependency caching; no unrelated modifications detected.

TinyuZhao · 2026-03-20T05:23:41Z

@coderabbitai full review

coderabbitai · 2026-03-20T05:23:48Z

✅ Actions performed

Full review triggered.

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@builder/penv_setup.py`:
- Around line 122-124: The lock wait loop uses lock_file and timeout_sec and
currently allows code later to proceed to the dependency-installation block even
after a timeout; change this to fail-fast: ensure the TimeoutError raised on
lock acquisition is not caught and ignored or followed by an unsynchronized
fallback, and instead propagate the TimeoutError out of the penv setup function
so no installation code runs; remove or alter any try/except or conditional that
swallows TimeoutError and prevents early exit, and ensure the code path that
performs dependency installation (the block that installs packages into the
penv) only runs when the lock was successfully acquired.
- Around line 83-90: The current _deps_fingerprint uses only python_deps,
sys.version_info and penv_python path so a recreated virtualenv at the same path
can produce the same cache key; update _deps_fingerprint to also include a penv
instance marker (readable unique id) from the virtualenv itself (e.g., a marker
file or a value from pyvenv.cfg inside the venv directory) or, if missing, the
venv directory mtime/UUID created at venv creation; include that marker in the
payload alongside python_deps and sys.version_info so cache misses on venv
recreation, and ensure the same change is applied to the call site referenced
around the logic at the lines that check/cache (the branch around lines 633-635)
so the new fingerprint is used when deciding whether to skip installs.
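
One way to read the fingerprint suggestion above is sketched below. It is an illustration under assumptions, not the PR's `_deps_fingerprint`: the function name `deps_fingerprint`, its signature, and the choice of `pyvenv.cfg` modification time plus size as the instance marker are all hypothetical.

```python
import hashlib
import json
import os
import sys


def deps_fingerprint(python_deps: dict, penv_dir: str) -> str:
    """Hash the dependency spec, interpreter version, and a venv instance marker."""
    cfg_path = os.path.join(penv_dir, "pyvenv.cfg")
    try:
        st = os.stat(cfg_path)
        # A recreated venv rewrites pyvenv.cfg, so its stats change.
        marker = [st.st_mtime_ns, st.st_size]
    except OSError:
        # No pyvenv.cfg: still hashes, but never matches a venv that has one.
        marker = None
    payload = {
        "deps": python_deps,
        "python": list(sys.version_info[:3]),
        "penv": os.path.abspath(penv_dir),
        "marker": marker,
    }
    # sort_keys keeps the hash stable regardless of dict ordering.
    blob = json.dumps(payload, sort_keys=True).encode("utf-8")
    return hashlib.sha256(blob).hexdigest()
```

File stats are a cheap marker but not a perfect one; a UUID written into the venv at creation time, as the comment also suggests, would be more robust.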

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 06e4f3d9-3d54-4a0e-be93-e38e6df5fa2e

📥 Commits

Reviewing files that changed from the base of the PR and between cef91da and 6675b0d.

📒 Files selected for processing (1)

builder/penv_setup.py

TinyuZhao · 2026-03-20T06:18:33Z

@coderabbitai full review

coderabbitai · 2026-03-20T06:18:43Z

✅ Actions performed

Full review triggered.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

builder/penv_setup.py (2)

662-667: Minor TOCTOU: fingerprint computed before lock acquisition.

The fingerprint is computed at line 663 before the lock is acquired at line 666. If another process recreates the penv directory between these two points, the fingerprint (which includes pyvenv.cfg stats) would be stale.

This is a narrow race window and the secondary validation at line 679 provides a safety net, so this is unlikely to cause practical issues. However, for maximum correctness, consider recomputing the fingerprint after acquiring the lock.

♻️ Suggested fix

         state_file, lock_file = _deps_state_paths(platformio_dir)
-        fingerprint = _deps_fingerprint(penv_python)
         try:
             lock_wait_started_at = time.monotonic()
             lock_fd = _acquire_file_lock(lock_file)
             print(f"[penv] Dependency lock acquired in {time.monotonic() - lock_wait_started_at:.2f}s")
+            fingerprint = _deps_fingerprint(penv_python)
         except TimeoutError as e:
             lock_fd = None
+            fingerprint = _deps_fingerprint(penv_python)
             if _is_penv_dependency_cache_valid(state_file, fingerprint, penv_python):

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@builder/penv_setup.py` around lines 662 - 667, The fingerprint is computed by
_deps_fingerprint(penv_python) before acquiring the dependency lock via
_acquire_file_lock(lock_file), creating a TOCTOU window; after obtaining the
lock (after lock_fd is set) recompute the fingerprint (e.g., call
_deps_fingerprint(penv_python) again and overwrite the earlier value) and then
proceed with the existing validation logic that uses fingerprint and state_file
so the fingerprint reflects the locked state of the penv directory.

133-136: Consider chaining the TimeoutError for better diagnostics.

Per Python best practices, exceptions raised within an except clause should be chained to preserve the original context. This helps with debugging.

♻️ Suggested fix

-        except FileExistsError:
+        except FileExistsError as e:
             if time.monotonic() - started >= timeout_sec:
-                raise TimeoutError(f"Timeout waiting lock: {lock_file}")
+                raise TimeoutError(f"Timeout waiting lock: {lock_file}") from e
             time.sleep(0.1)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@builder/penv_setup.py` around lines 133 - 136, The except block currently
swallows the original FileExistsError; change the handler to capture the
original exception (e.g., "except FileExistsError as e:") and when raising the
TimeoutError include the original as the cause using exception chaining (raise
TimeoutError(f"Timeout waiting lock: {lock_file}") from e) so the original
FileExistsError context (including where it occurred) is preserved; this touches
the except FileExistsError block that references started, timeout_sec, lock_file
and time.monotonic().

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@builder/penv_setup.py`:
- Around line 163-178: The import check currently only validates a small
hardcoded set of packages in the subprocess.run block and can miss missing deps;
update the cache validation to programmatically import all packages from the
python_deps list (or at least include the missing critical packages like
"cryptography", "intelhex", "pyelftools") instead of the fixed "certifi,
platformio, rich, yaml" string so the subprocess invocation using penv_python
verifies every dependency; modify the code that builds the "-c" import string
(or loop over python_deps) and keep the same exception handling around
subprocess.run to return False on failure.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: ab1f90aa-7b5e-43a6-9520-9e9d4151c0bc

📥 Commits

Reviewing files that changed from the base of the PR and between cef91da and 6e161da.

📒 Files selected for processing (1)

builder/penv_setup.py

coderabbitai · 2026-03-20T06:23:30Z

+    try:
+        subprocess.run(
+            [
+                penv_python,
+                "-c",
+                "import certifi, platformio, rich, yaml",
+            ],
+            check=True,
+            stdout=subprocess.DEVNULL,
+            stderr=subprocess.DEVNULL,
+            timeout=10,
+        )
+    except (subprocess.CalledProcessError, subprocess.TimeoutExpired, FileNotFoundError):
+        print("[penv] Dependency cache invalidated: required Python packages are missing from penv")
+        return False
+


⚠️ Potential issue | 🟡 Minor

Cache validation only verifies a subset of dependencies.

The import check validates certifi, platformio, rich, yaml, but python_deps includes many more critical packages (cryptography, intelhex, pyelftools, etc.). If one of the unchecked packages is missing or corrupted, the cache would still be considered valid, potentially causing build failures later.

Consider either expanding the import check to include more critical packages, or documenting why this subset is sufficient.

💡 Suggested improvement

         subprocess.run(
             [
                 penv_python,
                 "-c",
-                "import certifi, platformio, rich, yaml",
+                "import certifi, platformio, rich, yaml, cryptography, intelhex",
             ],
             check=True,
             stdout=subprocess.DEVNULL,
             stderr=subprocess.DEVNULL,
             timeout=10,
         )
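
The alternative mentioned in the comment, checking a list of modules instead of a fixed string, could be sketched as follows. This is a hypothetical helper, not part of the PR. It takes import names rather than distribution names, because the two can differ (for example, `pyyaml` is imported as `yaml` and `pyelftools` as `elftools`), so the caller is assumed to maintain that list separately from `python_deps`.

```python
import subprocess

# Runs in the target interpreter; module names arrive as command-line arguments.
_CHECK_CODE = (
    "import importlib, sys\n"
    "for name in sys.argv[1:]:\n"
    "    importlib.import_module(name)\n"
)


def imports_ok(python_exe: str, module_names: list, timeout: float = 10) -> bool:
    """Return True if every module imports cleanly under python_exe."""
    try:
        subprocess.run(
            [python_exe, "-c", _CHECK_CODE, *module_names],
            check=True,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except (subprocess.CalledProcessError, subprocess.TimeoutExpired, FileNotFoundError):
        return False
    return True
```

Passing the names as arguments avoids building Python source from strings. The trade-off is that importing many packages takes longer, so the timeout may need to grow with the list.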

🧰 Tools

🪛 Ruff (0.15.6)

[error] 164-164: subprocess call: check for execution of untrusted input

(S603)

Jason2866 · 2026-03-20T10:01:38Z

No, I will not add this complex logic, which will surely introduce problems.

TinyuZhao · 2026-03-20T10:11:06Z

Do you have a better way to solve this problem?

TinyuZhao · 2026-03-20T10:12:42Z

The wait time before compilation is really frustrating, especially when conducting hardware testing.

Jason2866 · 2026-03-20T10:30:43Z

Setting the offline flag does solve this. pioarduino is Open Source. Fork and implement anything you want.

TinyuZhao · 2026-03-20T10:50:08Z

> Setting the offline flag does solve this. pioarduino is Open Source. Fork and implement anything you want.

All right, then. What kind of computer are you using? I don’t think it has anything to do with the specs, though. That’s really strange.

Jason2866 · 2026-03-20T10:59:58Z

pioarduino is tested and working with macOS, Linux (different ones) and Windows. Developing on macOS.

TinyuZhao · 2026-03-20T15:05:26Z

> pioarduino is tested and working with macOS, Linux (different ones) and Windows. Developing on macOS.

I'm also on macOS. Could you tell me which version of uv you're using?

Jason2866 · 2026-03-20T16:53:07Z

uv 0.10.11 (006b56b12 2026-03-16)

TinyuZhao added 2 commits March 20, 2026 10:21

Add a log of the time taken for each phase.

a24324d

Add cross-process cache for Python dependency checks.

6675b0d

coderabbitai Bot reviewed Mar 20, 2026

TinyuZhao mentioned this pull request Mar 20, 2026

The wait before compilation is quite frustrating. #423

Closed

Avoid false cache hit after penv recreation.

6e161da

coderabbitai Bot reviewed Mar 20, 2026

Jason2866 closed this Mar 20, 2026
