
Conversation


@renovate renovate bot commented Dec 22, 2025

Note: This PR body was truncated due to platform limits.

This PR contains the following updates:

| Package | Change | Age | Confidence |
| --- | --- | --- | --- |
| mlx | `==0.21.0` → `==0.30.1` | age | confidence |
| mlx-lm | `==0.20.3` → `==0.30.2` | age | confidence |
| mlx-vlm | `==0.1.1` → `==0.3.9` | age | confidence |

Release Notes

ml-explore/mlx (mlx)

v0.30.1

Compare Source

Highlights

  • RDMA over Thunderbolt with the JACCL backend (macOS >= 26.2) (some numbers)
  • NAX with JIT so that they can be used in MLX Swift
  • CUDA improvements
    • Many improvements to SDPA (masking, T_q != T_kv)
    • Faster quantize/dequantize
    • QQMM to make use of faster tensor cores
    • Fix in col reduce speeds up training

What's Changed

New Contributors

Full Changelog: ml-explore/mlx@v0.30.0...v0.30.1

v0.30.0

Compare Source

Highlights

  • Support for Neural Accelerators on M5 (macOS >= 26.2)

What's Changed

New Contributors

Full Changelog: ml-explore/mlx@v0.29.0...v0.30.0

v0.29.4

Compare Source

🚀

v0.29.3

Compare Source

⏭️

v0.29.2

Compare Source

⬆️

v0.29.1

Compare Source

🚀

v0.29.0

Compare Source

Highlights

  • Support for mxfp4 quantization (Metal, CPU)
  • More performance improvements, bug fixes, features in CUDA backend
  • mx.distributed supports NCCL back-end for CUDA

What's Changed

New Contributors

Full Changelog: ml-explore/mlx@v0.28.0...v0.29.0

v0.28.0

Compare Source

Highlights

  • First version of fused sdpa vector for CUDA
  • Convolutions in CUDA
  • Speed improvements in CUDA normalization layers, softmax, compiled kernels, overheads and more

What's Changed

New Contributors

Full Changelog: ml-explore/mlx@v0.27.1...v0.28.0

v0.27.1

Compare Source

Highlights

  • Initial PyPI release of the CUDA back-end.
  • The CUDA back-end works well with mlx-lm:
    • Reasonably fast for LLM inference
    • Supports single-machine training and LoRA fine-tuning

What's Changed


Configuration

📅 Schedule: Branch creation - "before 10am on monday" in timezone Europe/Berlin, Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

👻 Immortal: This PR will be recreated if closed unmerged. Get config help if that's undesired.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.


claude bot commented Dec 22, 2025

PR Review: MLX Dependencies Upgrade (v0.21.0 → v0.30.1)

Summary

This PR upgrades the core MLX dependencies from quite old versions to the latest releases:

  • mlx: 0.21.0 → 0.30.1 (9 minor versions)
  • mlx-lm: 0.20.3 → 0.30.0 (9 minor versions)
  • mlx-vlm: 0.1.1 → 0.3.9 (significant version jump)

Code Quality & Best Practices ✅

Strengths:

  1. Automated dependency management - Using Renovate for tracking upstream changes is excellent
  2. Fixed versioning in setup.py - Pinning exact versions provides stability
  3. Range-based requirements.txt - Using >= allows flexibility while the exact versions are controlled in setup.py (see the sketch below)
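
A minimal sketch of the pinning convention described above; the setup() call and project name are hypothetical, and only the package versions mirror this PR:

```python
# Hedged sketch: exact pins in setup.py for reproducible installs, while
# requirements.txt can use looser ranges such as "mlx>=0.30.1".
from setuptools import find_packages, setup

setup(
    name="invoice-extractor",  # hypothetical project name
    packages=find_packages(),
    install_requires=[
        "mlx==0.30.1",
        "mlx-lm==0.30.2",
        "mlx-vlm==0.3.9",
    ],
)
```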

Potential Concerns:

  1. Large version jump - Jumping 9 minor versions in one go is risky. MLX has had significant changes including:

    • New RDMA over Thunderbolt support (macOS >= 26.2)
    • CUDA improvements (if relevant to future plans)
    • Neural Accelerator support for M5 chips
    • Many API refinements and potential breaking changes
  2. API compatibility risk - The code uses several MLX APIs that may have changed:

    • mlx_lm.generate() with sampler API (invoice_detector.py:461-468); see the sketch after this list
    • mlx_vlm.generate() return type (GenerationResult handling, invoice_detector.py:632-648)
    • mlx_lm.load() and mlx_vlm.load() signatures (model_manager.py:117-156)
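
For reference, here is a minimal sketch of the newer mlx-lm call pattern. The model path and sampler settings are illustrative assumptions, not taken from this repository, and keyword names should be checked against the installed mlx-lm version:

```python
# Hedged sketch: sampler-based text generation with newer mlx-lm releases.
# The model path is illustrative; verify argument names against the
# installed mlx-lm version.
from mlx_lm import load, generate
from mlx_lm.sample_utils import make_sampler

model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.3-4bit")

messages = [{"role": "user", "content": "Extract the total amount from this invoice text: ..."}]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

# Newer releases expect a sampler callable instead of a bare `temp` kwarg.
sampler = make_sampler(temp=0.2, top_p=0.9)
text = generate(model, tokenizer, prompt, max_tokens=512, sampler=sampler)
print(text)
```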

Potential Bugs & Issues ⚠️

Critical:

  1. mlx-lm sampler API change - Around v0.28, mlx-lm changed from temperature parameter to sampler-based generation. Your code already handles this (invoice_detector.py:458-459), which is good, but needs testing.

  2. mlx-vlm GenerationResult API - The code handles the new GenerationResult object (invoice_detector.py:648), but this should be verified against the new version; see the sketch below.
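
A hedged sketch of handling both return types defensively; the model path, prompt, and image path are illustrative, and the exact argument names may differ between mlx-vlm versions:

```python
# Sketch: handle mlx_vlm.generate() returning either a plain string (older
# releases) or a GenerationResult object with a .text attribute (newer ones).
# Model path, prompt, and image path are illustrative placeholders.
from mlx_vlm import load, generate
from mlx_vlm.prompt_utils import apply_chat_template
from mlx_vlm.utils import load_config

model_path = "mlx-community/Qwen2-VL-2B-Instruct-4bit"
model, processor = load(model_path)
config = load_config(model_path)

images = ["invoice_page_1.png"]
prompt = apply_chat_template(
    processor, config, "List the invoice number and total.", num_images=len(images)
)

result = generate(model, processor, prompt, images, max_tokens=512)

# Accept both the old (str) and new (GenerationResult) return types.
text = result.text if hasattr(result, "text") else result
print(text)
```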

Recommended Testing:

  • Test VLM-based invoice detection with the new mlx_vlm.generate() API
  • Test text LLM mode with the new mlx_lm.generate() sampler API
  • Verify model loading still works with both text and vision models
  • Check if any deprecation warnings appear (see the sketch below)
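
One way to surface deprecations during a smoke run is to escalate them to errors before importing the MLX stack; this is a generic standard-library pattern, not project code:

```python
# Turn DeprecationWarnings into errors so any deprecated MLX / mlx-lm /
# mlx-vlm API used by the pipeline fails loudly during a smoke test.
import warnings

warnings.simplefilter("error", DeprecationWarning)

import mlx.core as mx  # noqa: E402  (import after the filter is set)
import mlx_lm          # noqa: E402
import mlx_vlm         # noqa: E402

print("MLX stack imported with deprecation warnings treated as errors")
```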

Performance Considerations 🚀

Positive:

  • MLX 0.30.x includes significant performance improvements:
    • Faster RMS norm for small dimensions
    • 2-pass column reduce for CUDA (if applicable)
    • Faster general copy operations
    • Optimized quantize/dequantize operations
    • QQMM (quantized matrix multiplication) improvements

Expected Impact:

  • Invoice detection and extraction should be noticeably faster
  • Model loading may be more efficient with caching improvements

Security Considerations 🔒

Good:

  1. No direct security vulnerabilities in the dependency changes themselves
  2. MLX 0.30.x includes security fixes:
    • Heap-buffer-overflow fix in RandomBits::eval_cpu (#2877)
    • Proper error handling improvements

Recommendations:

  1. Review the full MLX changelog for any security-related fixes
  2. Ensure HuggingFace authentication still works correctly (no changes needed based on code review)

Test Coverage 📊

Concerns:

  1. No test changes in this PR - A dependency upgrade of this magnitude should include:
    • Regression tests to ensure existing functionality works
    • Tests for any new APIs being used
    • Integration tests for the invoice detection pipeline

Recommendation:
Before merging, run the full test suite:

make test

And specifically test:

  • tests/test_invoice_detector.py - Core VLM/LLM functionality
  • tests/test_model_manager.py - Model loading
  • End-to-end invoice processing with a real PDF

Migration Risks 🚨

Breaking Changes to Watch:

  1. Minimum Python version - MLX 0.30+ bumped minimum Python to 3.9 (already met)
  2. PyPI releases - Now includes CUDA ARM releases (not relevant for Apple Silicon)
  3. API changes - Several ops had signature changes (check if any affect your code)

Recommendations 📝

Before Merging:

  1. Run full test suite - Ensure all tests pass
  2. Manual testing - Test with real invoice PDFs
  3. Check for deprecation warnings - Run with verbose logging
  4. ⚠️ Consider gradual upgrade - If issues arise, consider upgrading in smaller steps (0.21→0.25→0.30)

After Merging:

  1. Monitor for any runtime issues in production
  2. Update documentation if any user-facing behavior changed
  3. Consider adding integration tests for the new MLX versions

Verdict

CONDITIONAL APPROVAL ✅ with testing required

This upgrade brings important performance improvements and bug fixes, but the large version jump requires thorough testing before merging. The code appears well-structured to handle the changes (good job on the sampler API handling!), but verification is essential.

Action Items:

  1. Run make test and ensure all tests pass
  2. Test with sample invoice PDFs using both VLM and text LLM modes
  3. Check logs for any deprecation warnings
  4. If any issues appear, consider incremental upgrade or code adjustments

Review generated following CLAUDE.md guidelines

@renovate renovate bot force-pushed the renovate/mlx-mlx-packages branch from cc363b3 to 86b3663 on January 6, 2026 at 01:12
@renovate renovate bot force-pushed the renovate/mlx-mlx-packages branch from 86b3663 to b30ffaf on January 6, 2026 at 05:31