Apply migrations based on migrator filename by h-vetinari · Pull Request #2369 · conda-forge/conda-smithy

h-vetinari · 2025-08-22T03:23:11Z

Currently the logic of migration application takes into account migrator_ts in two places. For one, the ordering of how different migrators are applied

conda-smithy/conda_smithy/configure_feedstock.py

Line 908 in 51c64d0

migration_variants.sort(key=lambda fn_v: (fn_v[1]["migrator_ts"], fn_v[0]))

which has become relevant for how we're sorting the migrations for 3.13t (nogil) and 3.14, due to the way how additional_zip_keys works. The second place is to determine whether migrations are applied (between local ones and the ones from the pinning) or deleted:

conda-smithy/conda_smithy/configure_feedstock.py

Line 2758 in 51c64d0

elif ts in migrations_in_cfp:

This caused #2367, conda-forge/conda-forge-pinning-feedstock#7686 but more importantly #2368: to base the application of migrations on these timestamps is just very confusing.

And it's also completely unnecessary, because we have a completely consistent model that we already use most of the time: that is, migrations get applied if the migrator file in .ci_support with the filename foo.yaml matches a migrator of that filename in the global pinning. There are overrides based on use_local and migration_number, but that's the baseline. So to avoid the effect of these timestamps on migrator application, just make that choice based on the migrator filename in all cases.

Closes #2368

Very likely also addresses #2336

h-vetinari · 2025-08-22T04:20:43Z

This might actually also solve #2336

isuruf · 2025-08-22T09:21:23Z

+* Avoid using migration timestamps to determine whether migrations should be applied. The logic for ``use_local`` and ``migration_number`` remains unchanged, but migrations are now uniformly applied based on the name of the migrator file, and timestamps are only used to order the application of migrators (#2368).
+


Let's use both the filename and the timestamp. It's hard to keep track of which filenames were used before, that's why the timestamp was used as a unique identifier.

Can you explain how you envision this? To me it still sounds like an inconsistent (or at least very hard-to-follow) mental model. What I'd like is a process that's as simple as:

if the filename appears in https://github.com/conda-forge/conda-forge-pinning-feedstock/tree/main/recipe/migrations and the feedstock-local .ci_support/migrations, then the respective migrator gets applied during rerendering.

(handle use_local: and migration_number on top of that)

From what I can tell, we'd never need to care about filenames which occurred in the past, only those that are directly present in the current pinning.

we'd never need to care about filenames which occurred in the past

Let's say we had a migrator foo.yaml applied in the past and still present in the feedstock. This migrator was closed and deleted in conda-forge-pinning-feedstock.

Later, another migrator with the same name foo.yaml is applied. Then conda-smithy will consider that old foo.yaml still present in the feedstock is the same as the new foo.yaml and use the new migrator as that file is newer and has updates.

But isn't that how it should work? The foo.yaml in the global pinning takes precedence, unless overridden by use_local: true.

If the feedstock had been rerendered between the deletion of the old foo.yaml and the creation of the new foo.yaml (both from the POV of the global pinning), the old one would have been deleted in the feedstock anyway. So the fact that it gets replaced with the new one (if it exists) to me sounds like a feature, not a bug

It's a feature, yes, but it also means that we have to keep track of what was used before to not accidentally use this feature. My example was about two completely different migrators with the same name foo.yaml and they should not override each other.

The REST API will find up to 4,000 repositories that match your filters and return results from those repositories.

The check would fail on a single hit in exactly the same way as for 10'000 hits. The upper limit is irrelevant for the check I have in mind

I am almost 100% sure the text implies github won't search all of our repositories. It says

To keep the REST API fast for everyone, we limit the number of repositories a query will search through.

They are not limiting the results, but instead the number of repos searched.

I think your interpretation is incorrect. The limitation is not about searching (which is based on a computed index), but about how many results can (reasonably) be fetched.

It literally says "repositories a query will search through." I don't know how else to parse that very explicit statement about what is limited since it literally says they limit the number of repositories they search.

The way I parse this is that the first part is colloquial, and the second part actually reveals the limitation ("finding", not "searching"; that distinction is irrelevant in common parlance, but not here). That argument is bolstered by the fact that their search API even provides an explicit value for whether the results are complete

Got { "total_count": 0, "incomplete_results": false, # <---- "items": [ ] }

In your interpretation, "incomplete_results" would always have to be true in orgs with more than 4000 repos. This is not what's observed in practice.

Finally, my interpretation is consistent with the observation that searching a pre-computed index does not provide a way to tell the query "please stop at X repos" (because the index itself has no notion of individual repos, just an amalgamation of their contents), it only provides a way to limit how many results are returned.

chrisburr · 2025-08-22T12:38:55Z

@h-vetinari looks like there is still more to fix for windows: conda-forge/pycurl-feedstock#35

But hard to tell from my phone, maybe the comma?

h-vetinari · 2025-08-22T12:43:32Z

But hard to tell from my phone, maybe the comma?

Yeah, that's why I had added 390bdd9 (now merged through #2370)

beckermr · 2025-08-22T13:03:14Z

I'll cut a release after this PR is done.

h-vetinari · 2025-08-23T04:54:01Z

I'll cut a release after this PR is done.

I think this PR will have to wait till we have a chance to do something like conda-forge/conda-forge-pinning-feedstock#7691.

If you want I can still do a reduced version of this that switches to determining uniqueness by a tuple of (filename, ts), but getting the timestamp out (my original goal) doesn't seem feasible right now.

beckermr · 2025-08-23T14:07:24Z

I think actually we can close this pr and merge my pr that ensures timestamps are unique.

h-vetinari · 2025-08-23T21:42:28Z

I think actually we can close this pr and merge my pr that ensures timestamps are unique.

We can merge the timestamp PR. I'll keep this open though - it's still a worthwhile change, just that we can't apply it yet.

… migrator_ts

h-vetinari requested a review from a team as a code owner August 22, 2025 03:23

This was referenced Aug 22, 2025

Rebuild for python 3.14 conda-forge/pycurl-feedstock#35

Merged

avoid reuse of old migrator_ts in 3.14 migration conda-forge/conda-forge-pinning-feedstock#7686

Merged

h-vetinari force-pushed the slash_timestamp branch from ddde66e to 969ef0f Compare August 22, 2025 03:54

h-vetinari added the don't squash-merge me label Aug 22, 2025

isuruf reviewed Aug 22, 2025

View reviewed changes

beckermr requested changes Aug 22, 2025

View reviewed changes

Comment thread conda_smithy/configure_feedstock.py Outdated

h-vetinari marked this pull request as draft August 22, 2025 12:07

h-vetinari changed the title ~~Apply migrations based on migrator filename; remove slashes from variant names~~ Apply migrations based on migrator filename Aug 22, 2025

h-vetinari force-pushed the slash_timestamp branch from 390bdd9 to 7f50007 Compare August 22, 2025 12:35

h-vetinari force-pushed the slash_timestamp branch 2 times, most recently from bbfa27d to ff9025f Compare August 22, 2025 12:46

h-vetinari mentioned this pull request Aug 22, 2025

Add check for migrator filename uniqueness conda-forge/conda-forge-pinning-feedstock#7691

Draft

This was referenced Aug 22, 2025

feat: first pass at uuids for migrations conda-forge/conda-forge-pinning-feedstock#7692

Closed

feat: test for unique timestamps conda-forge/conda-forge-pinning-feedstock#7693

Merged

h-vetinari mentioned this pull request Aug 23, 2025

Rebuild for python 3.14 conda-forge/multiprocess-feedstock#56

Merged

h-vetinari mentioned this pull request Aug 26, 2025

make migration application depend on (base) filename, not just migrator_ts #2374

Merged

switch migration application to be determined by (base) filename, not…

7fe8dc0

… migrator_ts

h-vetinari force-pushed the slash_timestamp branch from ff9025f to 9d59a9d Compare August 27, 2025 01:00

remove timestamp from migration logic and get_migrations_in_dir

8ad787f

h-vetinari force-pushed the slash_timestamp branch from 9d59a9d to 8ad787f Compare August 27, 2025 01:02

h-vetinari removed the don't squash-merge me label Aug 27, 2025

h-vetinari mentioned this pull request Aug 31, 2025

smithy applies migration not present in feedstock if timestamps match with local migration that should have already been deleted #2368

Closed

h-vetinari mentioned this pull request Nov 14, 2025

Smithy does not delete closed cudnn910 migration #2424

Closed

		* Avoid using migration timestamps to determine whether migrations should be applied. The logic for ``use_local`` and ``migration_number`` remains unchanged, but migrations are now uniformly applied based on the name of the migrator file, and timestamps are only used to order the application of migrators (#2368).

Uh oh!

Conversation

h-vetinari commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

h-vetinari commented Aug 22, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

h-vetinari Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

h-vetinari Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

chrisburr commented Aug 22, 2025

Uh oh!

h-vetinari commented Aug 22, 2025

Uh oh!

beckermr commented Aug 22, 2025

Uh oh!

h-vetinari commented Aug 23, 2025

Uh oh!

beckermr commented Aug 23, 2025

Uh oh!

h-vetinari commented Aug 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

h-vetinari commented Aug 22, 2025 •

edited

Loading

h-vetinari Aug 22, 2025 •

edited

Loading

h-vetinari Aug 22, 2025 •

edited

Loading