-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use hash, not experiment names in all DVC commands and as IDs in the table #3945
Comments
You can still get collisions on the short hash. Are you suggesting using the full hash as the ID? |
No need for this - one thing is to see some ambiguous text, another one is not being able to run an action (and overall this is DVC's issue, not extension's).
This is I'm less worried to be honest - this is indeed rare. |
Update the story a bit with more context. Set |
Made a PR in DVCLive to reduce the possibility https://github.com/iterative/dvclive/pull/576/files . If merged it becomes less priority, but still a bug - we can't use names as unique IDs. |
iterative/dvc#9490 has been merged on the DVC side. |
Put a |
The discussion in iterative/dvc#9485 is related |
I also ran into this issue when I wanted to remove an experiment that I had saved under the same name with respect to the latest branch HEAD. Basically I did a manual "rebase", but then used the "Experiments" context menu for convenience to remove the original experiment.
The experiment that was deleted was the latest, though I right-clicked on the older one. If I had done it using the CLI, it would have been obvious that the name was ambiguous. But using the VS Code context menu I assumed it would then know which one I want to delete given I right-clicked on it's entry in the table. It seems if one was able to use
For now, I have to remove the old experiment before creating the updated new one:
|
E.g.
/usr/local/bin/python -m dvc plots diff workspace testy-leno
.We should not be using
testy-leno
- it's ambiguous.Only if we don't know the hash yet it's fine to use name (or even better - some other id? how does DVC handle a collision like this? how does it identify them)?
We can't use these names as IDs in the table either. Now it seems action selection and other stuff works for duplicates simultaneously.
Read this thread for the context: https://iterativeai.slack.com/archives/CB41NAL8H/p1684707770045189 . We easily hit duplicates since that repo sets random seed (quite regular story in ML) and our experiment names repeat as we add more commits. Here is the logic how they are generated:
https://github.com/iterative/dvclive/blob/main/src/dvclive/dvc.py#L150
The text was updated successfully, but these errors were encountered: