-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Closed
Labels
feature requestRequesting a new featureRequesting a new feature
Description
Many issues require reconfiguration of stages and even pipelines:
- In Make
dvc runhandle files with same name but different path #973 @Hong-Xiang was asking about reusing (reconfigurable) pipelines. - Discord discussion with vern from 11/27/18: "it's annoying to write them all (stages) out by hand and then do it again for each color (parameter)."
- pipelines: parametrize using environment variables / DVC properties #1416 parametrize pipeline \ step - not config file, just parameters.
- How to manage repetitive
dvc runcommands (like unpacking of many zip files)? #1119 repetitive commands. I see a similarity with parametrizable commands where only a single output is in use and without creating a separate directory for each experiment (./output.pinstead ofgs1/output.p).
A concept of reconfigurable-stage should be introduced in DVC.
Open questions:
- how to pass config file (do we need multiple config files)?
- how to pass params (should we override config by the params or it is a separate concept)?
- how to specify input files?
- how to specify the output directory (
gs1/)? - how to specify an output file without an output directory ( see
./output.pinstead ofgs1/output.pfrom the above)? - how to make it work for pipelines?
- Should build cache repro: use build cache for deterministic stages #1234 be part of the solution? It will allow caching intermediate results of reusable stages (if step1 is the same in a few pipeline instances).
UPDATE: #1214 might be also related to this issue.
UPDATE2: Add a quote from vern and open question 7.
efiop, olveirap, tivvit, dberenbaum, alexvoronov and 22 more
Metadata
Metadata
Assignees
Labels
feature requestRequesting a new featureRequesting a new feature