Skip to content

Conversation

@felixschmitz
Copy link
Collaborator

The main function inside dataset_merging/helper.py is the user-facing interface for creating the dataset the user is interested in. Hence, the function should be robust, yet lean.

The function depends on user-specified input (variables and timeframe of interest) and output from previous tasks (mainly the "merged metadata mapping" that maps variable names to data file names).
The task needs to depend on this mapping for the completeness of the DAG, yet the user should not worry about specifying it for creating a dataset.

@felixschmitz felixschmitz requested a review from hmgaudecker July 23, 2025 15:45
@felixschmitz felixschmitz self-assigned this Jul 23, 2025
@felixschmitz
Copy link
Collaborator Author

This is just a first suggestion. Looking for feedback on the function create_dataset inside dataset_merging/helper.py.
Tests and commit history need to be fixed.

Copy link
Collaborator

@hmgaudecker hmgaudecker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Had a look, but lacking the context... 🤷 No rush.

@felixschmitz felixschmitz linked an issue Jul 31, 2025 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

ENH: Simplify create_dataset_from_variables

3 participants