Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix 178: Add data type checking and validation while performing agent plan action #428

Merged
merged 12 commits into from
Jul 7, 2023

Conversation

Sardhendu
Copy link
Contributor

@Sardhendu Sardhendu commented Jul 3, 2023

Add data validation and test cases

TODO PR Review:

  1. [DONE] Test plan with all available dataset
  2. [Not Required] Remove redundancy with data loading with code reusability.
  3. [Confirmed] confirm if the validation is at par with the different tasks.
  • Validates that all the label columns are Strict String: Pydatic BaseModel validation.
  • Validates Data format for all required data rows for seed.csv. Separate validation for each Task.
  • Adds validation that the column names in seed.csv matches the column names parsed from example_template.
  • Adds validation for parity between config/prompt/label and sed.csv label columns values.

src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/read_datasets.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/__init__.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/__init__.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
src/autolabel/data_loaders/validation.py Outdated Show resolved Hide resolved
@Sardhendu Sardhendu requested a review from nihit July 7, 2023 21:13
Copy link
Contributor

@nihit nihit left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

from autolabel.configs import AutolabelConfig

logger = logging.getLogger(__name__)
from typing import Union
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

redundant import, this is already imported on line 1

@Sardhendu Sardhendu changed the title Fix 178 Fix 178: Add data type checking and validation during plan Jul 7, 2023
@Sardhendu Sardhendu changed the title Fix 178: Add data type checking and validation during plan Fix 178: Add data type checking and validation while performing agent plan action Jul 7, 2023
@nihit nihit merged commit 6ed14a5 into refuel-ai:main Jul 7, 2023
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants