feat: Spellcheck benchmark dataset and evaluation algorithm #340

Merged
merged 79 commits into develop from spellcheck_validation_data
Jun 18, 2024

Conversation

jeremyarancio
Collaborator

@jeremyarancio jeremyarancio commented Apr 11, 2024

What

Creates the benchmark dataset and the evaluation algorithm used to evaluate the Spellcheck.

Benchmark

The benchmark is composed of 247 lists of ingredients from 3 data sources:

  • 30% of the old dataset of manually corrected lists of ingredients in French, from the previous work by Lucain W. Lists of ingredients left unmodified by the correction are removed.
  • 15 manually corrected lists of ingredients in different languages (used for prompt engineering with OpenAI on the Spellcheck task).
  • 100 lists of ingredients with the tag 50-percent-unknown, corrected with GPT-3.5 following the correction guidelines.

Argilla to validate benchmark

The lists of ingredients corrected with GPT-3.5 are checked in Argilla and modified where needed to comply with the spellcheck guidelines.

Evaluation algorithm

An evaluation algorithm is created to estimate the performance of the Spellcheck.
It computes the precision and recall of the corrections from three text sequences (original, reference, prediction), using tokenization and a sequence alignment algorithm.
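
To illustrate the idea, here is a minimal sketch of how precision and recall could be derived from an original, a reference, and a prediction. It is not the implementation merged in this PR: the function names (`tokenize`, `edits`, `precision_recall`) are hypothetical, the alignment uses Python's standard `difflib.SequenceMatcher` as a stand-in for whatever alignment algorithm the PR uses, and returning 1.0 when there is nothing to count is an assumed convention.

```python
import re
from difflib import SequenceMatcher


def tokenize(text):
    # Assumption: words and punctuation marks are separate tokens.
    return re.findall(r"\w+|[^\w\s]", text)


def edits(original_tokens, other_tokens):
    # Align the two token sequences and keep every non-equal span,
    # keyed by its position in the original.
    matcher = SequenceMatcher(a=original_tokens, b=other_tokens, autojunk=False)
    return {
        (i1, i2): tuple(other_tokens[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    }


def precision_recall(original, reference, prediction):
    original_tokens = tokenize(original)
    reference_edits = edits(original_tokens, tokenize(reference))
    predicted_edits = edits(original_tokens, tokenize(prediction))
    # A predicted edit is correct when the reference edits the same span
    # of the original and replaces it with the same tokens.
    true_positives = sum(
        1
        for span, replacement in predicted_edits.items()
        if reference_edits.get(span) == replacement
    )
    # Assumed convention: score 1.0 when the denominator is empty.
    precision = true_positives / len(predicted_edits) if predicted_edits else 1.0
    recall = true_positives / len(reference_edits) if reference_edits else 1.0
    return precision, recall


print(precision_recall(
    "farine de blé, sucr, huile de plame",   # original
    "farine de blé, sucre, huile de palme",  # reference
    "farine de blé, sucre, huile de plame",  # prediction
))  # (1.0, 0.5)
```

In this example the prediction fixes one of the two errors and introduces no wrong change, so precision is 1.0 and recall is 0.5.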

jeremyarancio and others added 26 commits May 13, 2024 17:13
- Evaluation is performed on the Sagemaker instance
- Evaluation data are sent to S3
- The Metaflow human evaluation step gets the evaluation data and pushes it to Argilla
@raphael0202 raphael0202 self-requested a review June 18, 2024 10:20
@raphael0202 raphael0202 merged commit 3c5cddf into develop Jun 18, 2024
@raphael0202 raphael0202 deleted the spellcheck_validation_data branch June 18, 2024 10:21
3 participants