-
-
Notifications
You must be signed in to change notification settings - Fork 407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: dq_category_specific_ingredient_percent_2 #9606
Conversation
|
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## main #9606 +/- ##
==========================================
+ Coverage 49.28% 49.31% +0.02%
==========================================
Files 66 66
Lines 20546 20560 +14
Branches 4946 4951 +5
==========================================
+ Hits 10126 10139 +13
Misses 9132 9132
- Partials 1288 1289 +1 ☔ View full report in Codecov by Sentry. |
My only suggestion would be seeing if there's an existing way to say "yes, this fails a data quality check for category, but no; it is not because the data is wrong". This would cater for scenarios like https://www.npr.org/2020/10/01/919189045/for-subway-a-ruling-not-so-sweet-irish-court-says-its-bread-isnt-bread or https://www.delish.com/food-news/a49216/things-you-didnt-know-about-pringles/ (Potato Chips made from dried potato vs potato crisp vs potato chip)
Otherwise, this somewhat implies that categories can only be derived from regulations |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me
Yes, this is a good point. We really would like to have that. It is not only for this PR, but more generic. To be able to shutdown alerts that are false positives. |
What
Add quality facets for jams having too small fruit quantity:
errors:
info:
Following comment from @CharlesNepote and @aleene, I tried (maybe not perfectly done) to write the thresholds values directly in the taxonomy.
This is only for specific ingredients, only for lower than the value provided. If it works well, in future PR, it could be generalized for ingredients and nutriments and for maximal values (for labels it can be minimum (sugar, for example) or maximum (fibers, for example)). Also, to do eventually in the future, detect if jam/jelly should be extra-jam/extra-jelly.
Screenshot
Related issue(s) and discussion
Part of #1414