-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement consistent naming and language for talking about scoring rules #476
Comments
Decisions:
|
Should this be |
Phew... Probably |
I'm going to close this in favour of #610 to limit the number of open issues that talk about the same thing... |
Update
We took the following decisions on what needs to be implemented
rules_point
,rules_binary
etc.score()
isrules
pairwise_comparison()
isscore
Things that need to be done:
apply_metrics
toapply_rules
Discussion:
We need a consistent naming scheme for
score()
The two contenders were
Recently discussed this with @sbfnk.
Proposal 1
"metric" would denote
score()
We would use sentences such as "you can pass a metric to
score()
, or "you can call a metric directly".The proposal doesn't really have a good term to talk about the output of a "metric" (i.e., a scoring function). Calling the output of a "metric" also "metric" would maybe be a bit confusing.
The argument within
score()
would be calledmetrics
The argument within
add_pairwise_comparison()
that denotes what the pairwise comparison should be computed for would be calledmetric
.Default metrics could be the output of a function called
default_metrics("point")
ormetrics_point()
Proposal 2
"scoring rule" would denote
score()
"a score"
We would use sentences such as "you can pass a list of scoring rules to
score()
, or "you can call a scoring rule directly". And "scores are computed by scoring rules".The argument within
score()
would be calledscoring_rules
The argument within
add_pairwise_comparison()
that denotes what the pairwise comparison should be computed for would be calledscoring_rule
score
?for
? ("What score should pairwise comparisons be computed for?")Default scoring rules could be the output of a function called
default_scores("point")
ordefault_rules("point")
orrules_point()
Alternative proposals
The text was updated successfully, but these errors were encountered: