Reconciliation API returning indexes as scores? #8

Open
ianengelbrecht opened this issue Oct 21, 2024 · 3 comments

Comments

@ianengelbrecht

Sorry to keep hammering on here...

Using the recon service from OpenRefine, if only one candidate is returned we get a recon score for that candidate which looks like a percentage, but if more than one candidate is returned the score looks like it's the index of the candidate in the result (1, 2, 3, etc).

@rogerhyam
Owner

Yes these scores are somewhat arbitrary. The standard doesn't define a measure and when you are doing complex things like name matching it is tough to come up with something meaningful. What counts more or less highly towards the score?

https://www.w3.org/community/reports/reconciliation/CG-FINAL-specs-0.2-20230410/#a-note-on-candidate-retrieval-and-scoring

I just give the reverse ranking number if there are multiple candidates and 100 (top score!) if it is the only one.
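For anyone reading along, here is a minimal Python sketch of the scheme as described in the sentence above. It is not the service's actual code: the function name `assign_scores` is hypothetical, and the exact numbering (first of n candidates scores n, last scores 1) is an assumption about what "reverse ranking number" means.

```python
def assign_scores(candidates):
    """Return (candidate, score) pairs for a ranked list of candidates.

    Assumed scheme: a lone candidate gets 100; otherwise each candidate
    gets its reverse rank, so the first of n scores n and the last scores 1.
    """
    n = len(candidates)
    if n == 1:
        return [(candidates[0], 100)]
    # enumerate() starts at 0, so the first candidate receives n - 0 = n.
    return [(c, n - i) for i, c in enumerate(candidates)]


# Placeholder candidate labels, not real query results.
print(assign_scores(["only candidate"]))
print(assign_scores(["first", "second", "third"]))
```

This would explain why a single result looks like a percentage while multiple results look like small integers: the two cases are on unrelated scales.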

I'm open to suggestions of what the scores should be based on. The main use case is simply to present the list to the user so anything other than alphabetical will probably get kickback!

@ianengelbrecht
Author

Could I suggest Levenshtein distance reported as a percentage? That would give some sense at least of how close the match is.

[image]
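One way this suggestion could be sketched in Python is below. The attached image is not recoverable here, so the normalization used (dividing the edit distance by the length of the longer string) is an assumption rather than the exact formula proposed, and the names `levenshtein` and `similarity_percent` are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn one string into the other."""
    if len(a) < len(b):
        a, b = b, a  # keep the row buffer as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def similarity_percent(a: str, b: str) -> float:
    """Similarity from 0 to 100, assuming normalization by the longer string."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100.0  # two empty strings are treated as identical
    return 100.0 * (1 - levenshtein(a, b) / longest)


print(similarity_percent("abcd", "abcf"))  # one substitution in four characters
```

With this normalization every candidate is scored on the same 0 to 100 scale regardless of how many candidates come back.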

@ianengelbrecht
Author

Okay, so I've been using this form of Levenshtein to compare results from WFO with the names I have, and it's relatively useful for filtering out matches below a certain threshold. It doesn't work for autonyms, as autonyms don't have authors in WFO, but hopefully we'll have those at some point in the future.
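The filtering step might look roughly like the Python sketch below. The function name `filter_by_threshold`, the `(name, score)` pair layout, and the cutoff of 80 are all hypothetical; no threshold value is stated here, and the scores in the example are made-up illustrative numbers.

```python
def filter_by_threshold(scored, threshold=80.0):
    """Keep (name, score) pairs whose score is at or above the threshold.

    `scored` is assumed to hold percentage-style similarity scores (0-100).
    """
    return [(name, score) for name, score in scored if score >= threshold]


# Illustrative values only, not real WFO results.
scored = [("candidate A", 96.5), ("candidate B", 81.0), ("candidate C", 42.0)]
print(filter_by_threshold(scored))
```

A name string that includes an author compared against one that lacks it will lose points for every missing character, which is consistent with the autonym problem described above.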
