Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Creating linkages between concept content and related concepts #88

Open
ronaldtse opened this issue Oct 4, 2019 · 2 comments
Open

Creating linkages between concept content and related concepts #88

ronaldtse opened this issue Oct 4, 2019 · 2 comments
Labels
enhancement New feature or request

Comments

@ronaldtse
Copy link
Member

From @dr-shorthair :

https://www.geolexica.org/concepts/12/
For example, each of Note 1 and Note 2 refer to other concepts from the lexicon. These should be hyperlinks. Missed opportunity …

This is something that we hope to improve on but due to the TC 211 MLGT input being an Excel file, it can be cumbersome to create direct linkages through inference of text.

For example, suppose we have the concepts “coordinate” and “coordinate reference system”. During automatic parsing of the content, presented with “coordinate reference system” we cannot be sure whether the usage of “coordinate” is of the former or the latter.

We will need to evolve out of the Excel file to do something like this. This sort of problem also applies to handling math; Excel can’t cut it.

Any suggestions?

@ronaldtse ronaldtse added the enhancement New feature or request label Oct 4, 2019
@dr-shorthair
Copy link

I definitely agree that Excel is not capable of serving as the point-of-truth for all this.
My preference would be to move to a semantic platform - start with SKOS which allows for skos:related and sub-properties. You'll probably find you need to define further sub-properties in due course.

But the initial transformation is likely to be painful. I had an initial go at it with Andrew Jones about 3 years ago, but didn't have funding to pursue it properly. There are some Excel-->RDF and CSV-->RDF pipelines available to get things started, but I'm sure there would be a big manual cleanup involved as well. Might be a good student project somewhere?

@ronaldtse
Copy link
Member Author

@dr-shorthair Geolexica already transforms all the Excel data into a "term YAML" format; it's just not displayed or served under Geolexica because of the TMG's fear that someone will import that file.

It is now super easy to generate SKOS from the term YAML file (that is, as long as the TMG agrees that it's okay for people to bulk download the data).

In fact, Reese and I already cleaned up as much as we could of the MLGT/terminology repository (the source data) during the first import. Machine-readability is already a solved problem, what's remaining here is policy... 😉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants