Suggestions for vocabularies to do: waterBody, island and island group and sampleSizeUnit and organismQuantityType #145

Open
ManonGros opened this issue Jul 3, 2024 · 8 comments
Labels
content Label for issue concerning vocabulary content difficult Not going to be an easy one

Comments

@ManonGros
Collaborator

Thanks Cecilie for your presentation!

I think that it would be great to have a vocabulary for waterBody (to enable discovery and use of freshwater and marine data).
Similarly, if there could be vocabularies for island and island group, the community might appreciate it.

One (difficult) vocabulary that would be great to have would be units, especially for sampleSizeUnit and organismQuantityType. It would facilitate interoperability of the data provided. I don't know how doable this is, but it would help a lot in analysis.

@ManonGros ManonGros added enhancement New feature or request difficult Not going to be an easy one and removed enhancement New feature or request labels Jul 3, 2024
@ymgan

ymgan commented Jul 4, 2024

Hey, I don't know if there is any plan to also have a vocabulary for the measurementUnit of the eMoF extension. OBIS recommends using eMoF for sampleSizeUnit, and the BODC P06 collection is the one we use for measurement units and their IDs. If GBIF decides to have a vocabulary for units later on, perhaps the P06 collection can be helpful!

Cheers

@CecSve CecSve modified the milestone: bb Jul 4, 2024
@ManonGros ManonGros added the content Label for issue concerning vocabulary content label Oct 10, 2024
@ManonGros
Collaborator Author

Suggested vocabulary for waterBody, from the draft of the Freshwater Guide: "Recommended best practice is to use a controlled vocabulary such as the Getty Thesaurus of Geographic Names."
See also: gbif/doc-freshwater-data-publishing-guide#17

@CecSve
Collaborator

CecSve commented Jan 16, 2025

waterBody relevant vocabulary: http://vocab.nerc.ac.uk/collection/C19/current/

@rubenpp7

rubenpp7 commented Jan 16, 2025

Marine Regions offers controlled terms for islands, water masses (waterBody?), archipelagos (island groups) and many more oceanic and coastal objects.

They also have some freshwater objects such as rivers and lakes, although I'm not sure how well maintained the freshwater part is, since they are mainly dedicated to the marine realm.


Regarding organismQuantityType, OBIS recommends using terms from the P01 collection of BODC (e.g. https://github.com/EMODnet/EMODnetBiocheck/blob/576fa36aa600ae0ca4e4c4853fe110d7a295bb75/files/workingEnvironment.R#L37-L57).

P01 terms are quite specific, but if something broader is preferred, the BODC semantic model links those P01 terms to terms in S06 for parameters.

P01 is also a huge collection, but one can filter for only the biotic parameters by using the related SeaDataNet biological format biotic parameters term.
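
As a rough sketch of how the BODC/NERC collections named in this thread could be referenced side by side, the snippet below builds collection URLs following the pattern of the C19 link posted above. It assumes that P06, P01 and S06 follow the same `collection/<ID>/current/` pattern; the function name and the role descriptions are illustrative only, taken from how each collection is described in this thread.

```python
# Base taken from the C19 link shared earlier in this thread.
NVS_BASE = "http://vocab.nerc.ac.uk/collection"

# Roles as described in this thread (not official collection titles).
COLLECTION_ROLES = {
    "C19": "waterBody candidates",
    "P06": "measurement units (used by OBIS for measurementUnit)",
    "P01": "specific parameters (used by OBIS for organismQuantityType)",
    "S06": "broader parameter terms linked from P01",
}


def nvs_collection_url(collection_id: str, version: str = "current") -> str:
    """Build a collection URL, assuming the same pattern as the C19 link."""
    cid = collection_id.strip().upper()
    if not cid.isalnum():
        raise ValueError(f"unexpected collection id: {collection_id!r}")
    return f"{NVS_BASE}/{cid}/{version}/"


if __name__ == "__main__":
    for cid, role in COLLECTION_ROLES.items():
        print(f"{cid}: {nvs_collection_url(cid)}  ({role})")
```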

@sformel-usgs

For waterBody, island and island group, consider also mapping to the GCMD keywords. These are used by NOAA and NASA, both of which are OBIS and GBIF contributors.

Term browser: https://gcmd.earthdata.nasa.gov/KeywordViewer/

  1. waterBody could start at Oceans and drill down into the terms below it (both json and rdf are available): https://gcmd.earthdata.nasa.gov/kms/concept/ff03e9fc-9882-4a5e-ad0b-830d8f1186cb?format=json

  2. Terms relevant to island and island group are confusing in GCMD. Some are listed under Oceans, like Canary Islands, and others under Continents, like Prince Edward Island. We could perhaps work with the GCMD team to make sure we identify them all.
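
To illustrate the "start at Oceans and drill down" idea, here is a minimal sketch of a depth-limited walk over the concept hierarchy. The URL pattern comes from the link above; the JSON field names (`prefLabel`, `narrower`, `uuid`) are assumptions about the response shape and should be checked against a real response. The `fetch` callable is a hypothetical hook (for example a thin wrapper around `urllib.request.urlopen` plus `json.load`), injected so the walk can be tried without network access.

```python
from typing import Callable, Iterator

# URL pattern taken from the concept link above.
KMS_CONCEPT_URL = "https://gcmd.earthdata.nasa.gov/kms/concept/{uuid}?format={fmt}"


def concept_url(uuid: str, fmt: str = "json") -> str:
    """Return the concept URL for a given UUID and format."""
    return KMS_CONCEPT_URL.format(uuid=uuid, fmt=fmt)


def walk_concepts(
    root_uuid: str,
    fetch: Callable[[str], dict],
    max_depth: int = 3,
) -> Iterator[tuple[int, str, str]]:
    """Yield (depth, uuid, label) depth-first, starting at root_uuid.

    Assumes each fetched concept is a dict with an optional "prefLabel"
    string and an optional "narrower" list of {"uuid": ...} entries.
    """
    seen: set[str] = set()
    stack = [(0, root_uuid)]
    while stack:
        depth, uuid = stack.pop()
        if uuid in seen:  # guard against cycles or repeated children
            continue
        seen.add(uuid)
        concept = fetch(concept_url(uuid))
        yield depth, uuid, concept.get("prefLabel", "")
        if depth < max_depth:
            # Reverse so children come off the stack in listed order.
            for child in reversed(concept.get("narrower", [])):
                stack.append((depth + 1, child["uuid"]))
```

Each yielded row could then be reviewed as a candidate waterBody term, with the depth indicating how far below Oceans it sits.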

@ben-norton

Two notes.

  1. GCMD is a great thesaurus, but it has a few problems, and you describe one of them. The hierarchy Continent > North America > Canada > Prince Edward Island doesn't make a lot of sense. Central America and the United States of America are siblings of Canada, which doesn't make much sense either. It's just not consistent. Placing Chukchi Sea under Arctic Ocean makes thematic sense (both are bodies of water, one encapsulating the other), but grouping the Canary Islands with the Gulf of Mexico means the only shared characteristic is geographic location, which isn't very meaningful.
  2. Water bodies are tough. EnvThes puts ocean under marine ecosystem with a good definition (https://vocabs.lter-europe.net/envthes/en/page/21811), a different approach from GCMD's. AGROVOC places oceans as a type of basin (https://agrovoc.fao.org/browse/agrovoc/en/page/c_f6b144da), which I think is rather clever. GEMET has a good definition, but its structure groups things into "superclasses"; there, river, lake, and ocean belong to the Hydrosphere group. All of these are much more meaningful than GCMD.

@sformel-usgs

@ben-norton I completely agree that GCMD shouldn't be the primary vocabulary. I just wanted to point out that it would be valuable to map those terms as part of this controlled vocabulary because we know that some contributors will be using GCMD terms.
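
A minimal sketch of what such a mapping could look like: each concept in the primary vocabulary carries the labels that other sources use for it, and a lookup resolves a contributor's term back to the primary concept. All keys and labels below are hypothetical placeholders for illustration, not actual entries from GCMD or any GBIF vocabulary.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Concept:
    key: str    # identifier in the primary vocabulary (hypothetical)
    label: str  # preferred label in the primary vocabulary
    external: dict[str, str] = field(default_factory=dict)  # source -> label used there


def _norm(text: str) -> str:
    """Collapse whitespace and ignore case so near-identical strings match."""
    return " ".join(text.split()).casefold()


def build_lookup(concepts: list[Concept]) -> dict[tuple[str, str], str]:
    """Index every (source, label) pair, including the primary label."""
    lookup: dict[tuple[str, str], str] = {}
    for concept in concepts:
        labels = {"primary": concept.label, **concept.external}
        for source, term in labels.items():
            lookup[(_norm(source), _norm(term))] = concept.key
    return lookup


def resolve(lookup: dict[tuple[str, str], str], source: str, term: str) -> Optional[str]:
    """Return the primary concept key for a term from a given source, if mapped."""
    return lookup.get((_norm(source), _norm(term)))


# Placeholder entries; real mappings would need to be curated.
CONCEPTS = [
    Concept("chukchiSea", "Chukchi Sea", {"GCMD": "CHUKCHI SEA"}),
    Concept("canaryIslands", "Canary Islands", {"GCMD": "CANARY ISLANDS"}),
]
LOOKUP = build_lookup(CONCEPTS)
```

With this shape, the primary vocabulary stays independent of GCMD's hierarchy, while data that already uses GCMD terms can still be reconciled.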

@CecSve
Collaborator

CecSve commented Feb 3, 2025

Thanks @sformel-usgs and @ben-norton for providing links to sources that might be valuable to reconcile.

6 participants