-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Higher Geog fix work (formerly Many islands lack country) #7660
Comments
The Aleutian Islands are a total mess with this issue. Country=NULL includes many 1000s of specimens georeferenced over water, even very near land like this one, https://arctos.database.museum/guid/UAM:Mamm:113877. Some specimens are collected in international waters and country=NULL is valid but that is a tiny minority of specimens being classified by Arctos as country=NULL. |
I agree we have to fix probably best by island and group by group. I think I'll start with a little GIS work to identify EEZ for country vs international waters, and of course consult with collections. I think the new loc_attribute for waterbody will still allow association with specific bodies of seas and oceans but many collections will want the primary hg to be to country. On the Geog committee project now. |
country + state are 100% expected for all the records I manage. I have many searches that are limited by asserted state = Alaska and would be most unhappy if some were missed because of this. |
If "this" involves geography and you think something should be different, #7666. |
We're starting work on this at MVZ, specifically the locs with "North Pacific Ocean, Bering Sea" as HG. Aleutians are firmly part of US:AK so starting there by changing HG to US:AK. We're still fixing and checking out how best to do in bulk, so keep you posted Dusty if we need help. This is tied to the new loc attribute of "waterbody" ArctosDB/code-table-work#83 |
USGS has boundary shapefiles, https://www.sciencebase.gov/catalog/item/59d5b565e4b05fe04cc53a91, that look fully inclusive of all islands and surrounding waters in AK at least. Can all localities falling within the Alaska boundary be given Country=USA and State=AK ? |
Thanks Aren-- looks like I have your ok to do this for locs in AK waters, so I'm going to go ahead. The shapefile will be helpful! so far most of these are only MVZ and UAM records only and agreed, this is just a clean-up task. |
@dustymc I'm starting a spreadsheet for a bulk update where we can change the HG and the spec_loc. What are the minimum fields needed for that?
Do you need/want localityID? anything else? less or more? Our current working spreadsheet has a lot more since we are verifying with verbatim and locality attributes before any updates. For most we are returning to the verbatim locality but cleaning as needed. |
Assume those will be gone/merged before you're done typing (because they probably will be).
Ditto.
Maybe better to do this in smaller batches? Spreadsheets like this seem to always find a way to clash with themselves, but I'm up for whatever. I think just HIGHER_GEOG is sufficient, but see above, I'm always surprised.... Also first line in that spreadsheet Bristol Bay, no specific locality that'll break any geolocate-like-thing, and feeding those are (most of) why specloc exists. Also no specific locality if we're cleaning anyway..... |
ok that's perfect. |
@dustymc For Monday: first CSV for HG batch updating (109 rows to load). Let us know if anything needs tweaking format-wise. Kat applied some python to do overhaul but I am still checking every one. I see some dup localities but am ignoring for now and will fix in another pass (probably with the usual Arctos tools). thx! |
@mkoo updates from CSV in #7660 (comment) complete. |
OK here's batch #2 and #3 @dustymc this is the rest for Bering Sea. We tried to keep to origianl spec_loc as much as possible and make the localities consistent so we can merge dups more easily later if desired. Several localities with complicated info were manually edited to make sure all the components were captured (orig forms in attributes and verbatim were checked and left as is for tracking) ThX! |
HG cleanup- batch2.csvI was not able to find a locality ID for these, I removed them from the update: These failed with UPDATE 1834 successfully updated HG cleanup batch3.csvI was not able to find a locality ID for these, I removed them from the update: These failed with UPDATE 1194 successfully updated |
Thanks Dusty, I will go over the rest manually-- a lot are just the same 'no specific locality' business so probably best to review all the geog details anyway. |
I'm closing this issue since the original batch work is done and whatever failed was manually fixed. |
Issue Documentation is http://handbook.arctosdb.org/how_to/How-to-Use-Issues-in-Arctos.html
Describe the bug
If I query arctos for country null, in MVZ I get 8763 records in our voucher collection. There are a lot that are unused catalog numbers, truly pelagic records. But, if you map them, you see that thousands are plotted, many of them near shore islands to Mexico and Alaska.
I recall this was a github issue at some point, but I have not found that issue. What is the status of fixing this? If someone queried for country = Mexico, they will not get these island records.
To Reproduce
Steps to reproduce the behavior:
query for country = NULL
Expected behavior
Islands that are in a country should be findable by that country.
Screenshots
attached
Priority
Kinda high since users are potentially not seeing all of our records.
The text was updated successfully, but these errors were encountered: