-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow variably ranked acceptedness of identifications (was: order identifications on catalog record) #3540
Comments
Thanks for creating this issue @Jegelewicz. Here's what I have listed in my 'manage collection' section for taxonomy ordering also - seems like there should be some connection between the two? I wonder if it's possible to have these with a "drag row here" box like with agents in the catalog record? This is helpful for ordering the sequence of ownership for objects. I'd like to do the same with the taxonomy related to the biological materials our items are made from, based on the abundance of the material on the piece. For this fish skin parka, I'd want "Parka" first, then "salmon" "caribou" and lastly "alder". |
"Impossible" isn't the right word, but it might be close enough - that would be incredibly complicated, and I don't think there's any way, no matter what resources we had, to make it predictable, at least not without completely breaking our taxonomy model.
That's not much problem, but it's significant development. It would support things like "based on the abundance of the material on the piece." It would also provide a nonmagical path to "cultural first" (even when the cultural item uses a name that was created for biology and is managed in some mineral-centric classification) - users could just put it there. It might be a workload increase - that would need discussed-->planned-->understood before proceeding. It also provides an elegant solution to the "more than one accepted" thing that comes up every now and then. For things that expect binary, order=1 could be treated as accepted and everything else as unaccepted. For those willing to embrace the gradient, you could have
I think I like it, but I'm not sure I'd want to use it if I just had a dead rat (complicated model, simple data), nor if I had https://arctos.database.museum/guid/MSB:Mamm:55245 (lots of about-equally-important IDs). That's possibly "just UI" but I'm not sure I see how just yet. |
Cool. I guess let's put it on the "wish list" and think/talk about how it might be developed and/or used by various collections. |
"More than one accepted ID" is also needed for environmental samples, and
ordering them by abundance or GenBank ID confidence, for example, would be
useful.
…On Thu, Mar 25, 2021 at 10:09 AM Angela Linn ***@***.***> wrote:
* [EXTERNAL]*
I think I like it,
Cool. I guess let's put it on the "wish list" and think/talk about how it
might be developed and/or used by various collections.
The "more than one accepted" thing is an interesting one for cultural
collections when we have the classic "unclassifiable object" and the
associated "unidentified object" where we could add some options that
people have guessed about but have not been confirmed.
[image: Screen Shot 2021-03-25 at 8 06 26 AM]
<https://user-images.githubusercontent.com/17605945/112504810-0323ca80-8d41-11eb-9576-b4f5aa8522f6.png>
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#3540 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBFQW6VLMO5P5KTDPADTFNNZ5ANCNFSM4ZZLSRKA>
.
|
You can use A {string} with identification="some goo" and associated taxa={list of stuff that PCRed out}, but....
That does need something more than the A {string} can provide - I think it's about the same as "have guessed about but have not been confirmed" - the full ID (confidence, sensu, determiners, etc.) for each "entity" would definitely be more informative. I suppose one option for things like https://arctos.database.museum/guid/MSB:Mamm:55245 would just be to make the ordering a non-unique integer, so you could...
Still plenty of complexity to sort out (I'm working on the citation bulkloader, which creates identifications, at the moment - it would be affected by this) but this is starting to feel realistic. |
This could also be useful in paleo where you have a slab of rock with, say, 200 fossils from 10 different taxa, where it would be very complicated to individually catalog things. |
I'd like to request to have this functional in next couple of weeks to test
for possible environmental sample for SPNHC talk.
…On Tue, Apr 13, 2021 at 12:17 PM Nicole-Ridgwell-NMMNHS < ***@***.***> wrote:
* [EXTERNAL]*
This could also be useful in paleo where you have a slab of rock with,
say, 200 fossils from 10 different taxa, where it would be very complicated
to individually catalog things.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#3540 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBDOBY6YJJKLQGIGRVDTISDBXANCNFSM4ZZLSRKA>
.
|
I'm not saying I wouldn't use it that way, but that's not cataloging the item of scientific interest either; those are really distinct THINGS (that just happen to be difficult to separate). Cataloging them is pretty trivial - similar things happen in insect collections on a regular basis - but associating the correct catalog record with a tiny speck on the rock might be another story. I don't have any better ideas (maybe the paleo pollen folks or similar do??), but the implications (eg, poor links between THAT teensy fossil and the literature) should be understood before using this approach.
I'm not sure that's at all realistic. Most obviously, there are a ton of forms (adding IDs, bulkloading citations, etc.) where inserting an "accepted" record makes everything else "unaccepted." If that's no longer binary then all of those forms have to be redesigned. (So would everything that uses the binary to find the "best" ID, but that should mostly be behind the scenes.) I have no idea at all what that might look like; I need input from The Community. The simplest use case is probably adding an identification to the "detail page" - what should happen there? Maybe the answer to that can help guide more complex and less interactive situations. I suspect the most likely problem (if we go with #3540 (comment)) will be the intersection of multiple equally-highest-ranked IDs and flattening - given 12 "most-preferred" IDs, what gets stashed in FLAT or sent to GBIF? And all that said, I'm not sure I understood the initial request. After this afternoon's webinar, I think maybe we just need to order the taxa used in A {string} IDs? Are we talking about these - three IDs are in the screenshot... ...or these 5 taxa used in a single identification? |
Unfortunately, I think we are talking about both....but this issue was started based upon @AJLinn 's request in the webinar so it should focus on
|
Angie wants the cultural name(s) to appear first as they are more important to her. |
Yes, both would be highly useful but solve Angie's first.
…On Tue, Apr 13, 2021, 5:03 PM Teresa Mayfield-Meyer < ***@***.***> wrote:
* [EXTERNAL]*
Angie wants the cultural name(s) to appear first as they are more
important to her.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#3540 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBDHWIZ5MZ3QHOI2C2DTITEVJANCNFSM4ZZLSRKA>
.
|
We could:
"A and B and C and ...." have one place for metadata, and would require new formulae for every number of involved taxa. I introduced the 'many taxa on A {string} IDs' thing because it was easy (just drop a unique composite key), we didn't have cultural taxonomy at the time, and we were/are limited to one accepted. It gets the important bits across, but it has pretty severe limitations - eg does the ID metadata apply to all of the taxa, or some average of everything involved, or ???? Is the parka expert who identified the thing really equally comfortable with phocids and plants and minerals and whatever else is included? There's no structured way to say, even if we do somehow figure out a way to add ordering without conflicting with the "core" mechanism for ordering (taxa formula). Multiple IDs has none of those restrictions. The parka's IDs would include
That should be interpreted as "mostly parka, then seal and alder, then copper." It contains the same taxa information as the A {string} ID (assuming the seal isn't a hybrid &etc., which isn't possible in the A {string} multitaxa approach), but
Potentially not-so-great (or really great, depending on your viewpoint!) features include:
We'd first need to get past the hurdles outlined above - eg, what happens when you {do thing that currently depends on the binary nature of acceptedness}, how exactly does this get displayed in tabular format, etc.? @AJLinn does that approach work for you? |
Create two "accepted" identifications - one cultural, one biological and we need to figure out how to get the cultural ID at the top. We could use this as a test case - @AJLinn want to play? @Nicole-Ridgwell-NMMNHS maybe you have an example ot two that we can play around with? |
I don't know if I have anything right off the bat where ordering matters much, but for greater than 2 accepted IDs needed, here is a good example: https://arctos.database.museum/guid/NMMNH:Paleo:14152 needs four accepted IDs, Evazoum sirigui, Grallator cursorius, Tetrapoda, and Theropoda |
@dustymc can we work with Nicole's example above? |
Five: don't forget the rock! I think #3540 (comment) starting at "Multiple IDs.. " does that. (So does the first part of that comment, a new taxa formula, as long as you're OK with the identification being formulaic - "Evazoum sirigui and Grallator cursorius and Tetrapoda and Theropoda and rock." That would take a code table request and a fair bit of code to deal with more than 2 names in an ID.) "rank" (aka order) doesn't necessarily have to be unique, so 4 unsorted (NULL rank, same rank, IDK, needs ironed out if we proceed) accepted IDs (and/or 54 sorted unaccepted, or any crazy mix thereof) should work well enough. An Entity-ish alternative would be to call the current record (or a new one in the entity collection or whatever) "rock with lots of tracks" and then catalog the tracks individually and link them to the "parent." I think that's probably a bit more sciencey and a bit less representative of how things are actually cataloged and stored, and I don't think multiple accepted IDs can cause the things we created Entities to avoid, but that possibility deserves a very close look before we write any code. |
Sorry to miss all of this... guess I zoned out for a year...
So that means on this record instead of having this list with one set of metadata: I'd add multiple determinations, each with their own metadata. And the order in which I enter them is their "rank"? What happens if you need to correct one and add something new, like a correction... what happens to the order then? |
It happens!
Yep.
No, rank would be some explicit new thing.
You tell me, but I think whatever you want - I'm envisioning this as some sort of (probably nonunique) integer (which could be drag-n-drop or whatever in the UI) that you'd have complete control over. If you want some particular order then you can do that, if you don't care then it wouldn't add anything (other than the ability to have multiple accepted, which you could use or not as you wish). I think I've flip-flopped a few times (at least in my head) on keeping the accepted flag around or replacing it with the rank, right now I'm leaning towards keeping it which seems like it would be a more explicit/less confusing way to say "nope, wasn't this at all" when necessary, even if it is almost-sorta redundant with the rank. |
@AJLinn my idea was that you would have only two identifications - one with Parka and the other with all of the biological names using the A {string}, but then we would still need a way to prioritize them so that the cultural one showed up at the top. |
I don't think this would be reason to get rid of the A {string} with lots of taxa thing, so yes, both
and
would work, and the first could incrementally be turned into the second as things get used and identifications get made. |
We will probably need to consider how we publish identification to aggregators AND publish the Darwin Core Identification History extension. |
As usual, we can put a lot of work into some arbitrary thing that won't much be used and can't possibly be a decent representation of the data, or we can just give them what we have and let the aggregators worry about how to cram it into whatever weird little pigeonholes they want to construct. I recommend focusing all efforts on the latter. |
Already, if you use an "and" identification, GBIF takes whatever is listed first and makes that the identification. If we concatenate for GBIF, if collections want to try to control what GBIF takes as the ID, they could make that ID 1, and all the others 2? Identified by and Identified date might be more complicated? |
I support this. |
AWG approved for implementation 2023-01-05 Setting to next task - do we need a milestone of AWG approved? |
For @DerekSikes : make sure a single 1 (highest-rank, whatever values turn out to be) and zero or more 0 (lowest rank) continue to function as current, and that that's documented. |
EDIT: remapped to 0-10 - IDK why 100 'ranks' would ever be necessary and it makes ugly dropdowns. Tentatively mapped as
I don't think the above ("highest-rank") is necessary, the '1' (currently 'accepted') will continue to work as it does, as long as it's accompanied by only '0' (current and future ~'not preferred'). |
Issue Documentation is http://handbook.arctosdb.org/how_to/How-to-Use-Issues-in-Arctos.html
Is your feature request related to a problem? Please describe.
https://arctos.database.museum/guid/UAM:EH:0610-5898 has a number of IDs that mix cultural and biological taxonomic designations; I’d like to have the cultural ones go first, then biological
Describe what you're trying to accomplish
Organize various identifications on a catalog record
Describe the solution you'd like
Allow collections to create an order in manage collection (cultural, biological, mineral for example)
Describe alternatives you've considered
Allow manual ordering of each set of ids - might be nice, but also might be time intensive as opposed to a general rule
Additional context
Add any other context or screenshots about the feature request here.
Priority
Please assign a priority-label. Unprioritized issues gets sent into a black hole of despair.
The text was updated successfully, but these errors were encountered: