Releases: J535D165/datahugger
v0.8
What's Changed
- Improve resolve speed and prevent hitting re3data.org servers by @J535D165 in #52
- Add extensible support for handle systems and metadata by @J535D165 in #56
- Auto unzip option #53 by @davetromp in #55
- Fix datahugger errors for CrossRef DOIs by @J535D165 in #58
New Contributors
- @davetromp made their first contribution in #55
Full Changelog: v0.7...v0.8
Coverage report
The following benchmark was run on 500 randomly selected records from DataCite.
Percentages
Percentage of datasets supported: 22.4%
Percentage of datasets not supported: 60.6%
Percentage of datasets with error: 17.0%
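
Because the benchmark sample is 500 records, the percentages above translate directly into record counts. The snippet below is a minimal sketch of that arithmetic, assuming every record falls into exactly one of the three categories; the function and variable names are illustrative and not part of datahugger.

```python
SAMPLE_SIZE = 500  # number of records in the benchmark, as stated in the report

# Percentages reported for v0.8
percentages = {
    "supported": 22.4,
    "not supported": 60.6,
    "error": 17.0,
}


def to_counts(percentages, sample_size):
    """Convert percentages of a sample into whole-record counts."""
    return {
        label: round(pct * sample_size / 100)
        for label, pct in percentages.items()
    }


counts = to_counts(percentages, SAMPLE_SIZE)
for label, n in counts.items():
    print(f"{label}: {n} of {SAMPLE_SIZE} records")
```

The counts add up to the full sample of 500, which is consistent with the three percentages summing to 100%.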
Table with unexpected errors
v0.7
What's Changed
- Enhance errors for repositories that throw 403 errors by @J535D165 in #50
- Add support for DOIs pointing to single files by @J535D165 in #51
Full Changelog: v0.6...v0.7
Coverage report
The following benchmark was run on 500 randomly selected records from DataCite.
Percentages
Percentage of datasets supported: 18.6%
Percentage of datasets not supported: 75.0%
Percentage of datasets with error: 6.4%
Table with unexpected errors
index | id | type | url | service | error |
---|---|---|---|---|---|
9 | 10.48448/kgfs-s492 | dois | https://underline.io/lecture/50210-findings-thai-nested-named-entity-recognition-corpus | nan | 500 Server Error: Internal Server Error for url: https://underline.io/lecture/50210-findings-thai-nested-named-entity-recognition-corpus |
64 | 10.7910/dvn/ghcv1g/bbucjs | dois | https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/GHCV1G/BBUCJS | nan | Failed to parse URL 'https://dataverse.harvard.edu/loginpage.xhtml;jsessionid=de4d68eca12a3479d7a636cd6d83?redirectPage=%2Ffile.xhtml%3FpersistentId%3Ddoi%3A10.7910%2FDVN%2FGHCV1G%2FBBUCJS' |
73 | 10.20345/digitue.1029.61 | dois | http://idb.ub.uni-tuebingen.de/opendigi/litrdsch_1902#p=141 | nan | 500 Server Error: Internal Server Error for url: https://idb.ub.uni-tuebingen.de/opendigi/litrdsch_1902#p=141 |
81 | 10.7916/d8-qcx3-yp94 | dois | https://dlc.library.columbia.edu/resolve/10.7916/d8-qcx3-yp94 | nan | 500 Server Error: Internal Server Error for url: https://dlc.library.columbia.edu/catalog/10.7916/d8-qcx3-yp94 |
96 | 10.17876/plate/dr.2/plates/201_33742 | dois | https://www.plate-archive.org/objects/dr.2/plates/201_33742 | nan | 500 Server Error: Internal Server Error for url: https://www.plate-archive.org/objects/dr.2/plates/201_33742/ |
128 | 10.25560/78890 | dois | http://spiral.imperial.ac.uk/handle/10044/1/78890 | nan | 404 Client Error: for url: https://spiral.imperial.ac.uk/rest/handle/10044/1 |
146 | 10.15496/publikation-32226 | dois | https://publikationen.uni-tuebingen.de/xmlui/handle/10900/90845 | nan | 403 Client Error: Forbidden for url: https://publikationen.uni-tuebingen.de/rest/handle/10900/90845 |
163 | 10.34755/irok.2022.72.26.033 | dois | https://www.elibrary.ru/item.asp?id=48800309&pff=1 | nan | ('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer')) |
170 | 10.25673/opendata2-168870 | dois | https://opendata2.uni-halle.de//handle/1516514412012/168894 | nan | 404 Client Error: for url: https://opendata2.uni-halle.de/rest/handle/1516514412012/168894 |
200 | 10.23725/akhp-6959 | dois | https://ors.datacite.org/doi:/10.23725/akhp-6959 | nan | HTTPSConnectionPool(host='ors.datacite.org', port=443): Max retries exceeded with url: /doi:/10.23725/akhp-6959 (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f1549b33710>: Failed to resolve 'ors.datacite.org' ([Errno -2] Name or service not known)")) |
202 | 10.48370/ofd/clo2tl/a0ndzw | dois | https://dataverse.openforestdata.pl/file.xhtml?persistentId=doi:10.48370/OFD/CLO2TL/A0NDZW | nan | list index out of range |
252 | 10.14469/ch/129258 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/134211 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/134211 (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_HANDSHAKE_FAILURE] sslv3 alert handshake failure (_ssl.c:1006)'))) |
257 | 10.25673/opendata2-91140 | dois | https://opendata2.uni-halle.de//handle/1516514412012/91154 | nan | 404 Client Error: for url: https://opendata2.uni-halle.de/rest/handle/1516514412012/91154 |
258 | 10.14469/ch/41814 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/48213 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/48213 (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_HANDSHAKE_FAILURE] sslv3 alert handshake failure (_ssl.c:1006)'))) |
296 | 10.14457/cmu.the.2009.132 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14457/CMU.the.2009.132 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Read timed out. (read timeout=3) |
297 | 10.6085/aa/lop001_026mtbd004r00_20100911.40.1 | dois | https://data.piscoweb.org/catalog/d1/mn/v1/object/doi:10.6085/AA/LOP001_026MTBD004R00_20100911.40.1 | nan | Failed to parse URL 'https://data.piscoweb.org/metacat/d1/mn/v1/object/doi:10.6085/AA/LOP001_026MTBD004R00_20100911.40.1' ... |
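
The error messages in the rows shown above fall into a handful of recurring groups (HTTP status errors, TLS and DNS failures, timeouts, URL parsing problems). The snippet below is a rough sketch of how such messages could be tallied. The category labels and matching rules are assumptions made for illustration, not something datahugger provides, and long messages are shortened with `...` after the part the rules inspect.

```python
import re
from collections import Counter

# Error messages from the table rows shown above, keyed by row index.
# Long messages are cut off ("...") after the part the rules below look at.
ERRORS = {
    9: "500 Server Error: Internal Server Error for url: ...",
    64: "Failed to parse URL ...",
    73: "500 Server Error: Internal Server Error for url: ...",
    81: "500 Server Error: Internal Server Error for url: ...",
    96: "500 Server Error: Internal Server Error for url: ...",
    128: "404 Client Error: for url: ...",
    146: "403 Client Error: Forbidden for url: ...",
    163: "('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))",
    170: "404 Client Error: for url: ...",
    200: "HTTPSConnectionPool(host='ors.datacite.org', port=443): Max retries exceeded ... NameResolutionError(...",
    202: "list index out of range",
    252: "HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded ... SSLError(...",
    257: "404 Client Error: for url: ...",
    258: "HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded ... SSLError(...",
    296: "HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Read timed out. (read timeout=3)",
    297: "Failed to parse URL ...",
}

# Matches messages such as "404 Client Error" and captures the status class digit.
HTTP_STATUS = re.compile(r"^(\d)\d{2} (?:Client|Server) Error")

# (substring, label) pairs checked in order; the labels are hypothetical.
SUBSTRING_RULES = [
    ("SSLError", "TLS handshake"),
    ("NameResolutionError", "DNS resolution"),
    ("timed out", "timeout"),
    ("Connection aborted", "connection reset"),
    ("Failed to parse URL", "URL parsing"),
]


def categorize(message: str) -> str:
    """Map an error message to a coarse category label."""
    match = HTTP_STATUS.match(message)
    if match:
        return f"HTTP {match.group(1)}xx"
    for needle, label in SUBSTRING_RULES:
        if needle in message:
            return label
    return "other"


tally = Counter(categorize(message) for message in ERRORS.values())
for label, n in tally.most_common():
    print(f"{label}: {n}")
```

Only the rows visible in the table are covered here; the table itself ends with `...`, so the tally describes this excerpt rather than every error in the benchmark.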
v0.6
What's Changed
- Disable Pangaea (implementation will follow) by @J535D165 in #48
- Add coverage reports to releases by @J535D165 in #44
- Fix version of datahugger by @J535D165 in #49
Full Changelog: v0.5...v0.6
Coverage report
The following benchmark was run on 500 randomly selected records from DataCite.
Percentages
Percentage of datasets supported: 17.2%
Percentage of datasets not supported: 71.2%
Percentage of datasets with error: 11.6%
Table with unexpected errors
v0.6a7
v0.6a6
v0.6a5
v0.6a4
Full Changelog: v0.6a3...v0.6a4
v0.6a3
Full Changelog: v0.6a2...v0.6a3