-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nextprot dataset and protein examples #423
base: master
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,43 @@ | ||
{ | ||
"@type": "Dataset", | ||
"name": "neXtProt entries", | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Is 'entries' part of the dataset name? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It does not seem it is from what I saw on their website |
||
"description": "The collection of neXtProt entries for human proteins", | ||
"url": "https://www.nextprot.org", | ||
"keywords": "nextprot,Human,Proteins,Proteome,Proteomics,protein database,protein knowledgebase,protein resource,human protein,human proteome,function,medical,disease,expression,interactions,sequence,isoform,mutation,variant,phenotypes,proteomics,peptide,structure,3D,annotation,biocuration,chromosomes,protein validation,protein-coding genes,post-translational modifications,ptm,data integration,systems biology,genetic variations,UniProt", | ||
"distribution": [ | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This looks to be correct |
||
{ | ||
"@type": "DataDownload", | ||
"contentUrl": "ftp://ftp.nextprot.org/pub/current_release/xml/nextprot_all.xml.gz", | ||
"fileFormat": "XML" | ||
}, | ||
{ | ||
"@type": "DataDownload", | ||
"fileFormat": "RDF" | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Are not you missing the download URL here? |
||
}, | ||
{ | ||
"@type": "DataDownload", | ||
"contentUrl": "ftp://ftp.nextprot.org/pub/current_release/peff/nextprot_all.peff.gz", | ||
"fileFormat": "PEFF" | ||
}, | ||
{ | ||
"@type": "DataDownload", | ||
"contentUrl": "ftp://ftp.nextprot.org/pub/current_release/md5/nextprot_sequence_md5.txt", | ||
"fileFormat": "TXT" | ||
}, | ||
{ | ||
"@type": "DataDownload", | ||
"contentUrl": "https://api.nextprot.org/export/entries/all.fasta", | ||
"fileFormat": "FASTA" | ||
} | ||
], | ||
"potentialAction": { | ||
"@type": "SearchAction", | ||
"target": "https://www.nextprot.org/proteins/search?query={query}", | ||
"query-input": "required name=query" | ||
}, | ||
"license": { | ||
"@type": "CreativeWork", | ||
"name": "Creative Commons CC BY 4.0 Attribution", | ||
"url": "https://creativecommons.org/licenses/by/4.0/" | ||
} | ||
} | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. And it is minimum according to https://bioschemas.org/profiles/Dataset/0.3-RELEASE-2019_06_14/. |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
{ | ||
"@context": "http://schema.org", | ||
"@type": "DataRecord", | ||
"@id": "https://www.nextprot.org/entry/NX_P52701", | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This |
||
"includedInDataset": "ftp://ftp.nextprot.org/pub/current_release/xml/nextprot_all.xml.gz", | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The value of this should be the value of the |
||
"citation": { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Omit this property if there isn't a value for it. You may also want to add a |
||
"@id": "", | ||
"@type": "" | ||
}, | ||
"mainEntity": { | ||
"@id": "https://www.nextprot.org/entry/NX_P52701", | ||
"@type": "Protein", | ||
"http://purl.org/dc/terms/conformsTo": "https://bioschemas.org/specifications/Protein/0.9-DRAFT", | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Please update to 0.11-RELEASE |
||
"identifier": "NX_P52701", | ||
"name": "DNA mismatch repair protein Msh6", | ||
"description": "", | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Omit properties for which there is no data. However in this case you probably want to include the text in your overview section of the webpage |
||
"alternateName": ["G/T mismatch-binding protein", "GTBP", "GTMBP", "MutS protein homolog 6", "MutS-alpha 160 kDa subunit"], | ||
"url": "https://www.nextprot.org/entry/NX_P52701", | ||
"hasBioChemEntityPart": [ | ||
{ | ||
"isEncodedByBioChemEntity": { | ||
"@type": "Gene", | ||
"name": "MSH6", | ||
"identifier": "HGNC:7329", | ||
"hasRepresentation": "2p16.3" | ||
}, | ||
"taxonomicRange": { | ||
"@id": "https://identifiers.org/taxonomy:9606", | ||
"@type": "Taxon", | ||
"name": "Human" | ||
} | ||
Comment on lines
+21
to
+31
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do you intend that these are embedded within the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. According to the profile, these two properties can be used directly for a Protein. |
||
} | ||
] | ||
} | ||
} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should have an
@id
property to identify the datasetThere was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree, @id is important for Dataset