Skip to content

Commit

Permalink
Merge pull request #25 from wfondrie/quant
Browse files Browse the repository at this point in the history
Add support for downstream quantitation tools
  • Loading branch information
Will Fondrie authored Mar 19, 2021
2 parents d56f69a + 253ba36 commit 1eef5f3
Show file tree
Hide file tree
Showing 27 changed files with 1,860 additions and 705 deletions.
26 changes: 26 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,31 @@
# Changelog for mokapot

## [unreleased]
### Added
- Support for downstream peptide and protein quantitation with
[FlashLFQ](https://github.com/smith-chem-wisc/FlashLFQ). This is accomplished
through the `mokapot.to_flashlfq()` function or the `to_flashlfq()` method of
`LinearConfidence` objects. Note that to support the FlashLFQ format, you'll
need to specify additional columns in `read_pin()` or use a PepXML input file
(`read_pepxml()`).
- Added a top-level function for exporting confident PSMs, peptides, and
proteins from one or more `LinearConfidence` objects as a tab-delimited file:
`mokapot.to_txt()`.
- Added a top-level function for reading FASTA files for protein-level
confidence estimates: `mokapot.read_fasta()`.
- Tests accompanying the support for the features above.
- Added a "mokapot cookbook" to the documentation with helpful code snippets.

### Changed
- Corresponding with support for new formats, the `mokapot.read_pin()` function
and the `LinearPsmDataset` constructor now have many new optional parameters.
These specify the columns containing the metadata needed to write the added
formats.
- Starting mokapot should be slightly faster for Python >= 3.8. We were able to
eliminate the runtime call to setuptools, because of the recent addition of
`importlib.metadata` to the standard library, saving a few hundred
milliseconds.

## [0.6.2] - 2021-03-12
### Added
- Now checks to verify there are no debugging print statements in the code
Expand Down
4 changes: 3 additions & 1 deletion docs/source/api/functions.rst
Original file line number Diff line number Diff line change
Expand Up @@ -7,8 +7,10 @@ Primary Functions
-----------------
.. autofunction:: read_pin
.. autofunction:: read_pepxml
.. autofunction:: read_fasta
.. autofunction:: brew

.. autofunction:: to_txt
.. autofunction:: to_flashlfq

Utility Functions
-----------------
Expand Down
21 changes: 21 additions & 0 deletions docs/source/api/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -32,6 +32,7 @@ Finally, custom machine learning models can be created using the
model.rst
dataset.rst
confidence.rst
proteins.rst

Functions
---------
Expand All @@ -44,7 +45,10 @@ Primary Functions

read_pin
read_pepxml
read_fasta
brew
to_txt
to_flashlfq

Utility Functions
*****************
Expand Down Expand Up @@ -100,3 +104,20 @@ optionally, proteins.

LinearConfidence
.. CrossLinkedConfidence
Protein Sequences
-----------------

To calculate protein-level confidence estimates, mokapot needs the original
protein sequences and digestion parameters used for the database search. These
are created using the :py:func:`mokapot.read_fasta()` function, which return a
:py:class:`Proteins` object. :py:class:`Proteins` objects store the mapping of
peptides to the proteins that may have generated them and the mapping of
target protein sequences to their corresponding decoys.

.. currentmodule:: mokapot.proteins
.. autosummary::
:nosignatures:

Proteins
3 changes: 3 additions & 0 deletions docs/source/api/proteins.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
Protein Sequences
=================
.. automodule:: mokapot.proteins
Loading

0 comments on commit 1eef5f3

Please sign in to comment.