https://github.com/czbiohub-sf/protdata

Dataloader for common proteomics search engine outputs into AnnData format

https://github.com/czbiohub-sf/protdata

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (17.3%) to scientific vocabulary

Keywords

anndata proteomics proteomics-datasets python
Last synced: 5 months ago · JSON representation

Repository

Dataloader for common proteomics search engine outputs into AnnData format

Basic Info
Statistics
  • Stars: 5
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Topics
anndata proteomics proteomics-datasets python
Created 10 months ago · Last pushed 7 months ago
Metadata Files
Readme License

README.md

protdata

Test PyPI - Version docs online

image

Proteomics data loaders for the AnnData format.

This package provides loader functions to import proteomics data (e.g., MaxQuant) into the AnnData structure for downstream analysis and easy integration with single-cell and multi-omics workflows.

Features

  • Multiple formats: Support for MaxQuant, FragPipe, DIA-NN, and mzTab files
  • Reads metadata: Automatically extracts and organizes sample and protein metadata

Installation

Protdata has minimal dependencies listed in pyproject.toml

To install the latest release from PyPI, run:

bash pip install protdata

Or install from source: bash git clone https://github.com/czbiohub-sf/protdata.git cd protdata pip install -e . # or make setup-develop for developers

Usage Example

MaxQuant Import

You can download an example proteinGroups file here ```python import protdata

adata = loadmaxquantto_anndata("/path/to/proteinGroups.txt") print(adata) ```

DIA-NN Import

You can download an example DIA-NN report file here

```python from protdata.io import read_diann

adata = readdiann("/path/to/report.pgmatrix.tsv") print(adata) ```

FragPipe Import

You can download an example FragPipe output file here

```python from protdata.io import read_fragpipe

adata = readfragpipe("/path/to/combinedprotein.tsv") print(adata) ```

mzTab Import

You can download an example mzTab file here

```python from protdata.io import read_mztab

adata = readmztab("/path/to/SILACSQ.mzTab") print(adata) ```

Authors

protdata is created and maintained by the Computational Biology Platform at the Chan Zuckerberg Biohub San Francisco.

To get in touch please use the GihHub issues page.

Contributing

If you want to contribute to protdata, please read the Contribution Guide

Changelog

See Release Notes

Owner

  • Name: Chan Zuckerberg Biohub San Francisco
  • Login: czbiohub-sf
  • Kind: organization
  • Location: San Francisco

GitHub Events

Total
  • Watch event: 6
  • Issue comment event: 1
  • Push event: 1
  • Pull request event: 1
  • Create event: 1
Last Year
  • Watch event: 6
  • Issue comment event: 1
  • Push event: 1
  • Pull request event: 1
  • Create event: 1

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 203 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 5
  • Total maintainers: 1
pypi.org: protdata

Proteomics data loaders for AnnData format.

  • Versions: 5
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 203 Last month
Rankings
Dependent packages count: 9.2%
Average: 30.4%
Dependent repos count: 51.7%
Maintainers (1)
Last synced: 6 months ago

Dependencies

.github/workflows/publish.yml actions
  • actions/checkout v4 composite
  • actions/download-artifact v4 composite
  • actions/setup-python v5 composite
  • actions/upload-artifact v4 composite
  • pypa/gh-action-pypi-publish release/v1 composite
.github/workflows/test.yml actions
  • actions/checkout v4 composite
  • actions/setup-python v5 composite
pyproject.toml pypi
  • anndata *
  • numpy *
  • pandas *
  • pyteomics *