https://github.com/bigbio/onsite

onsite: posttranslation modification site localization

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (8.7%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

onsite: posttranslation modification site localization

Basic Info

Host: GitHub
Owner: bigbio
License: mit
Language: Python
Default Branch: main
Homepage:
Size: 132 KB

Statistics

Stars: 0
Watchers: 4
Forks: 1
Open Issues: 1
Releases: 0

Created over 1 year ago · Last pushed 10 months ago

Metadata Files

Readme License

OnSite

Mass spectrometry post-translational modification localization tool.

This package provides tools for phosphorylation site localization and scoring using various algorithms including AScore and PhosphoRS.

Features

AScore Algorithm: Implementation of the AScore algorithm for phosphorylation site localization
PhosphoRS Algorithm: Implementation of the PhosphoRS algorithm for phosphorylation site localization
PhosphoScoring Pipeline: Complete workflow tool for processing MS/MS data files
LucXor: Python port of LuciPHOr2 for accurate PTM localization with probabilistic scoring

Installation

Using Poetry (Recommended)

```bash

Clone the repository

git clone https://github.com/bigbio/onsite.git cd onsite

Install with Poetry

poetry install

Activate the virtual environment

poetry shell ```

Using pip

bash pip install onsite

Usage

Command Line Interface

PhosphoScoring Pipeline

Process MS/MS data files using the PhosphoScoring pipeline:

```bash

Using Poetry

poetry run phospho-scoring -in spectra.mzML -id identifications.idXML -out results.idXML

Or after installation

phospho-scoring -in spectra.mzML -id identifications.idXML -out results.idXML ```

LucXor Tool

Use the LucXor tool for PTM localization:

```bash

Using Poetry

poetry run lucxor -in spectra.mzML -id identifications.idXML -out results.idXML

Or after installation

lucxor -in spectra.mzML -id identifications.idXML -out results.idXML

With custom parameters

python -m onsite.lucxor.cli \ -in spectra.mzML \ -id identifications.idXML \ -out results.idXML \ --fragmenttolerance 0.05 \ --fragmenttoleranceunit Da \ --minpeptidelength 6 \ --maxpeptide_length 50 ```

Python API

AScore Algorithm

```python from pyopenms import * from onsite import AScore

Initialize AScore

ascore = AScore()

Set parameters

ascore.setParameter("fragmentmasstolerance", 0.05) ascore.setParameter("fragmentmassunit", "Da")

Process a peptide hit

result = ascore.compute(peptide_hit, spectrum) ```

PhosphoRS Algorithm

```python from onsite import calculatephospholocalizationcompomicsstyle

Calculate phosphorylation site probabilities

siteprobs, isomerdetails = calculatephospholocalizationcompomicsstyle( peptidehit, spectrum, fragmenttolerance=0.05 ) ```

LucXor API

```python from onsite.lucxor import PyLuciPHOr2, LucXor, Peptide, PSM from onsite.lucxor.models import CIDModel, HCDModel from onsite.lucxor.spectrum import Spectrum from onsite.lucxor.flr import FLRCalculator

Initialize the processor

processor = PyLuciPHOr2()

Configure parameters

config = LucXorConfig() config.fragmenttolerance = 0.05 config.fragmenttolerance_unit = "Da"

Process data

results = processor.process(spectra, identifications) ```

Advanced LucXor Usage

```python from onsite.lucxor.core import CoreProcessor from onsite.lucxor.config import LucXorConfig

Custom configuration

config = LucXorConfig() config.fragmenttolerance = 0.05 config.fragmenttoleranceunit = "Da" config.minpeptidelength = 6 config.maxpeptide_length = 50 config.threads = 4

Initialize processor

processor = CoreProcessor(config)

Process data

results = processor.processpsms(psmlist, spectrum_map) ```

LucXor Algorithm

LucXor implements the complete LuciPHOr2 algorithm:

Spectrum Processing: Peak picking and noise filtering
Theoretical Spectrum Generation: Generate all possible modification site combinations
Fragment Matching: Match experimental and theoretical fragments
Probabilistic Scoring: Calculate localization probabilities using binomial statistics
False Localization Rate: Estimate confidence in localization results
Result Ranking: Rank results by probability and confidence

LucXor Configuration

Key parameters:

fragment_tolerance: Mass tolerance for fragment matching (default: 0.05 Da)
fragment_tolerance_unit: Unit for tolerance (Da or ppm)
min_peptide_length: Minimum peptide length to process
max_peptide_length: Maximum peptide length to process
threads: Number of parallel processing threads

LucXor Performance

Processing Speed: ~100-1000 PSMs/second depending on data complexity
Memory Usage: ~1-2 GB for typical datasets
Scalability: Linear scaling with dataset size and number of threads

LucXor Troubleshooting

Common Issues

Memory Errors: Reduce max_peptide_length or use fewer threads
Slow Processing: Increase number of threads or reduce fragment tolerance
Poor Results: Check fragment tolerance settings and data quality

Dependencies

Python 3.11+
pyOpenMS 3.4.0+
NumPy 2.3.2+
SciPy 1.16.1+

Development

Setup Development Environment

```bash

Install development dependencies

poetry install --with dev

Run tests

poetry run pytest

Format code

poetry run black .

Lint code

poetry run flake8 ```

Project Structure

onsite/ ├── onsite/ # Main package directory │ ├── __init__.py # Package initialization │ ├── ascore.py # AScore algorithm implementation │ ├── phosphors.py # PhosphoRS algorithm implementation │ └── lucxor/ # LucXor utilities │ ├── cli.py # Command-line interface │ ├── core.py # Core processing logic │ ├── models.py # Fragmentation models │ └── ... # Other modules ├── PhosphoScoring.py # Processing pipeline tool ├── pyproject.toml # Poetry configuration ├── README.md # This file └── LICENSE # MIT License

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

Citation

If you use this software in your research, please cite:

BigBio Stack. (2025). OnSite: Mass spectrometry post-translational localization tool. https://github.com/bigbio/onsite

If you use LucXor specifically, please also cite the original LuciPHOr2 paper:

Fermin, D., et al. (2013). LuciPHOr2: site identification of generic post-translational modifications from tandem mass spectrometry data. Nature Methods, 10(8), 744-746.

Owner

Name: BigBio Stack
Login: bigbio
Kind: organization
Email: proteomicsstack@gmail.com
Location: Cambridge, UK

Website: http://bigbio.xyz
Repositories: 24
Profile: https://github.com/bigbio

Provide big data solutions Bioinformatics

GitHub Events

Total

Issue comment event: 3
Push event: 1
Pull request event: 1
Pull request review comment event: 18
Pull request review event: 4

Last Year

Issue comment event: 3
Push event: 1
Pull request event: 1
Pull request review comment event: 18
Pull request review event: 4