Recent Releases of instanovo

instanovo - InstaNovo v1.1.4

What's Changed

  • build(deps): bump astral-sh/setup-uv from 5 to 6 by @dependabot in https://github.com/instadeepai/InstaNovo/pull/100
  • fix: diffusion sampling and checkpoint download by @rcatzel in https://github.com/instadeepai/InstaNovo/pull/111
  • chore: bump version number to v1.1.4 by @rcatzel in https://github.com/instadeepai/InstaNovo/pull/112

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/1.1.3...1.1.4

- Python
Published by rcatzel 9 months ago

instanovo - InstaNovo v1.1.3

What's Changed

  • chore: bump version number to v1.1.2 by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/104
  • docs: add InstaNovo-P notebook by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/105
  • InstaNovo v1.1.3 by @rcatzel in https://github.com/instadeepai/InstaNovo/pull/109

Notes

  • Update codebase to use new diffusion checkpoint, instanovoplus-v1.1.0
  • Update diffusion predict script with refinement configuration options, and multiple prediction sampling for improved performance
    • refine_all: If True, all predictions will be refined
    • refine_threshold: Only predictions with a confidence score less than this will be refined
    • n_preds: Number of diffusion predictions to sample per spectrum
  • Include updated model performance benchmarking in README.md
  • Added option to specify valid_path as a DictConfig, where keys represent the validation group and values represent the validation path.
    • E.g. "acpt": "/path/to/acpt/*-valid-*.parquet"
    • This will add an "acpt" group in the validation metrics
  • Added add_source_file_column option to SpectrumDataFrame, which adds the path to the original input file as a column "source_file"
  • Added add_spectrum_id option to SpectrumDataFrame, which adds unique index values to the input file as a column "spectrum_id"
  • Updated s3 utils to use a class instead of individual functions
    • Added functionality to wrap write methods and optionally specify an s3 path which will get automatically uploaded
    • Added functionality to automatically download files to a temporary directory when converting s3 paths
  • Allow --output-path to remain unspecified in evaluation mode for:
    • Transformer model
    • Diffusion model when not in refinement mode
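
The refinement options in the notes above (refine_all and refine_threshold) can be read as a simple selection rule over the first-pass predictions. The following is a minimal sketch of that rule under stated assumptions, not the library's implementation: the Prediction class, the select_for_refinement helper, and the default threshold of 0.9 are all hypothetical and exist only for illustration. Sampling several candidates per spectrum (n_preds) is not modelled here.

```python
from dataclasses import dataclass


@dataclass
class Prediction:
    """Hypothetical container for a first-pass prediction."""

    peptide: str
    confidence: float


def select_for_refinement(preds, refine_all=False, refine_threshold=0.9):
    """Return the indices of predictions that would be refined.

    Assumed semantics, following the release notes:
    - refine_all=True: every prediction is refined.
    - otherwise: only predictions whose confidence is strictly
      below refine_threshold are refined.
    The default threshold is an illustrative value, not the library's.
    """
    if refine_all:
        return list(range(len(preds)))
    return [i for i, p in enumerate(preds) if p.confidence < refine_threshold]


preds = [Prediction("PEPTIDE", 0.95), Prediction("PEPTLDE", 0.40)]
low_confidence = select_for_refinement(preds, refine_threshold=0.9)
everything = select_for_refinement(preds, refine_all=True)
```

With these inputs only the second prediction falls below the threshold, so it alone is selected unless refine_all is set.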

New Checkpoints

  • Trained a new InstaNovo+ v1.1.0 checkpoint on the Extended Massive-KB dataset - instanovoplus-v1.1.0.ckpt. To be used standalone, or in conjunction with instanovo-v1.1.0.ckpt for best results.

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/1.1.2...1.1.3

- Python
Published by rcatzel 9 months ago

instanovo - InstaNovo v1.1.2: InstaNovo-P v1.0.0 checkpoint

What's Changed

This release includes the checkpoint for our de novo sequencing model for phosphoproteomics InstaNovo-P v1.0.0!

  • docs: re-enable building docs by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/85
  • Fix docs by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/88
  • Auto detect device by @rcatzel in https://github.com/instadeepai/InstaNovo/pull/91
  • Notebook comparing InstaNovo v0.1 versus InstaNovo v1.1 by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/95
  • docs: add coverage badge by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/89
  • instanovo-phospho-v1.0.0 by @jesperdlau in https://github.com/instadeepai/InstaNovo/pull/103

New Contributors

  • @jesperdlau made their first contribution in https://github.com/instadeepai/InstaNovo/pull/103

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/1.1.1...1.1.2

- Python
Published by BioGeek 10 months ago

instanovo - InstaNovo v1.1.1

What's Changed

Code updates for publication release

  • fix: add sample data by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/81
  • docs: update readme for publication by @KevinEloff @BioGeek @rcatzel in https://github.com/instadeepai/InstaNovo/pull/82
  • fix: add diffusion device option, set automatically on load by @KevinEloff in https://github.com/instadeepai/InstaNovo/pull/83
  • fix: use separate diffusion sdf instance, minor release bump by @rcatzel in https://github.com/instadeepai/InstaNovo/pull/84

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/1.1.0...1.1.1

- Python
Published by KevinEloff 11 months ago

instanovo - InstaNovo v1.1.0

What's Changed

  • Updated to use Typer @BioGeek
    • Typer includes options to predict with InstaNovo and refine with InstaNovo+ in one command
  • Updated dependency management to use UV @BioGeek
  • Re-added updated diffusion code @rcatzel
    • A preliminary checkpoint has been added
  • Adds support for UNIMOD ProForma format in the residue class (backwards compatible with original format)
  • Adds automatic UNIMOD conversion of old model checkpoints during loading by default
  • Adds support for specifying a list of globs for SpectrumDataFrame
  • Removed (+25.98) N-terminal modification from training, and added to suppressed list by default
  • Fixed bug where tempdir would not be initialised if all input data is in .parquet format, causing a crash when performing a preshuffle.
  • Updated predict script to always calculate delta_mass_ppm even in de novo mode.
  • Updated README.md
    • Added usage for UV and Typer
    • Added link to Acknowledgements
    • Added Natively Supported Modifications table with unimod ID
    • Added Output description table
    • Updated Using your own datasets table to remove the modified_sequence column
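
To illustrate what a conversion from the original mass-shift notation to UNIMOD ProForma notation might look like, here is a small sketch. It is not the project's converter: the to_unimod_proforma function and the LEGACY_TO_UNIMOD table are hypothetical, and the table covers only three well-known modifications (Carbamidomethyl = UNIMOD:4, Oxidation = UNIMOD:35, Phospho = UNIMOD:21). The legacy "(+57.02)" style is assumed from the "(+25.98)" notation used in these notes.

```python
import re

# Illustrative subset only; a real mapping would cover every supported residue.
LEGACY_TO_UNIMOD = {
    "+57.02": 4,   # Carbamidomethyl
    "+15.99": 35,  # Oxidation
    "+79.97": 21,  # Phospho
}

# Matches a parenthesised signed mass shift such as "(+57.02)".
_LEGACY_MOD = re.compile(r"\(([+-]\d+\.\d+)\)")


def to_unimod_proforma(sequence):
    """Rewrite known "(+mass)" annotations as "[UNIMOD:id]".

    Unknown mass shifts are left untouched, and sequences already in
    UNIMOD notation pass through unchanged because the pattern only
    matches parenthesised mass shifts.
    """

    def _replace(match):
        unimod_id = LEGACY_TO_UNIMOD.get(match.group(1))
        if unimod_id is None:
            return match.group(0)
        return f"[UNIMOD:{unimod_id}]"

    return _LEGACY_MOD.sub(_replace, sequence)


converted = to_unimod_proforma("AC(+57.02)M(+15.99)K")
```

Leaving unrecognised annotations in place is a deliberate choice in this sketch: it keeps the conversion safe to apply to sequences that mix both notations.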

New checkpoints

  • Trained a new checkpoint with corrected cysteine modification and added MassiveKB data: instanovo-v1.1.0
  • Preliminary InstaNovo+ checkpoint trained on the same data: instanovoplus-v1.1.0-alpha

Notes:

  • A bug in the implicit cysteine modification has been corrected in AC-PT. The bug affected the previous instanovo_extended.ckpt checkpoint from the 1.0.0 release and caused significant performance issues on downstream datasets.
  • The diffusion checkpoint is an alpha release; an updated checkpoint is coming soon.

- Python
Published by KevinEloff 11 months ago

instanovo - InstaNovo v1.0.1

What's Changed

  • build(deps): bump pypa/gh-action-pypi-publish from 1.4.2 to 1.10.3 by @dependabot in https://github.com/instadeepai/InstaNovo/pull/55
  • build(deps): bump actions/setup-python from 4 to 5 by @dependabot in https://github.com/instadeepai/InstaNovo/pull/42
  • Update starter notebook, add charge check, fix sdpa by @KevinEloff in https://github.com/instadeepai/InstaNovo/pull/61
  • build(deps): bump pypa/gh-action-pypi-publish from 1.10.3 to 1.11.0 by @dependabot in https://github.com/instadeepai/InstaNovo/pull/64
  • fix: ZeroDivisionError when predicting on small sample file by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/68
  • V1.0.1 release by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/69

  • fix: Resolve "AttributeError: 'SpectrumDataFrame' object has no attribute 'df'"
  • feat: update notebooks to v1.0.0
  • feat: Automatic model download and improve residues (Co-Authored-By: Kevin Eloff <k.eloff@instadeep.com>)
  • feat: update tests for v1.0.1 release (Co-Authored-By: Rachel Catzel <r.catzel@instadeep.com>)
  • feat: update packages

New Contributors

  • @dependabot made their first contribution in https://github.com/instadeepai/InstaNovo/pull/55

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/1.0.0...1.0.1

- Python
Published by BioGeek about 1 year ago

instanovo - InstaNovo 1.0.0

Improved code utility and data validation

  • Check labels match precursor
  • Check for data leakage
  • Verify residue vocabulary
  • Added better residue support
  • Fine-tuning trainer automatically updates model weights with new sizes
  • Added Flash attention, torch.compile(), AMP (fp16)
  • Added improved fast greedy search
  • Improved test coverage
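
One plausible reading of the "check labels match precursor" validation is a comparison between the precursor m/z recorded for a spectrum and the m/z implied by its peptide label. The sketch below shows that arithmetic under stated assumptions: the function names, the 50 ppm tolerance, and the five-residue mass table are hypothetical and chosen for illustration, and modifications are ignored. The monoisotopic masses themselves are standard values.

```python
PROTON = 1.007276   # mass of a proton, in Da
WATER = 18.010565   # monoisotopic mass of H2O, in Da

# Monoisotopic residue masses (Da) for a small illustrative subset.
RESIDUE_MASS = {
    "G": 57.02146,
    "A": 71.03711,
    "S": 87.03203,
    "P": 97.05276,
    "V": 99.06841,
}


def theoretical_mz(peptide, charge):
    """m/z of an unmodified peptide at the given charge state."""
    neutral_mass = sum(RESIDUE_MASS[aa] for aa in peptide) + WATER
    return (neutral_mass + charge * PROTON) / charge


def delta_mass_ppm(observed_mz, peptide, charge):
    """Relative error of the observed m/z, in parts per million."""
    expected = theoretical_mz(peptide, charge)
    return (observed_mz - expected) / expected * 1e6


def label_matches_precursor(observed_mz, peptide, charge, tol_ppm=50.0):
    """True when the label is within tol_ppm of the precursor (assumed tolerance)."""
    return abs(delta_mass_ppm(observed_mz, peptide, charge)) <= tol_ppm


mz = theoretical_mz("GASP", 2)
ok = label_matches_precursor(mz, "GASP", 2)
mismatch = label_matches_precursor(mz + 1.0, "GASP", 2)
```

A label whose implied m/z differs from the recorded precursor by a whole Dalton fails the check by a wide margin, which is the kind of mislabelled row such a validation step is meant to catch.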

Added Spectrum Data Handler

  • Supports lazy loading with asynchronous prefetching
  • Filtering and sampling performed non-destructively (by updating the row filter)
  • Two-fold shuffling strategy for training ensures optimal load times
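
The idea of filtering non-destructively by updating a row filter can be sketched with a boolean mask kept alongside the data. This is only an illustration of the general technique, not the Spectrum Data Handler itself: the MaskedTable class and its methods are hypothetical names, and lazy loading and prefetching are left out.

```python
class MaskedTable:
    """Rows plus a boolean mask; filtering flips mask bits, never deletes rows."""

    def __init__(self, rows):
        self._rows = list(rows)
        self._mask = [True] * len(self._rows)

    def filter(self, predicate):
        """Hide rows that fail the predicate; successive filters are combined."""
        self._mask = [
            keep and predicate(row)
            for keep, row in zip(self._mask, self._rows)
        ]

    def reset(self):
        """Make every row visible again; possible because nothing was dropped."""
        self._mask = [True] * len(self._rows)

    def __iter__(self):
        return (row for keep, row in zip(self._mask, self._rows) if keep)

    def __len__(self):
        return sum(self._mask)


table = MaskedTable([{"charge": 2}, {"charge": 5}, {"charge": 3}])
table.filter(lambda row: row["charge"] <= 3)
visible = list(table)
```

Because the underlying rows are never removed, a filter can be undone with reset() without reloading the data from disk.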

Extended model checkpoint released. Trained on 32M spectra with additional PTMs:

  • AC-PT
  • Additional PRIDE dataset
  • Additional phosphorylation dataset

- Python
Published by KevinEloff over 1 year ago

instanovo - 0.1.7

What's Changed

  • feat: generalise mgf to all metadata by @KevinEloff in https://github.com/instadeepai/InstaNovo/pull/19
  • Add GitHub action to purge GitHub cache of badge image on new release by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/18
  • fix: dataloader using old schema by @KevinEloff in https://github.com/instadeepai/InstaNovo/pull/24
  • refactor: linting by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/21
  • Add integration test by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/27
  • fix: update benchmark URL to InstaDeepAI/msninespeciesbenchmark by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/28
  • Fix version conflict for docs build by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/29
  • docs: add link to Tran et al 2017 + fix polars link by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/30
  • ci: add dependabot to update github actions and packages by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/31
  • Release 0.1.7 by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/33

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/0.1.6...0.1.7

- Python
Published by BioGeek almost 2 years ago

instanovo - 0.1.6 Data conversion scripts

  • Add data conversion scripts
  • Add InstaNovo+ to starter notebook
  • Add functionality to return all beams (return_all_beams=True argument in decoder.decode) #12

- Python
Published by KevinEloff over 2 years ago

instanovo - 0.1.5 Diffusion Model and Checkpoints

  • Add code for diffusion model
  • Add checkpoint for diffusion model, pre-trained on HC-PT and fine-tuned on Nine Species excluding yeast.

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/0.1.4...0.1.5

- Python
Published by amabinsta over 2 years ago

instanovo - 0.1.4 InstaNovo model checkpoints

What's Changed

  • feat: bundle config files with checkpoints
  • Add checkpoints: instanovo.pt, trained on HC-PT, and instanovo_yeast.pt, fine-tuned on Nine Species excluding yeast.

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/0.1.2...0.1.4

- Python
Published by KevinEloff over 2 years ago

instanovo - 0.1.3

What's Changed

  • docs: add hyperlinks by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/6
  • fix: update URLs by @BioGeek in https://github.com/instadeepai/InstaNovo/pull/7

Full Changelog: https://github.com/instadeepai/InstaNovo/compare/0.1.2...0.1.3

- Python
Published by BioGeek over 2 years ago

instanovo - 0.1.2

- Python
Published by BioGeek over 2 years ago

instanovo - 0.1.1

Initial code release

- Python
Published by BioGeek over 2 years ago

instanovo - 0.1.0

Initial code release

- Python
Published by BioGeek over 2 years ago