filtersam
Tools to filter SAM/BAM files by percent identity and percent of matched sequence
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: ncbi.nlm.nih.gov, zenodo.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.3%) to scientific vocabulary
Keywords
Repository
Tools to filter SAM/BAM files by percent identity and percent of matched sequence
Basic Info
Statistics
- Stars: 5
- Watchers: 1
- Forks: 0
- Open Issues: 2
- Releases: 1
Topics
Metadata Files
README.md

A Python tool to filter sam/bam files by percent identity or percent of matched sequence
Percent identity is computed as:
$$PI = 100 \frac{Nm}{Nm + N_i}$$
where $Nm$ is the number of matches and $Ni$ is the number of mismatches.
Percent of matched sequences is computed as:
$$PM = 100 \frac{N_m}{L}$$
where $L$ corresponds to query sequence length.
NOTES
Percent of matched sequence is also an alternative definition of percent identity used in some cases, for intance, in BLAST.
BAM/SAM files must contain MD tags to be able to filter by percent identity. Aligners such as BWA add MD tags to each queried sequence in a BAM file. MD tags can also be generated with samtools.
Installation
pip install filtersam
Usage
You can find a jupyter notebook with usage examples here.
Citation
If you use this software, please cite it as below:
Robaina-Estévez, S. (2022). filterSAM: filter sam/bam files by percent identity or percent of matched sequence (Version 0.0.11)[Computer software]. https://doi.org/10.5281/zenodo.7056278.
Owner
- Name: Semidán Robaina
- Login: Robaina
- Kind: user
- Location: Atlantic Ocean
- Company: Hapdera
- Website: https://www.linkedin.com/in/semidan-robaina/
- Twitter: srobainae
- Repositories: 12
- Profile: https://github.com/Robaina
Computational Biology | Data Science | Python Dev. | Ph.D. Systems Biology
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Robaina-Estévez"
given-names: "Semidán"
orcid: "https://orcid.org/0000-0003-0781-1677"
title: "filterSAM: filter sam/bam files by percent identity or percent of matched sequence"
version: 0.0.11
doi: 10.5281/zenodo.7056278
date-released: 2022-09-07
url: "https://github.com/Robaina/filterSAM"
GitHub Events
Total
- Issues event: 1
- Watch event: 2
Last Year
- Issues event: 1
- Watch event: 2
Committers
Last synced: about 1 year ago
Top Committers
| Name | Commits | |
|---|---|---|
| Semidán | s****a@g****m | 31 |
| Semidán Robaina Estévez | h****o@s****m | 4 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 3
- Total pull requests: 0
- Average time to close issues: 5 days
- Average time to close pull requests: N/A
- Total issue authors: 3
- Total pull request authors: 0
- Average comments per issue: 0.33
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- mcmahon-uw (1)
- Robaina (1)
- njohner (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 37 last-month
- Total dependent packages: 0
- Total dependent repositories: 1
- Total versions: 11
- Total maintainers: 1
pypi.org: filtersam
('Tools to filter sam o bam files by percent identity or percent of matched sequence',)
- Homepage: http://github.com/robaina/filterSAM
- Documentation: https://filtersam.readthedocs.io/
- License: Creative Commons Attribution 4.0 International
-
Latest release: 0.0.11
published over 3 years ago
Rankings
Maintainers (1)
Dependencies
- numpy ==1.21.2
- parallelbam ==0.0.12
- pysam ==0.16.0.1
- numpy *
- parallelbam *
- pysam *