pepstats_tables_python
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 3 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.3%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: raramayo
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Size: 39.1 KB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
PepStatsTablesPython
Motivation
``` This script was generated with the objective of constructing a table summarizing the main physical-chemical properties associated with each protein present in a given proteome. The script uses PepStats, an EMBOSS program to extract and construct a table displaying the following protein properties:
Molecular_weight
Isoelectric_Point
Mole%_Ala
Mole%_Cys
Mole%_Asp
Mole%_Glu
Mole%_Phe
Mole%_Gly
Mole%_His
Mole%_Ile
Mole%_Lys
Mole%_Leu
Mole%_Met
Mole%_Asn
Mole%_Pro
Mole%_Gln
Mole%_Arg
Mole%_Ser
Mole%_Thr
Mole%_Val
Mole%_Trp
Mole%_Tyr
Mole%_Tiny
Mole%_Small
Mole%_Aliphatic
Mole%_Aromatic
Mole%_Non-polar
Mole%_Polar
Mole%_Charged
Mole%_Basic
Mole%_Acidic
Protein_ID
In theory, thie script could be easily modified to extract other protein properties calculated by PepStats, but not extracted by the current script ```
Documentation
```
ARAMAYO_LAB
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.
SCRIPTNAME: PepStatsTablesv1.0.0.py SCRIPTVERSION: 1.0.0
USAGE: PepStatsTablesv1.0.0.py -p Homosapiens.GRCh38.pep.all.fa # REQUIRED (Proteins File - Proteome) -r PepStatsTables # OPTIONAL (Run Name) -z TMPDIR Location # OPTIONAL (default=0='TMP TMPDIR Run')
TYPICAL COMMANDS: PepStatsTablesv1.0.0.py -p Homosapiens.GRCh38.pep.all.fa -r PepStatsTables
INPUT01: -p FLAG REQUIRED - Protein File INPUT01FORMAT: Fasta Format INPUT01DEFAULT: No default
INPUT02: -r FLAG OPTIONAL - Run Name INPUT02FORMAT: Text INPUT02DEFAULT: PepStats_Tables
INPUT03: -z FLAG OPTIONAL input INPUT03FORMAT: Numeric: 0 == TMP TMPDIR Run | 1 == Local TMP Run INPUT03DEFAULT: 0 == TMP TMPDIR Run INPUT03NOTES: 0 Processes the data in the TMP $TMPDIR directory of the computer used or of the node assigned by the SuperComputer scheduler INPUT03NOTES: Processing the data in the $TMPDIR directory of the node assigned by the SuperComputer scheduler reduces the possibility of file error generation due to network traffic INPUT03_NOTES: 1 Processes the data in the same directory where the script is being run
DEPENDENCIES: EMBOSS: Required (see: http://emboss.open-bio.org/html/adm/ch01s01.html)
Author: Rodolfo Aramayo WORKEMAIL: raramayo@tamu.edu PERSONALEMAIL: rodolfo@aramayo.org
```
Development/Testing Environment:
Distributor ID: Apple, Inc.
Description: Apple M1 Max
Release: 14.4.1
Codename: Sonoma
Distributor ID: Ubuntu
Description: Ubuntu 22.04.3 LTS
Release: 22.04
Codename: jammy
Required Script Dependencies:
EMBOSS (https://emboss.sourceforge.net/download/)
Version Number: 6.6.0.0
Credits: https://emboss.sourceforge.net/credits/
Owner
- Name: Rodolfo Aramayo
- Login: raramayo
- Kind: user
- Location: College Station, Texas USA
- Company: Texas A&M University
- Repositories: 1
- Profile: https://github.com/raramayo
@AramayoLab
Citation (CITATION.cff)
cff-version: 1.1.0 message: "If you use this software, please cite it as below." authors: - family-names: Aramayo given-names: Rodolfo orcid: https://orcid.org/0000-0001-9702-6204 title: PepStats_Tables version: v1.0.0 date-released: 2024-03-28