imlp

codebase and dotnet CLI tool for prediction of iMTS-Ls

https://github.com/csbiology/imlp

Science Score: 52.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
    Organization csbiology has institutional domain (csb.bio.uni-kl.de)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.0%) to scientific vocabulary
Last synced: 7 months ago · JSON representation ·

Repository

codebase and dotnet CLI tool for prediction of iMTS-Ls

Basic Info
  • Host: GitHub
  • Owner: CSBiology
  • License: mit
  • Language: F#
  • Default Branch: main
  • Size: 6.19 MB
Statistics
  • Stars: 4
  • Watchers: 3
  • Forks: 1
  • Open Issues: 0
  • Releases: 1
Created almost 5 years ago · Last pushed about 2 years ago
Metadata Files
Readme License Citation

README.md

iMLP

The imlp cli tool predicts iMTS-L propensity profiles for proteins of interest.

General

Installation

Docker (Recommended)

This tool is very easy to install and use with docker.

Dependencies: Only Docker :whale:!

Steps to run only once: 1. docker pull csbdocker/imlp, this step might take a few minutes, as the image is around 5GB. 2. docker run -i -d csbdocker/imlp, start a container from the just downloaded image and run it in the background.

Done! You now have a docker container from the csbdocker/imlp image. You can find the id of the container and more with:

  1. docker ps -l

Output: | CONTAINER ID | IMAGE | COMMAND | CREATED | STATUS | PORTS | NAMES | |--------------|----------------|-------------|----------------|---------------|-------|--------------| | da9b7ed591eb | csbdocker/imlp | "/bin/bash" | 10 minutes ago | Up 10 minutes | | loving_moser |

With this container id (Yours will differ! :warning:) you can start/stop/restart or delete this container. In our case we want to open a console inside the running container.

  1. docker exec -it da9b7ed591eb bash, this will pen a console and you can now type to execute commands inside the container.
  2. Run imlp with dotnet imlp --help :tada:
  3. Check out how to use imlp here.

Using the docker container with mounted directories

The usual use-case is using an input file in fasta format. Obviously, this file must be accessible in the container that you execute the tool in.

A simple way of doing this is mounting the directory on creation of the container. Note that you should target a subfolder of data in the container, as the container starts in /data per default.

docker run -it csbdocker/imlp --mount type=bind,source="<your-path-here>",target=/data/<container-path-here>

Example:

Suppose you have a file test.fasta in C:/my-folder:

bind C:/my-folder to /data/fastas:

docker run -it csbdocker/imlp --mount type=bind,source="C:/my-folder",target=/data/fastas

inside the container, you can use the container path you just bound to access the file, e.g.:

dotnet imlp -f ./fastas/test.fasta -o ./fastas/output.txt

which will run imlp with the fasta input and generate an output file in the same folder, which will also be available on the containers host OS afterwards.

Dotnet tool

imlp is packaged as dotnet tool. To use it: - install the .NET SDK for your system - run either dotnet tool install imlp (after initializing a local manifest via dotnet new tool-manifest) for a local or dotnet tool install -g imlp for a global installation - you can now run the tool via dotnet imlp ... (for a local installation) or imlp ... (for a global installation)

External dependencies on linux

CNTK has some external dependencies that can not be published with the packaged tool.

  • the necessary cntk binaries can be downloaded here
  • for installation of OpenMPI, please follow this guide
  • finally, apply this fix:
    • navigate to the location where you unpacked the cntk binaries
    • run cp ./cntk/lib/Cntk.Core.CSBinding-2.7.so ./cntk/lib/libCntk.Core.CSBinding-2.7.dll

all steps can be seen executed in out Dockerfile

External dependencies on windows

CNTK has some external dependencies that can not be published with the packaged tool.

Download the CNTK package here: https://cntk.ai/dlwc-2.7.html and add the extracted ./cntk/cntk folder to your path variable.

Published binaries

Download links to self-contained binaries can be found under releases. Linux users have to apply the same fixes as laid out under the dotnet tool section

Usage

```shell USAGE: imlp [--help] [--sequence ] [--inputfile ] [--outputfile ] [--model ] [--plotdirectory ] [--proteinheaderregex ] [--verbosity ]

OPTIONS:

--sequence, -s <sequence>
                      A single, one-letter coded amino acid input sequence. Either --sequence (-s) 
                      or --inputFile (-f) must be specified.
--inputfile, -f <inputFile>
                      Path to a fasta formatted input file that may contain multiple entries. Either 
                      --sequence (-s) or --inputFile (-f) must be specified.
--outputfile, -o <outputFile>
                      (optional) Path to the desired output file, which will be tab separated (tsv). 
                      If not specified, output will be printed to stdout instead.
--model, -m <plant|nonplant>
                      (optional) Model to use for prediction. Choose the one that is closest to your 
                      organism of interest: Either 'plant' or 'nonplant'. (default:nonplant)
--plotdirectory, -p <plotDirectory>
                      (optional) Path to a directory to save plots to.
--proteinheaderregex, -pr <proteinHeaderRegex>
                      (optional) Regular expression to extract protein names from fasta headers for 
                      the naming of plot output files. if not provided, plot files will be 'protein_{i}.html', 
                      where i is the index in the input.
--verbosity, -v <verbosity>
                      (optional) The verbosity of the logging process. 0(Silent) | 1(Error) | 2(Warn) | 3(Info) 
                      | >=4 : Debug | (default:1)
--help                display this list of options.

```

Development

build

  • dotnet tool restore (once)
  • dotnet fake build

test

  • dotnet tool restore (once)
  • dotnet fake build -t testpackagedtool (You need to have external dependencies (win/linux) installed)

publish self contained libraries

  • dotnet tool restore (once)
  • dotnet fake build -t publishbinaries will publish both win-x64 and linux-x64 binaries.

docker

  • set the correct imlp version in the containers IMLP_VERSION argument
  • docker build . -t imlp

Owner

  • Name: Computational Systems Biology
  • Login: CSBiology
  • Kind: organization
  • Location: Kaiserslautern

Computational Systems Biology

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Schneider
    given-names: Kevin
    orcid: https://orcid.org/0000-0002-2198-5262
  - family-names: Zimmer
    given-names: David
    orcid: https://orcid.org/0000-0002-7832-7658
  - family-names: Nielsen
    given-names: Henrik
  - family-names: Herrmann
    given-names: Johannes
    orcid: https://orcid.org/0000-0003-2081-4506
  - family-names: Muehlhaus
    given-names: Timo
    orcid: https://orcid.org/0000-0003-3925-6778
title: "iMLP, a predictor for internal matrix targeting-like sequences in mitochondrial proteins"
version: 0.1.2
identifiers:
  - type: doi
    value: 10.1515/hsz-2021-0185
date-released: 2021-07-05
preferred-citation:
  type: article
  authors:
  - family-names: Schneider
    given-names: Kevin
    orcid: https://orcid.org/0000-0002-2198-5262
  - family-names: Zimmer
    given-names: David
    orcid: https://orcid.org/0000-0002-7832-7658
  - family-names: Nielsen
    given-names: Henrik
  - family-names: Herrmann
    given-names: Johannes
    orcid: https://orcid.org/0000-0003-2081-4506
  - family-names: Muehlhaus
    given-names: Timo
    orcid: https://orcid.org/0000-0003-3925-6778
  doi: "10.1515/hsz-2021-0185"
  journal: "Biological Chemistry"
  month: 7
  start: 937 # First page number
  end: 943 # Last page number
  title: "iMLP, a predictor for internal matrix targeting-like sequences in mitochondrial proteins"
  issue: 8
  volume: 402
  year: 2021

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 40
  • Total Committers: 3
  • Avg Commits per committer: 13.333
  • Development Distribution Score (DDS): 0.125
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Kevin Schneider s****v@o****e 35
David Zimmer d****1@g****m 4
Kevin Frey F****r@g****e 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 0
  • Total pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: about 7 hours
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • ZimmerD (4)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • nuget 2,694 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
  • Total maintainers: 4
nuget.org: imlp

imlp predicts iMTS-L propensity profiles of protein sequences of interest

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 2,694 Total
Rankings
Forks count: 5.7%
Stargazers count: 6.2%
Dependent repos count: 12.7%
Average: 16.2%
Dependent packages count: 19.5%
Downloads: 36.8%
Last synced: 8 months ago

Dependencies

Dockerfile docker
  • mcr.microsoft.com/dotnet/sdk 5.0.202-focal-amd64 build