gcmodeller

GCModeller: genomics CAD(Computer Assistant Design) Modeller system in .NET language

https://github.com/smrucc/gcmodeller

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.9%) to scientific vocabulary

Keywords

bioinformatics biological-data-analysis gcmodeller genome-annotation genomics microsoft-visualbasic r-language scibasic single-cell virtual-cell virtualcell

Keywords from Contributors

mesh
Last synced: 6 months ago · JSON representation

Repository

GCModeller: genomics CAD(Computer Assistant Design) Modeller system in .NET language

Basic Info
  • Host: GitHub
  • Owner: SMRUCC
  • License: gpl-3.0
  • Language: Visual Basic .NET
  • Default Branch: master
  • Homepage: http://gcmodeller.org
  • Size: 881 MB
Statistics
  • Stars: 36
  • Watchers: 7
  • Forks: 10
  • Open Issues: 0
  • Releases: 2
Topics
bioinformatics biological-data-analysis gcmodeller genome-annotation genomics microsoft-visualbasic r-language scibasic single-cell virtual-cell virtualcell
Created about 10 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License Code of conduct Citation Notice

README.md

GCModeller

GCModeller: genomics CAD(Computer Assistant Design) Modeller system in .NET language

DOI Github All Releases GPL Licence Build Status

[WARNING] This project is a work in progress and is not recommended for production use.
  • HOME: http://gcmodeller.org
  • Github: https://github.com/SMRUCC/GCModeller
  • BioTools: https://bio.tools/gcmodeller

Supported platform: Microsoft Windows, GNU Linux, MAC, Microsoft Azure Cloud
Development: Microsoft VisualStudio 2019 | VisualBasic.NET
Runtime environment: sciBASIC# v2.1.5 beta & .NET Framework 4.7 (or mono 6.4)
Installation: VS2019 is required of compiles this project. After the source code have been clone using git, just open solution file /src/GCModeller.sln, and when restore nuget packages finished, then you are good to go of compile GCModeller project.

NOTE: Due to the reason of this project is using git submodule for manage some runtime component, so that please do not directly download the project source code from github by using the Donwload Zip button. The internal github client in the VisualStudio is recommended using for download the project source code.

Docker and Database

Dependency: Part of the GCModeller function required running Linux tools through Darwinism Docker environment for VB.NET (If you are running GCModeller on Windows Platform). This toolkit required of these environment installed on your windows server:

  • Microsoft PowerShell SDK 3.0
  • Latest version of Docker for X64
  • Then pull environment container image via: docker pull xieguigang/gcmodeller-env.

The docker container image contains these utils that required by GCModeller:

  • MEME suite for motif analysis
  • Mothur for construct OTU

Install Database: Some feature in GCModeller required the fasta sequence database was installed on a specific location on your server's filesystem, please follow this instruction to install the database for GCModeller.


GCModeller is an open source cloud computing platform for the geneticist and systems biology. You can easily build a local computing server cluster for GCModeller on the large amount biological data analysis.

The GCModeller platform is original writen in VisualBasic.NET language, a feature bioinformatics analysis environment that .NET language hybrids programming with R language was included, which its SDK is available at repository: https://github.com/SMRUCC/R.Bioinformatics

Currently the R language hybrids programming environment just provides some bioconductor API for the analysis in GCModeller.

GCModeller is a set of utility tools working on the annotation of the whole cell system, this including the whole genome regulation annotation, transcriptome analysis toolkits, metabolism pathway analysis toolkits. And some common bioinformatics problem utils tools and common biological database I/O tools is also available in GCModeller for the .NET language programming.

Directory Roadmap

Data Standards

  • GCModeller supports the SBML and BIOM data standards for exchanges the analysis and model data with other bioinformatics softwares.
  • Supports PSI data for the biological interaction network model
  • Supports OBO data for ontology database like go.

Modules & Functions

GCModeller provides a set of .NET libraries and CLI tools for processing biological analysis data. Currently GCModeller can provides these productive ready libraries:

1.Basically libraries
  • NCBI data analysis toolkit: Genbank/Taxonomy/nt/nr database
  • Common Data: FastA database, FastQ, SAM data file I/O class
  • Biological Data Standard Supports: SBML(level 3), BIOM(level1), PSI, OBO
  • Biological Pathway Database: MetaCyc, Reactome, KEGG data tools for .NET language
2.Biological Data Visualization software API for .NET
  • Circos API(genomic visualizing)
  • Cytoscape DataModel API(Biological network visualizing)
  • SequenceLogo(Molecular motif site visualize)
  • KEGG pathway map visualizer
3.Annotation Tools:
  • A complete NCBI localblast toolkit for proteins and nucleotide sequence analysis, includes parallel task library for Win/Linux Server and data analysis protocol.
  • SNP toolkit
  • Nucleotide sequence topology feature site analysis toolkit.
  • RegPrecise database tool and MEME software toolkit for the annotation of bacterial genomics regulation network.
  • Go(Gene Ontology) annotation tools
  • KEGG/GO GSEA functional enrichment tools and reference genome background model creator based on UniProt database.
4.R language hybrids environment for bioinformatics
  • Includes basically R language API wrapper for VisualBasic, like Api in base, utils, stats namespace from R base.
  • and some R package wrapper API from CRAN and Bioconductor is also included.
  • GCModeller R# language scripting
5.WebAPI wrapper for KEGG database and RegPrecise Database
6.Feature tools
  • Cellular module simulator, and virtual cell model generator protocol.
  • Proteomics data analysis toolkit
  • Single-cell data analysis toolkit

GCModeller R# scripting

Here is a code snapshot of R# scripting for drawing sequence logo, input data is accepted from the commandline input:

```R

Demo script for create sequence logo based on the MSA alignment analysis

nt base frequency is created based on the MSA alignment operation.

imports "bioseq.sequenceLogo" from "seqtoolkit.dll"; imports "bioseq.fasta" from "seqtoolkit.dll";

script cli usage

R# sequenceLogo.R --seq input.fasta [--title --save output.png]

get input data from commandline arguments and

fix for the optional arguments default value

by apply or default syntax for non-logical values

let seq.fasta as string = ?"--seq" || stop("No sequence input data for draw sequence logo!"); let logo.png as string = ?"--save" || ${seq.fasta}.logo.png; let title as string = ?"--title" || basename(seq.fasta);

read sequence and then do MSA alignment

finally count the nucleotide base frequency

and then draw the sequence logo

by invoke sequence logo drawer api

seq.fasta :> read.fasta :> MSA.of :> plot.seqLogo(title) :> save.graphics( file = logo.png ); ```

Run the R# script from commandline:

```cmd @echo off

R# ./sequenceLogo.R --seq LexA.fasta --save LexA.png --title "LexA" ```

Publications

Here listing the scientific paperworks that based on the analysis services of GCModeller:

  • Niu, X.-N., et al. (2015). "Complete sequence and detailed analysis of the first indigenous plasmid from Xanthomonas oryzae pv. oryzicola." BMC Microbiol 15(1): 1-15.

    • DOI: 10.1186/s12866-015-0562-x

Bacterial plasmids have a major impact on metabolic function and adaptation of their hosts. An indigenous plasmid was identified in a Chinese isolate (GX01) of the invasive phytopathogen Xanthomonas oryzae pv. oryzicola (Xoc), the causal agent of rice bacterial leaf streak (BLS). To elucidate the biological functions of the plasmid, we have sequenced and comprehensively annotated the plasmid.

Gallery

single cell data toolkit includes in GCModeller:

Visit our project home: http://gcmodeller.org

For developers

Here are some released library of the GCModeller is published on nuget, then you can install these library in VisualStudio from Package Manager Console:

```bash

Install Microsoft VisualBasic sciBASIC# runtime via nuget:

https://github.com/xieguigang/sciBASIC/

PM> Install-Package sciBASIC -Pre

The GCModeller core base library was released:

https://github.com/SMRUCC/GCModeller.Core

PM> Install-Package GCModeller.Core

The NCBI localblast analysis toolkit:

https://github.com/SMRUCC/ncbi-localblast

PM> Install-Package NCBI_localblast ```

For User

The GCModeller demo script and data for user tutorials can be download from these public data repository: + [Xanthomonas campestris pv. campestris 8004] GCModeller genomics Modelling Project.

genome map plot of Xanthomonas campestris pv. campestris 8004


Copyleft © SMRUCC genomics 2016. All rights reversed.

Owner

  • Name: SMRUCC
  • Login: SMRUCC
  • Kind: organization
  • Email: genomics@SMRUCC.org
  • Location: GuiLin City, China

Welcome to the SMRUCC genomics Institute

GitHub Events

Total
  • Watch event: 3
  • Push event: 862
Last Year
  • Watch event: 3
  • Push event: 862

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 21,967
  • Total Committers: 7
  • Avg Commits per committer: 3,138.143
  • Development Distribution Score (DDS): 0.183
Past Year
  • Commits: 3,762
  • Committers: 3
  • Avg Commits per committer: 1,254.0
  • Development Distribution Score (DDS): 0.162
Top Committers
Name Email Commits
xieguigang x****g@g****g 17,939
xieguigang x****g@g****m 2,026
guigang xie g****e@b****m 1,733
关倩倩 a****a@g****g 157
關倩倩 e****a@l****e 57
谢桂纲 x****g@l****m 33
dependabot[bot] 4****] 22
Committer Domains (Top 20 + Academic)

Dependencies

GCModeller/R/Pairwise-WeirCockerhams-FST/Geneclust/DESCRIPTION cran
  • deldir * depends
  • fields * depends
  • spatial * depends
package.json npm
  • eslint ^6.6.0 development
  • tap ^6.3.0 development
  • distributions ^1.0.0
  • summary 0.3.x
GCModeller/R/GCModeller.cli2R/DESCRIPTION cran
GCModeller/R/r-sparcc/DESCRIPTION cran
src/workbench/workbench/vendor/fancybox_v3/bower.json bower
  • jquery >=1.9.0
src/runtime/sciBASIC#/Data_science/Mathematica/CUDA/sciKernel/Cargo.toml cargo
src/GCModeller/annotations/WGCNA/run_test/metabolome/meta.json cpan
src/GCModeller/models/GPML/test/graph/meta.json cpan
src/runtime/sciBASIC#/Data_science/algorithms/Louvain/facebook_combined_graph/meta.json cpan
src/runtime/sciBASIC#/Data_science/algorithms/Louvain/testdata_graph/meta.json cpan
src/runtime/sciBASIC#/Data_science/algorithms/UnweightedFastUnfolding/meta.json cpan
src/workbench/R#/demo/cytoscape/automation/cytoscape_result/meta.json cpan
src/workbench/R#/demo/cytoscape/automation/meta.json cpan
src/workbench/pkg/test/network/meta.json cpan
src/GCModeller/annotations/GSEA/PFSNet/R/DESCRIPTION cran
src/interops/scripts/GCModeller/DESCRIPTION cran
src/repository/graphquery/kegg/DESCRIPTION cran
src/runtime/sciBASIC#/Data_science/Mathematica/images/ggFunctions/DESCRIPTION cran
  • R >= 3.1.0 depends
  • ggplot2 * depends
  • scales * depends
src/workbench/pkg/DESCRIPTION cran
src/runtime/sciBASIC#/Data_science/Mathematica/data/Student's T-test/ttest-js/package.json npm
  • eslint ^6.6.0 development
  • tap ^6.3.0 development
  • distributions ^1.0.0
  • summary 0.3.x
src/runtime/sciBASIC#/gr/network-visualization/mingle/package-lock.json npm
  • @types/node 14.14.14 development
  • typescript 4.1.3 development
src/runtime/sciBASIC#/gr/network-visualization/mingle/package.json npm
  • @types/node ^14.14.7 development
  • typescript ^4.0.5 development
src/workbench/typescript/InteractionViz/package-lock.json npm
  • @types/node 8.10.50 development
  • @types/sigmajs 1.0.27
src/workbench/typescript/InteractionViz/package.json npm
  • @types/node ^8.10.50 development
  • @types/sigmajs ^1.0.27
src/workbench/typescript/common/package-lock.json npm
  • @types/node 8.10.61 development
  • typescript 3.9.3 development
src/workbench/typescript/common/package.json npm
  • @types/node ^8.0.14 development
  • typescript ^3.2.2 development
src/workbench/typescript/console/package-lock.json npm
  • @types/node 8.10.60 development
  • typescript 3.9.2 development
src/workbench/typescript/console/package.json npm
  • @types/node ^8.0.14 development
  • typescript ^3.2.2 development
src/workbench/typescript/inspector/package.json npm
  • @types/node ^8.0.14 development
  • typescript ^3.2.2 development
src/workbench/workbench/package-lock.json npm
  • 155 dependencies
src/workbench/workbench/package.json npm
  • electron ^22.3.25 development
  • request ^2.88.2
  • tslint ^6.1.2
src/workbench/workbench/vendor/console/src/package-lock.json npm
  • @types/node 8.10.60 development
  • typescript 3.9.2 development
src/workbench/workbench/vendor/console/src/package.json npm
  • @types/node ^8.0.14 development
  • typescript ^3.2.2 development
src/workbench/workbench/vendor/fancybox_v3/package.json npm
  • del ^2.2.2 development
  • gulp ^3.9.1 development
  • gulp-autoprefixer ^3.1.1 development
  • gulp-concat ^2.6.1 development
  • gulp-cssnano ^2.1.2 development
  • gulp-header ^1.8.8 development
  • gulp-jshint ^2.0.4 development
  • gulp-livereload ^3.8.1 development
  • gulp-notify ^2.2.0 development
  • gulp-rename ^1.2.2 development
  • gulp-replace ^0.5.4 development
  • gulp-uglify ^2.0.0 development
  • gulp-util ^3.0.8 development
  • jshint ^2.9.4 development
src/GCModeller/data/GO_gene-ontology/GO_mysql/packages.config nuget
  • BouncyCastle 1.8.9
  • Google.Protobuf 3.17.0
  • K4os.Compression.LZ4 1.2.10-beta
  • K4os.Compression.LZ4.Streams 1.2.10-beta
  • K4os.Hash.xxHash 1.0.6
  • MySql.Data 8.0.25
  • SSH.NET 2020.0.1
  • System.Buffers 4.5.1
  • System.Memory 4.5.4
  • System.Numerics.Vectors 4.5.0
  • System.Runtime.CompilerServices.Unsafe 6.0.0-preview.3.21201.4
src/GCModeller/data/KEGG/packages.config nuget
  • BouncyCastle 1.8.9
  • EntityFramework 6.4.4
  • Google.Protobuf 3.17.0
  • K4os.Compression.LZ4 1.2.10-beta
  • K4os.Compression.LZ4.Streams 1.2.10-beta
  • K4os.Hash.xxHash 1.0.6
  • MySql.Data 8.0.25
  • MySql.Data.Entities 6.8.3.0
  • SSH.NET 2020.0.2
  • System.Buffers 4.5.1
  • System.Memory 4.5.4
  • System.Numerics.Vectors 4.5.0
  • System.Runtime.CompilerServices.Unsafe 6.0.0-preview.3.21201.4
src/runtime/httpd/src/HTTP_SERVER/demo/DemoWebApp/packages.config nuget
  • System.ValueTuple 4.3.0
  • VB.httpd 2.0.11-alpha
  • sciBASIC 2.3.0-beta
src/runtime/sciBASIC#/Data/test/packages.config nuget
  • System.ValueTuple 4.0.0-rc3-24212-01
src/runtime/sciBASIC#/Data_science/MachineLearning/MachineLearning/QLearning/Maze-Q-Learning/packages.config nuget
  • System.ValueTuple 4.4.0
src/runtime/sciBASIC#/Data_science/Mathematica/Math/test/packages.config nuget
  • System.ValueTuple 4.4.0-preview1-25305-02
src/runtime/sciBASIC#/Data_science/Visualization/test/ChartingBase.Test/packages.config nuget
  • System.ValueTuple 4.3.0
src/runtime/sciBASIC#/Data_science/algorithms/CMeans/DEMO/packages.config nuget
  • System.ValueTuple 4.3.0
src/runtime/sciBASIC#/docs/guides/parameter_expression/Demo/packages.config nuget
  • System.ValueTuple 4.3.0
src/runtime/sciBASIC#/gr/network-visualization/network_layout/layout/layout.csproj nuget
src/runtime/sciBASIC#/vs_solutions/dev/LicenseMgr/LicenseMgr/packages.config nuget
  • ModernUI.WPF 1.0.9