https://github.com/benjamin-lee/deep-rules

Ten Quick Tips for Deep Learning in Biology

https://github.com/benjamin-lee/deep-rules

Science Score: 20.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: plos.org
  • Committers with academic emails
    16 of 56 committers (28.6%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary

Keywords

bioinformatics biology computational-biology data-science deep-learning genomics machine-learning manubot manuscript

Keywords from Contributors

hetnets networks dwpcs hetnet-connectivity-search pandoc annotation nextflow citations conda publishing
Last synced: 9 months ago · JSON representation

Repository

Ten Quick Tips for Deep Learning in Biology

Basic Info
Statistics
  • Stars: 226
  • Watchers: 26
  • Forks: 45
  • Open Issues: 65
  • Releases: 0
Topics
bioinformatics biology computational-biology data-science deep-learning genomics machine-learning manubot manuscript
Created over 7 years ago · Last pushed over 4 years ago
Metadata Files
Readme Contributing License Code of conduct

README.md

Ten Quick Tips for Deep Learning in Biology

HTML Manuscript PDF Manuscript Manubot Twitter URL

Manuscript description

This is the manuscript for a community-written article aimed at PLOS Computational Biology. Deep learning is exploding in popularity and is increasingly finding its way into biological data analysis. However, since most biologists receive little (or no) data science training, using deep learning properly can be a daunting task.

We're going to try to fix that. By boiling down the community's knowledge into ten quick tips, we hope to increase the number of biological researchers using DL (by making it more inviting) and the quality of the research (by helping them avoid common mistakes).

Join us!

Where we are now

Currently, we're submitting a revised version of the manuscript in response to reviewer feedback.

The raw text of the manuscript is stored within the content directory. The rendered manuscript may be viewed as HTML here or as a PDF here.

How you can help

Help of any kind is highly appreciated. Want to pitch in but not sure how?

For more information, see the contribution guidelines. All contributions are subject to the code of conduct.

A note on the project's name

Originally, this paper was going to be titled "Ten simple rules for deep learning in biology." However, the journal asked that we change the name to "n quick tips for deep learning in biology" to be placed into the educational section as well as to give us more latitude with the number of tips we have. While we changed the name of the paper, for simplicity, we kept the name of the repo as deep-rules.

Manubot

Manubot is a system for writing scholarly manuscripts via GitHub. Manubot automates citations and references, versions manuscripts using git, and enables collaborative writing via GitHub. An overview manuscript presents the benefits of collaborative writing with Manubot and its unique features. The rootstock repository is a general purpose template for creating new Manubot instances, as detailed in SETUP.md. See USAGE.md for documentation how to write a manuscript.

Please open an issue for questions related to Manubot usage, bug reports, or general inquiries.

Repository directories & files

The directories are as follows:

  • content contains the manuscript source, which includes markdown files as well as inputs for citations and references. See USAGE.md for more information.
  • output contains the outputs (generated files) from Manubot including the resulting manuscripts. You should not edit these files manually, because they will get overwritten.
  • webpage is a directory meant to be rendered as a static webpage for viewing the HTML manuscript.
  • build contains commands and tools for building the manuscript.
  • ci contains files necessary for deployment via continuous integration.

Local execution

The easiest way to run Manubot is to use continuous integration to rebuild the manuscript when the content changes. If you want to build a Manubot manuscript locally, install the conda environment as described in build. Then, you can build the manuscript on POSIX systems by running the following commands from this root directory.

```sh

Activate the manubot conda environment (assumes conda version >= 4.4)

conda activate manubot

Build the manuscript, saving outputs to the output directory

bash build/build.sh

At this point, the HTML & PDF outputs will have been created. The remaining

commands are for serving the webpage to view the HTML manuscript locally.

This is required to view local images in the HTML output.

Configure the webpage directory

manubot webpage

You can now open the manuscript webpage/index.html in a web browser.

Alternatively, open a local webserver at http://localhost:8000/ with the

following commands.

cd webpage python -m http.server ```

Sometimes it's helpful to monitor the content directory and automatically rebuild the manuscript when a change is detected. The following command, while running, will trigger both the build.sh script and manubot webpage command upon content changes:

sh bash build/autobuild.sh

Continuous Integration

Manubot

Whenever a pull request is opened, Travis CI will test whether the changes break the build process to generate a formatted manuscript. The build process aims to detect common errors, such as invalid citations. If your pull request build fails, see the Travis CI logs for the cause of failure and revise your pull request accordingly.

When a commit to the master branch occurs (for example, when a pull request is merged), Travis CI builds the manuscript and writes the results to the gh-pages and output branches. The gh-pages branch uses GitHub Pages to host the following URLs:

  • HTML manuscript at https://Benjamin-Lee.github.io/deep-rules/
  • PDF manuscript at https://Benjamin-Lee.github.io/deep-rules/manuscript.pdf

For continuous integration configuration details, see .travis.yml.

License

License: CC BY 4.0 License: CC0 1.0

Except when noted otherwise, the entirety of this repository is licensed under a CC BY 4.0 License (LICENSE.md), which allows reuse with attribution. Please attribute by linking to https://github.com/Benjamin-Lee/deep-rules.

Since CC BY is not ideal for code and data, certain repository components are also released under the CC0 1.0 public domain dedication (LICENSE-CC0.md). All files matched by the following glob patterns are dual licensed under CC BY 4.0 and CC0 1.0:

  • *.sh
  • *.py
  • *.yml / *.yaml
  • *.json
  • *.bib
  • *.tsv
  • .gitignore

All other files are only available under CC BY 4.0, including:

  • *.md
  • *.html
  • *.pdf
  • *.docx

Please open an issue for any question related to licensing.

Owner

  • Name: Benjamin Lee
  • Login: Benjamin-Lee
  • Kind: user
  • Location: Tysons, VA

NIH-OxCam scholar at @ncbi & DPhil student at Oxford. Previously @Harvard CS '20 and @IQTLabs

GitHub Events

Total
Last Year

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 1,034
  • Total Committers: 56
  • Avg Commits per committer: 18.464
  • Development Distribution Score (DDS): 0.772
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Benjamin Lee b****e@m****m 236
Daniel Himmelstein d****n@g****m 200
Benjamin Lee b****e@c****u 93
Anthony Gitter g****r@b****u 59
rasbt m****l@s****m 54
mdkessler 4****r 54
Anthony Gitter a****r 40
Alexander Titus A****s 32
Finlay Maguire f****e@g****m 32
Casey Greene c****e 30
Vincent Rubinetti v****i@g****m 24
ajlee21 a****1@g****m 19
Casey Greene g****t@g****m 13
chevrm c****m@g****m 11
evancofer e****r@g****m 10
khyu 3****u 9
Casey Greene c****e@u****u 8
pstew p****t@g****m 8
Thiago Britto Borges t****s@d****k 7
Juan J. Carmona c****a@m****u 7
Elana J Fertig e****g@j****u 7
alxndrkalinin 1****n 7
betsig b****l@g****m 6
Venkat Malladi v****i@g****m 6
Paul Stewart p****w 6
David Slochower s****r@g****m 5
Simina M. Boca s****0@g****u 4
Julian Stanley j****y@c****u 3
blengerich b****h@g****m 3
Tim Triche, Jr t****e@g****m 3
and 26 more...

Issues and Pull Requests

Last synced: over 2 years ago

All Time
  • Total issues: 36
  • Total pull requests: 153
  • Average time to close issues: 5 months
  • Average time to close pull requests: 6 days
  • Total issue authors: 6
  • Total pull request authors: 21
  • Average comments per issue: 9.19
  • Average comments per pull request: 1.16
  • Merged pull requests: 146
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Benjamin-Lee (12)
  • juancarmona (2)
  • agitter (2)
  • rasbt (1)
  • SiminaB (1)
  • AlexanderTitus (1)
Pull Request Authors
  • Benjamin-Lee (17)
  • rasbt (15)
  • agitter (10)
  • fmaguire (6)
  • mdkessler (5)
  • signalbash (3)
  • SiminaB (3)
  • ajlee21 (3)
  • ejfertig (2)
  • evancofer (2)
  • khyu (2)
  • pstew (2)
  • cgreene (2)
  • juancarmona (2)
  • alxndrkalinin (1)
Top Labels
Issue Labels
meta (6) paper (1)
Pull Request Labels

Dependencies

build/environment.yml pypi
  • errorhandler ==2.0.1
  • isbnlib ==3.10.3
  • opentimestamps ==0.4.1
  • opentimestamps-client ==0.7.0
  • pandoc-eqnos ==2.3.0
  • pandoc-fignos ==2.3.1
  • pandoc-tablenos ==2.2.2
  • pandoc-xnos ==2.4.3
  • pybase62 ==0.4.3
  • pysha3 ==1.0.2
  • python-bitcoinlib ==0.10.2
  • requests-cache ==0.5.2
  • weasyprint ==0.42.3