https://github.com/alan-turing-institute/data-training-for-bioscience

Introduction to Data Science Project Management for Project Leaders.

https://github.com/alan-turing-institute/data-training-for-bioscience

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.4%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Introduction to Data Science Project Management for Project Leaders.

Basic Info
  • Host: GitHub
  • Owner: alan-turing-institute
  • License: other
  • Default Branch: main
  • Homepage:
  • Size: 3.59 MB
Statistics
  • Stars: 10
  • Watchers: 8
  • Forks: 1
  • Open Issues: 31
  • Releases: 0
Created almost 5 years ago · Last pushed over 2 years ago
Metadata Files
Readme Contributing License Code of conduct

README.md

Turing-Crick Partnership Project

This project will facilitate the development of strategic partnerships and resources around skills and capacity building in data science in biomedical research.

Phase 2: Masterclasses in Data Science and AI for senior researcher

Introduction to Data Science and AI for senior researchers, group leaders, late PhD/Postdocs and mid to late-career biomedical scientists. Materials developed through this project will enable a foundational understanding of AI and data science in the context of biosciences. Furthermore, researchers will receive training for managing, supervising and facilitating open and reproducible research for the wider biology community. Funded by the AI for Science and Government Research programme, this project ran from October 2021 to March 2022.

This project is a follow-up of The Crick-Turing Biomedical Data Science Awards (BDSAs) (Phase 1 project period: 01/10/2019 – 28/02/2021) carried out under the Turing and Crick partnership.

Masterclasses developed in this programme

  • Introduction to Data Science and AI for senior researchers: https://carpentries-incubator.github.io/data-science-ai-senior-researchers/
  • Managing Open and Reproducible Computational Projects: https://carpentries-incubator.github.io/managing-computational-projects/

Researchers from outside this project were invited to review and enhance these materials by integrating real-world examples from their work. Additionally, professional illustrators (Scriberia) worked with researchers in this project to develop illustrations to be paired with the written contents.

All materials including the illustrations (see illustrations-from-review-sprint) are shared under CC-BY 4.0 License for reuse, remix, sharing and distribution with appropriate citation.

Project members

Proposal Lead - Dr. Malvika Sharan, Senior Researcher - Tools, Practices and Systems

Development Team

Reviewers and Editors

Contributors from the Turing Research Programmes

Contributors from The Francis Crick Institute

  • Prof. James Briscoe, Senior Group Leader - Assistant Research Director
  • Rebecca Wilson, Head of Strategic Partnerships
  • James Fleming, Chief Information Officer

Thanks to these researchers for sharing feedback and examples to include on the earlier drafts of our training materials!

  • Victor Tybulewicz: Group Leader, The Francis Crick Institute, Lab page
  • Radoslav Enchev: Group Leader, The Francis Crick Institute, Lab page
  • Francesca Ciccarelli: Group Leader, The Francis Crick Institute, Lab page
  • Florencia Iacaruso: Group Leader, The Francis Crick Institute, Lab page
  • Evangenline Corcoran: Postdoctoral Researcher, The Alan Turing Institute, Personal page
  • Jim Maas: Postdoctoral Scientist - Computational Biologist, John Innes Center, Personal page

More members from both the Turing and Crick represent this partnership, contribute to project meetings and help coordinate this project.

Please see the project proposal for details.

Please create an issue to share references or ideas related to the development of this project.

🎯 Roadmap

Please see the Project Charter for details.

Logistics before starting the project

  • [x] Draft a proposal collaboratively to define the scope of this project
  • [x] Set up the repository to develop this project openly
  • [x] Define the scope and stakeholders for the project (help develop a project charter)
  • [x] Identify potential contributors to this project at the Turing
  • [x] Identify potential contributors to this project at the Crick
  • [x] Define the common vision, mission and target audience
  • [x] Host meetings with all stakeholders to discuss
    • The initial plans, project charter and goals
    • Agree on the best way to collaborate and communicate
    • Monthly updates and feedback on the development (align expectations)
  • [x] Identify potential contributors from the wider research community
    • Other institutes, projects and people with a vested interest

Development Tasks

  • [x] Define curriculum by selecting topics for content development (build concept map)
  • [x] Select open source references for reuse (see issues for reference materials)
  • [x] Design training curriculum (concept map, data, reusable materials) using Carpentries Development Handbook
  • [x] Set up the Carpentries template for material development (see community lessons)
  • [x] Define episodes (modules) and adapt training materials for biological datasets <-- REG member
  • [x] Select biological datasets - potentially provided by the Crick through 1:1 interviews
  • [x] Seek feedback from all stakeholders and invite contributions <-- Review and illustration sprint
  • [x] Release the draft and invite the community to test the materials
  • [x] Deliver a pilot training

Main deliverables

Training materials for two masterclasses will be developed and shared from this project.

  1. Introduction to data science and AI for senior researchers: This masterclass will also touch on some concepts related to algorithm selection, statistical approaches and the potential additionality of Machine Learning and Deep Learning.
  2. Managing and supervising computational Projects: This masterclass will provide an understanding of open source tools, version control, literate programming, Markdown, GitHub, metadata and other collaborative approaches.

Inviting feedback from the mid-to late-career researchers from the Turing, the Crick and wider research communities, these masterclasses will build a shared understanding of good practice principles to facilitate the integration of reproducible computational approaches from data science into biological research.

The Carpentries Incubator

The training materials will be developed openly from the start under The Carpentries Incubator GitHub organisation.

These are two separate GitHub repositories for the two masterclasses: - Masterclass 1: Introduction to Data Science and AI for senior researchers: https://github.com/carpentries-incubator/data-science-ai-senior-researchers - Masterclass 2: Managing Open and Reproducible Computational Projects: https://github.com/carpentries-incubator/managing-computational-projects

Though developed under subtitles Masterclass 1 and 2, both the materials will be standalone and modular to encourage their use independently of each other.

Please create an issue to add any milestones or goals that are currently missing from the roadmap, or to suggest new features.

📫 Contact

This project is maintained by Malvika Sharan. For any organisation-related queries or concerns, you can directly reach out to her by emailing msharan@turing.ac.uk.

♻️ License

This work is licensed under the MIT license (code) and Creative Commons Attribution 4.0 International license (for documentation). You are free to share and adapt the material for any purpose, even commercially, as long as you provide attribution (give appropriate credit, provide a link to the license, and indicate if changes were made) in any reasonable manner, but not in any way that suggests the licensor endorses you or your use, and with no additional restrictions.

Owner

  • Name: The Alan Turing Institute
  • Login: alan-turing-institute
  • Kind: organization
  • Email: info@turing.ac.uk

The UK's national institute for data science and artificial intelligence.

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 55
  • Total pull requests: 3
  • Average time to close issues: N/A
  • Average time to close pull requests: 22 days
  • Total issue authors: 3
  • Total pull request authors: 1
  • Average comments per issue: 1.6
  • Average comments per pull request: 0.67
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • malvikasharan (23)
  • LydiaFrance (7)
  • fedenanni (1)
Pull Request Authors
  • malvikasharan (2)
Top Labels
Issue Labels
Pull Request Labels