open-source-survey

The Open Source Survey

https://github.com/github/open-source-survey

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org, zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary

Keywords

open-source survey
Last synced: 4 months ago

Repository

The Open Source Survey

Basic Info
Statistics
  • Stars: 532
  • Watchers: 399
  • Forks: 77
  • Open Issues: 0
  • Releases: 2
Topics
open-source survey
Created over 9 years ago · Last pushed 11 months ago
Metadata Files
Readme, Contributing, License, Code of conduct, Citation

README.md

The Open Source Survey

We've run one of the largest surveys of the open source community, with open datasets for us all to use and learn from. Our latest survey, conducted in 2024, updates the dataset and offers fresh insights into the open source ecosystem. We hope these datasets inform some of the most pressing questions about open source software, the people who create it, their experience, and their relationship to the industry that depends on it.

Learn more about the survey design and the topics we're studying.

Why is GitHub doing this?

At GitHub our goal is to help everyone build better software. We believe open source code, communities, and principles create better software. As an industry, we know a lot about how open source software is created but very little about the people who create and use it. Are they professional developers, students, or hobbyists?

To build better software, we need a software community where anyone, regardless of what they look like or where they come from, can participate. This survey will help us see how we, as a community, are doing.

Open data

Open source is bigger than any company or community. The dataset is released under CC0-1.0 for anyone to use and learn from.

Contributors

GitHub Open Source Survey 2024

Thank you to Kenyatta Forbes, Kevin Xu, Jeffrey Luszcz, Margaret Tucker, Eva Maxfield Brown, Peter Cihon, Mike Linksvayer, Ashley Wolf, Lukas Spieß, Kevin Crosby, and Jason Meridth.

GitHub Open Source Survey 2017

This survey is primarily designed and implemented by GitHub:

  • @franniez - Data and social scientist at GitHub. New to open source but not to studying people or movements, she's done extensive survey research in Washington, D.C., from inside the ivory tower, and within the technology sector.
  • @arfon - Program Manager for Open Source Data at GitHub. A lapsed academic with a passion for new models of scientific collaboration, he's used big telescopes to study dust in space, built sequencing technologies in Cambridge, and has engaged millions of people in online citizen science by co-founding the Zooniverse.
  • @mlinksva - Open Source Maven at GitHub. A lapsed engineer and non-lawyer with a passion for increasing the efficacy and scope of open production and policy, he is an advisor/director/volunteer for various open initiatives and was previously a manager and technologist at Creative Commons.

This isn't a solo effort for us; these awesome individuals and organizations have helped us design this survey:

Check out the contributing guidelines if you want to get involved.

License

The material in this repo is open data released under CC0-1.0. This means you need no copyright or database right (if any) permissions to make use of this data and survey questions. However:

  • Survey participants have not waived their privacy rights; read our Privacy Statement regarding Public Information on GitHub. In particular, do not attempt to reidentify survey participants.
  • If you use this dataset in a publication, a link to or citation of this repository would be appreciated.
  • If you extend this dataset, sharing your additions as open data would also be appreciated.
  • CC0-1.0 does not grant any trademark permissions. GitHub® and its stylized versions and the Invertocat mark are GitHub's Trademarks or registered Trademarks. When using GitHub's logos, be sure to follow the GitHub logo guidelines.

Citation info

GitHub Open Source Survey 2024

The data is additionally published on Zenodo, which provides a DOI as well as an easy way to generate citations in a number of formats. We suggest modifying autogenerated citations to reflect the original publication source, e.g., as below.

@misc{GitHub_GitHub_Open_Source,
  author = {{GitHub, Inc.} and Forbes, Kenyatta and Xu, Kevin and Luszcz, Jeffrey and Tucker, Margaret and Brown, Eva Maxfield and Cihon, Peter and Linksvayer, Mike and Wolf, Ashley and Speiß, Lukas and Crosby, Kevin and Meridth, Jason},
  title = {{GitHub Open Source Survey 2024}},
  month = oct,
  year = 2024,
  doi = {10.5281/zenodo.13989018},
  publisher = {GitHub, Inc.},
  url = {https://github.com/github/open-source-survey}
}

GitHub Open Source Survey 2017

The data is additionally published on Zenodo, which provides a DOI as well as an easy way to generate citations in a number of formats. We suggest modifying autogenerated citations to reflect the original publication source, e.g., as below.

@misc{GitHubOpenSourceSurvey2017,
  author = {Zlotnick, Frances},
  title = {GitHub Open Source Survey 2017},
  month = jun,
  year = 2017,
  doi = {10.5281/zenodo.806811},
  publisher = {GitHub, Inc.},
  howpublished = {\url{http://opensourcesurvey.org/2017/}}
}

Citations and Reuse

  • R. Stuart Geiger wrote a Summary Analysis of the 2017 GitHub Open Source Survey, "presenting frequency counts, proportions, and frequency or proportion bar plots for every question asked in the survey."
  • The LibreOffice Design Team asked users what aspects of open source are important, using questions from the Open Source Survey. Their summary includes a comparison with Open Source Survey responses, and their data is also released under CC0-1.0.

Acknowledgement

This survey was designed by GitHub with valuable input from the research and open source communities. We especially thank: Anna Filippova (Carnegie Mellon University), Andrea Forte (Drexel University), Edward Galvez (Wikimedia Foundation), Rebecca Weiss (Mozilla), and Laura Dabbish (Carnegie Mellon University) for conversations, research questions, and prior art that informed the questionnaire design; the Open Source Initiative for offsite sampling recruitment; the many members of the community who assisted with translations and suggestions for improving questions; and everyone who participated in the survey.

Owner

  • Name: GitHub
  • Login: github
  • Kind: organization
  • Location: San Francisco, CA

How people build software.

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: GitHub Open Source Survey 2024
message: >-
  If you use this dataset, please cite it using the metadata
  from this file.
type: dataset
authors:
  - name: 'GitHub, Inc.'
  - given-names: Kenyatta
    family-names: Forbes
  - given-names: Kevin
    family-names: Xu
  - given-names: Jeffrey
    family-names: Luszcz
  - given-names: Margaret
    family-names: Tucker
  - given-names: Eva Maxfield
    family-names: Brown
  - given-names: Peter
    family-names: Cihon
  - given-names: Mike
    family-names: Linksvayer
  - given-names: Ashley
    family-names: Wolf
  - given-names: Lukas
    family-names: Speiß
  - given-names: Kevin
    family-names: Crosby
  - given-names: Jason
    family-names: Meridth
identifiers:
  - type: doi
    value: 10.5281/zenodo.13989018
repository-code: 'https://github.com/github/open-source-survey'
url: 'http://opensourcesurvey.org/'
abstract: >-
  The Open Source Survey is an open data project by GitHub
  and collaborators from academia, industry, and the broader
  open source community.
keywords:
  - open data
  - open source
  - open source community
  - survey
license: CC0-1.0

GitHub Events

Total
  • Release event: 1
  • Watch event: 25
  • Delete event: 5
  • Issue comment event: 1
  • Push event: 10
  • Pull request review event: 4
  • Pull request event: 9
  • Fork event: 5
  • Create event: 6
Last Year
  • Release event: 1
  • Watch event: 25
  • Delete event: 5
  • Issue comment event: 1
  • Push event: 10
  • Pull request review event: 4
  • Pull request event: 9
  • Fork event: 5
  • Create event: 6

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 173
  • Total Committers: 26
  • Avg Commits per committer: 6.654
  • Development Distribution Score (DDS): 0.711
Past Year
  • Commits: 10
  • Committers: 2
  • Avg Commits per committer: 5.0
  • Development Distribution Score (DDS): 0.1
Top Committers
Name | Email | Commits
Arfon Smith | a****n | 50
Mike Linksvayer | m****a@g****m | 27
Brandon Keepers | b****s@g****m | 26
Frannie Zlotnick | f****z@g****m | 18
Caged | j****n@l****m | 11
Kevin Xu | k****u@g****m | 9
Mike McQuaid | m****e@m****m | 6
Sophie Shepherd | s****p@g****m | 5
nayafia | n****a@g****m | 2
Zafar | z****v@h****m | 2
Ben Balter | b****r@g****m | 2
Denise Yu | d****u@g****m | 1
Justin Palmer | j****n@g****m | 1
Alan Malloy | a****n@m****g | 1
Anna Filippova | a****l | 1
Aurélien Ooms | a****s | 1
Bogdan Vasilescu | v****u@g****m | 1
Daijiro Wachi | d****i@g****m | 1
JD Maturen | j****n@g****m | 1
Lee Reilly | l****e@g****m | 1
Matt Yoho | m****o@g****m | 1
Michael Warkentin | m****n@g****m | 1
Nick Coghlan | n****n@g****m | 1
Peter Dave Hello | h****u@p****g | 1
Zafar Khaydarov | z****r@l****m | 1
edmz | e****z | 1
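
The committer statistics above can be reproduced from the per-committer commit counts. The sketch below assumes the Development Distribution Score is defined as one minus the top committer's share of all commits; the page does not state the formula, so treat that definition as an assumption, although it does match the reported figures. The function names are illustrative, and the past-year split of 9 and 1 commits is inferred from the reported totals, not listed on the page.

```python
# Commit counts per committer, copied from the Top Committers table above.
commit_counts = [50, 27, 26, 18, 11, 9, 6, 5, 2, 2, 2] + [1] * 15


def average_commits(counts):
    """Mean number of commits per committer."""
    return sum(counts) / len(counts)


def development_distribution_score(counts):
    """Assumed definition: 1 minus the top committer's share of all commits."""
    return 1 - max(counts) / sum(counts)


# All-time figures: 26 committers, 173 commits.
print(len(commit_counts), sum(commit_counts))
print(round(average_commits(commit_counts), 3))                 # 6.654
print(round(development_distribution_score(commit_counts), 3))  # 0.711

# Past year: 10 commits by 2 committers. A 9/1 split (inferred, not listed)
# is the only one consistent with the reported DDS of 0.1.
print(round(development_distribution_score([9, 1]), 3))         # 0.1
```

Under this definition, a score near 0 means a single committer dominates and a score near 1 means commits are spread widely. That is consistent with the drop from 0.711 all time to 0.1 over the past year.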

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 44
  • Total pull requests: 60
  • Average time to close issues: 25 days
  • Average time to close pull requests: 3 days
  • Total issue authors: 30
  • Total pull request authors: 25
  • Average comments per issue: 1.77
  • Average comments per pull request: 0.97
  • Merged pull requests: 56
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: 4 days
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.25
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • arfon (11)
  • CoralineAda (4)
  • anjuan (2)
  • LOHAYBOY999 (2)
  • LemmingAvalanche (1)
  • Thorin-Oakenpants (1)
  • bkeepers (1)
  • azurelunatic (1)
  • michaeldorner (1)
  • copiesofcopies (1)
  • chrisma (1)
  • m-kuhn (1)
  • manfredbrandl (1)
  • franniez (1)
  • hhsadiq (1)
Pull Request Authors
  • bkeepers (12)
  • mlinksva (10)
  • franniez (9)
  • khxu (6)
  • MikeMcQuaid (4)
  • zafarella (2)
  • leereilly (2)
  • arfon (2)
  • PeterDaveHello (1)
  • annafil (1)
  • watilde (1)
  • jdmaturen (1)
  • amalloy (1)
  • ilyabrin (1)
  • sophshep (1)
Top Labels
Issue Labels
help wanted (7)
Pull Request Labels