DataFrames

In-memory tabular data in Julia

https://github.com/juliadata/dataframes.jl

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    16 of 234 committers (6.8%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.5%) to scientific vocabulary

Keywords

data data-frame dataframes datasets hacktoberfest julia tabular-data

Keywords from Contributors

data-structures graphs juliagraphs datastructures graph-analytics graph-algorithms graph-theory graphics sciml differential-equations
Last synced: 4 months ago · JSON representation ·

Repository

In-memory tabular data in Julia

Basic Info
Statistics
  • Stars: 1,782
  • Watchers: 44
  • Forks: 375
  • Open Issues: 159
  • Releases: 82
Topics
data data-frame dataframes datasets hacktoberfest julia tabular-data
Created over 13 years ago · Last pushed 5 months ago
Metadata Files
Readme Changelog Contributing License Citation

README.md

DataFrames.jl

codecov CI Testing DOI

Tools for working with tabular data in Julia.

Installation: at the Julia REPL, using Pkg; Pkg.add("DataFrames")

Documentation:

Reporting Issues and Contributing: See CONTRIBUTING.md

ColPrac: Contributor's Guide on Collaborative Practices for Community Packages

Maintenance: DataFrames is maintained collectively by the JuliaData collaborators. Responsiveness to pull requests and issues can vary, depending on the availability of key collaborators.

Learning: New to DataFrames.jl? Check out our free Julia Academy course which will walk you through how to use DataFrames.jl. You can also check out Bogumił Kamiński's DataFrames.jl tutorial that is available on GitHub.

Citing: We encourage you to cite our work if you have used DataFrames.jl package. Starring the DataFrames.jl repository on GitHub is also appreciated.

The citation information may be found in the CITATION.bib file within the repository:

Bouchet-Valat, M., & Kamiński, B. (2023). DataFrames.jl: Flexible and Fast Tabular Data in Julia. Journal of Statistical Software, 107(4), 1–32. https://doi.org/10.18637/jss.v107.i04

Owner

  • Name: JuliaData
  • Login: JuliaData
  • Kind: organization

Data manipulation, storage, and I/O in Julia

Citation (CITATION.bib)

@article{JSSv107i04,
 title={DataFrames.jl: Flexible and Fast Tabular Data in Julia},
 volume={107},
 url={https://www.jstatsoft.org/index.php/jss/article/view/v107i04},
 doi={10.18637/jss.v107.i04},
 abstract={DataFrames.jl is a package written for and in the Julia language offering flexible and efficient handling of tabular data sets in memory. Thanks to Julia’s unique strengths, it provides an appealing set of features: Rich support for standard data processing tasks and excellent flexibility and efficiency for more advanced and non-standard operations. We present the fundamental design of the package and how it compares with implementations of data frames in other languages, its main features, performance, and possible extensions. We conclude with a practical illustration of typical data processing operations.},
 number={4},
 journal={Journal of Statistical Software},
 author={Bouchet-Valat, Milan and Kamiński, Bogumił},
 year={2023},
 pages={1--32}
}

GitHub Events

Total
  • Create event: 9
  • Commit comment event: 1
  • Release event: 1
  • Issues event: 28
  • Watch event: 67
  • Delete event: 2
  • Issue comment event: 119
  • Push event: 31
  • Pull request review comment event: 16
  • Pull request review event: 26
  • Pull request event: 31
  • Fork event: 11
Last Year
  • Create event: 9
  • Commit comment event: 1
  • Release event: 1
  • Issues event: 28
  • Watch event: 67
  • Delete event: 2
  • Issue comment event: 119
  • Push event: 31
  • Pull request review comment event: 16
  • Pull request review event: 26
  • Pull request event: 31
  • Fork event: 11

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 2,384
  • Total Committers: 234
  • Avg Commits per committer: 10.188
  • Development Distribution Score (DDS): 0.745
Past Year
  • Commits: 27
  • Committers: 14
  • Avg Commits per committer: 1.929
  • Development Distribution Score (DDS): 0.704
Top Committers
Name Email Commits
Bogumił Kamiński b****s@s****l 609
John Myles White j****w@j****m 317
Tom Short t****t@i****g 203
Milan Bouchet-Valat n****n@c****r 200
Sean Garborg s****g@g****m 111
Simon Kornblith s****n@s****m 67
quinnj q****d@g****m 64
Harlan Harris h****s@k****m 43
Alexey Stukalov a****v@g****m 42
Cameron Prybol c****l@g****m 37
Douglas Bates d****s@g****m 37
Chris DuBois c****s@g****m 28
Kevin Squire k****e@g****m 26
pdeffebach 2****h 20
Alex Arslan a****n@c****t 19
Harlan Harris h****n@h****e 18
David Anthoff a****f@b****u 16
Alex Mellnik a****k@g****m 13
Dave Kleinschmidt d****t@b****u 12
Takafumi Arakaki a****f@g****m 11
tan t****m@g****m 11
Ronan Arraes Jardim Chagas r****r@g****m 10
Peter 4****h 10
Tom Short t****s@g****m 10
milktrader m****r@g****m 9
Andreas Noack a****n@g****m 9
Johan Gustafsson j****n@c****m 8
timema t****l@g****m 8
Viral B. Shah V****h 7
Lyndon White o****x@u****u 7
and 204 more...

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 169
  • Total pull requests: 145
  • Average time to close issues: 4 months
  • Average time to close pull requests: 27 days
  • Total issue authors: 101
  • Total pull request authors: 40
  • Average comments per issue: 5.19
  • Average comments per pull request: 3.31
  • Merged pull requests: 109
  • Bot issues: 0
  • Bot pull requests: 15
Past Year
  • Issues: 28
  • Pull requests: 34
  • Average time to close issues: about 18 hours
  • Average time to close pull requests: 6 days
  • Issue authors: 22
  • Pull request authors: 12
  • Average comments per issue: 0.57
  • Average comments per pull request: 1.38
  • Merged pull requests: 23
  • Bot issues: 0
  • Bot pull requests: 7
Top Authors
Issue Authors
  • bkamins (26)
  • jariji (13)
  • alex-s-gardner (5)
  • adienes (5)
  • pdeffebach (4)
  • stensmo (4)
  • George9000 (4)
  • ctarn (3)
  • eoteroe (3)
  • nalimilan (2)
  • mzy2240 (2)
  • MagicMuscleMan (2)
  • kescobo (2)
  • schlichtanders (2)
  • robsmith11 (2)
Pull Request Authors
  • bkamins (60)
  • nathanrboyer (12)
  • dependabot[bot] (10)
  • ViralBShah (6)
  • github-actions[bot] (5)
  • tdhock (4)
  • hyrodium (4)
  • LilithHafner (3)
  • leei (3)
  • likanzhan (2)
  • rOsemium (2)
  • agdestein (2)
  • drizk1 (2)
  • versionbaygt (2)
  • ronisbr (2)
Top Labels
Issue Labels
feature (36) decision (19) question (17) display (16) bug (12) doc (9) non-breaking (8) ecosystem (7) performance (6) joins (5) multithreading (3) grouping (2) reshaping (2) breaking (2) CI (1) help wanted (1) priority (1)
Pull Request Labels
doc (27) ecosystem (22) feature (21) bug (13) CI (6) non-breaking (4) display (3) decision (2) multithreading (1) grouping (1) reshaping (1)

Packages

  • Total packages: 3
  • Total downloads:
    • julia 19,644 total
  • Total dependent packages: 975
    (may contain duplicates)
  • Total dependent repositories: 604
    (may contain duplicates)
  • Total versions: 359
juliahub.com: DataFrames

In-memory tabular data in Julia

  • Versions: 67
  • Dependent Packages: 975
  • Dependent Repositories: 604
  • Downloads: 19,644 Total
Rankings
Dependent packages count: 0.0%
Dependent repos count: 0.0%
Forks count: 0.1%
Average: 0.1%
Stargazers count: 0.2%
Last synced: 4 months ago
proxy.golang.org: github.com/juliadata/dataframes.jl
  • Versions: 146
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 6.9%
Last synced: 4 months ago
proxy.golang.org: github.com/JuliaData/DataFrames.jl
  • Versions: 146
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 6.9%
Last synced: 4 months ago

Dependencies

.github/workflows/CompatHelper.yml actions
  • julia-actions/setup-julia latest composite
.github/workflows/TagBot.yml actions
  • JuliaRegistries/TagBot v1 composite
.github/workflows/ci.yml actions
  • actions/checkout v2 composite
  • codecov/codecov-action v1 composite
  • julia-actions/cache v1 composite
  • julia-actions/julia-buildpkg v1 composite
  • julia-actions/julia-buildpkg latest composite
  • julia-actions/julia-docdeploy latest composite
  • julia-actions/julia-processcoverage v1 composite
  • julia-actions/julia-runtest v1 composite
  • julia-actions/setup-julia v1 composite