Science Score: 36.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ✓ Committers with academic emails: 5 of 156 committers (3.2%) from academic institutions
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (14.4%) to scientific vocabulary
Repository
R's data.table package extends data.frame:
Basic Info
- Host: GitHub
- Owner: Rdatatable
- License: mpl-2.0
- Language: R
- Default Branch: master
- Homepage: http://r-datatable.com
- Size: 58.1 MB
Statistics
- Stars: 3,769
- Watchers: 172
- Forks: 1,015
- Open Issues: 972
- Releases: 0
Metadata Files
README.md
data.table 
data.table provides a high-performance version of base R's data.frame with syntax and feature enhancements for ease of use, convenience and programming speed.
The data.table project uses a custom governance agreement
and is fiscally sponsored by NumFOCUS. Consider making
a tax-deductible donation to help the project
pay for developer time, professional services, travel, workshops, and a variety of other needs.
Why data.table?
- concise syntax: fast to type, fast to read
- fast speed
- memory efficient
- careful API lifecycle management
- community
- feature rich
Features
- fast and friendly delimited file reader: `?fread`, see also convenience features for small data
- fast and feature rich delimited file writer: `?fwrite`
- low-level parallelism: many common operations are internally parallelized to use multiple CPU threads
- fast and scalable aggregations; e.g. 100GB in RAM (see benchmarks on up to two billion rows)
- fast and feature rich joins: ordered joins (e.g. rolling forwards, backwards, nearest and limited staleness), overlapping range joins (similar to `IRanges::findOverlaps`), non-equi joins (i.e. joins using operators `>`, `>=`, `<`, `<=`), aggregate on join (`by=.EACHI`), update on join
- fast add/update/delete columns by reference by group using no copies at all
- fast and feature rich reshaping data: `?dcast` (pivot/wider/spread) and `?melt` (unpivot/longer/gather)
- any R function from any R package can be used in queries not just the subset of functions made available by a database backend, also columns of type `list` are supported
- has no dependencies at all other than base R itself, for simpler production/maintenance
- the R dependency is as old as possible for as long as possible, dated April 2014, and we continuously test against that version; e.g. v1.11.0 released on 5 May 2018 bumped the dependency up from 5 year old R 3.0.0 to 4 year old R 3.1.0
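A few of the features listed above can be sketched in one short session. The tables and column names here (`sales`, `prices`, `region`, `units` and so on) are made up for illustration; only the data.table functions themselves come from the package.

```r
library(data.table)

# Hypothetical toy data, invented for this sketch
sales = data.table(
  region = c("north", "north", "south", "south"),
  day    = c(1L, 3L, 2L, 5L),
  units  = c(10L, 20L, 5L, 15L)
)

# Write to a delimited file and read it back (fwrite / fread)
tmp = tempfile(fileext = ".csv")
fwrite(sales, tmp)
sales2 = fread(tmp)

# Add a column by reference, computed by group (sales2 is modified in place)
sales2[, region_total := sum(units), by = region]

# Rolling join: for each sale, take the latest price dated on or before that day
prices = data.table(day = c(1L, 4L), price = c(2.5, 3.0))
priced = prices[sales2, on = "day", roll = TRUE]

# Reshape: long to wide (dcast), then back to long (melt), dropping empty cells
wide = dcast(sales2, region ~ day, value.var = "units")
long = melt(wide, id.vars = "region", variable.name = "day",
            value.name = "units", na.rm = TRUE)
```

In the rolling join, `sales2` is passed as `i`, so the result has one row per sale, with the price carried forward from the last earlier entry in `prices`.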
Installation
```r
install.packages("data.table")

# latest development version (only if newer available)
data.table::update_dev_pkg()

# latest development version (force install)
install.packages("data.table", repos="https://rdatatable.gitlab.io/data.table")
```
See the Installation wiki for more details.
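After installing, a quick check along the following lines can confirm which version is loaded and how many threads the package will use. This is only a sketch; the values printed depend on the machine and the installed release.

```r
library(data.table)

# Version of the installed package
packageVersion("data.table")

# Number of threads data.table will use for its parallelized operations
getDTthreads()
```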
Usage
Use data.table subset `[` operator the same way you would use data.frame one, but...
- no need to prefix each column with `DT$` (like `subset()` and `with()` but built-in)
- any R expression using any package is allowed in `j` argument, not just list of columns
- extra argument `by` to compute `j` expression by group
```r
library(data.table)
DT = as.data.table(iris)

# FROM[WHERE, SELECT, GROUP BY]
# DT  [i,     j,      by]

DT[Petal.Width > 1.0, mean(Petal.Length), by = Species]
#      Species       V1
#1: versicolor 4.362791
#2:  virginica 5.552000
```
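The same `DT[i, j, by]` form extends to several aggregates at once. The sketch below reuses the built-in `iris` data; the output column names (`n`, `mean_len`, `max_len`) are chosen here for illustration and are not part of the package.

```r
library(data.table)
DT = as.data.table(iris)

# j can return several named columns; .N is the number of rows in each group
res = DT[Petal.Width > 1.0,
         .(n = .N, mean_len = mean(Petal.Length), max_len = max(Petal.Length)),
         by = Species]

# Chain a second [ ] to sort the grouped result by descending mean
res[order(-mean_len)]
```

Because no `setosa` row has `Petal.Width > 1.0`, the filter in `i` leaves only two groups in the result.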
Getting started
- Introduction to data.table vignette
- Getting started wiki page
- Examples produced by `example(data.table)`
Cheatsheets
Community
data.table is widely used by the R community. It is used directly by hundreds of CRAN and Bioconductor packages, and indirectly by thousands. It is one of the most-starred R packages on GitHub, and was highly rated by the Depsy project. If you need help, the data.table community is active on StackOverflow.
A list of packages that significantly support, extend, or make use of data.table can be found in the Seal of Approval document.
Stay up-to-date
- click the Watch button at the top right of the GitHub project page
- read NEWS file
- follow #rdatatable and the rdatatable account on X/Twitter
- follow #rdatatable and the rdatatable account on fosstodon
- follow the data.table community page on LinkedIn
- watch recent Presentations
- read recent Articles
- read posts on The Raft
Contributing
Guidelines for filing issues / pull requests: Contribution Guidelines.
Owner
- Name: Rdatatable
- Login: Rdatatable
- Kind: organization
- Repositories: 3
- Profile: https://github.com/Rdatatable
Committers
Last synced: 8 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Matt Dowle | m****e@g****m | 2,292 |
| arunsrinivasan | a****b@g****m | 986 |
| Michael Chirico | c****m@g****m | 699 |
| Jan Gorecki | j****i | 424 |
| Benjamin Schwendinger | 5****n | 109 |
| Toby Dylan Hocking | t****5@g****m | 71 |
| Pasha Stetsenko | p****a@h****i | 70 |
| Ani | b****6@g****m | 59 |
| aitap | k****t@g****m | 57 |
| Michael Chirico | m****o@g****m | 48 |
| badasahog | 5****g | 39 |
| Tom Short | t****t@e****m | 38 |
| Steve Lianoglou | s****u@g****m | 37 |
| Eduard Antonyan | e****n@g****m | 32 |
| Xianying Tan | s****n@1****m | 30 |
| nitish jha | n****a@n****l | 29 |
| Joshua Wu | j****4@g****m | 29 |
| Nitish Jha | 1****2 | 25 |
| HughParsonage | h****e@g****m | 20 |
| venom1204 | v****4@g****m | 16 |
| Cole Miller | 5****1 | 13 |
| MarkusBonsch | m****h@p****e | 13 |
| Scott Ritchie | s****3@g****m | 11 |
| Václav Tlapák | 5****k | 10 |
| Mukul | 1****4 | 10 |
| Kelly N. Bodwin | k****y@b****s | 9 |
| Rafael Fontenelle | r****e | 8 |
| Tyson Barrett | t****8@g****m | 8 |
| Philippe Chataignon | p****n | 7 |
| Rick Saporta | R****a@g****m | 7 |
| and 126 more... | | |
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 1,241
- Total pull requests: 1,776
- Average time to close issues: over 2 years
- Average time to close pull requests: 4 months
- Total issue authors: 427
- Total pull request authors: 91
- Average comments per issue: 4.22
- Average comments per pull request: 3.34
- Merged pull requests: 1,191
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 272
- Pull requests: 799
- Average time to close issues: 15 days
- Average time to close pull requests: 8 days
- Issue authors: 89
- Pull request authors: 46
- Average comments per issue: 1.67
- Average comments per pull request: 2.72
- Merged pull requests: 526
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- MichaelChirico (230)
- jangorecki (94)
- tdhock (79)
- arunsrinivasan (56)
- mattdowle (41)
- ben-schwen (21)
- TysonStanley (19)
- badasahog (17)
- renkun-ken (16)
- aitap (15)
- franknarf1 (14)
- iago-pssjd (11)
- OfekShilon (10)
- iagogv3 (10)
- shrektan (9)
Pull Request Authors
- MichaelChirico (696)
- ben-schwen (141)
- jangorecki (136)
- aitap (109)
- badasahog (95)
- venom1204 (77)
- tdhock (66)
- Nj221102 (59)
- joshhwuu (47)
- Anirban166 (33)
- Mukulyadav2004 (30)
- TysonStanley (19)
- DorisAmoakohene (14)
- rffontenelle (13)
- KyleHaynes (11)
Packages
- Total packages: 2
- Total downloads:
  - cran 842,742 last-month
- Total docker downloads: 124,987,508
- Total dependent packages: 1,850 (may contain duplicates)
- Total dependent repositories: 8,346 (may contain duplicates)
- Total versions: 89
- Total maintainers: 1
cran.r-project.org: data.table
Extension of 'data.frame'
- Homepage: https://r-datatable.com
- Documentation: http://cran.r-project.org/web/packages/data.table/data.table.pdf
- License: MPL-2.0 | file LICENSE
- Latest release: 1.17.8 (published 8 months ago)
Rankings
Maintainers (1)
conda-forge.org: r-data.table
- Homepage: https://r-datatable.com
- License: MPL-2.0
- Latest release: 1.14.6 (published over 3 years ago)
Rankings
Dependencies
- R >= 3.1.0 depends
- methods * imports
- R.utils * suggests
- bit >= 4.0.4 suggests
- bit64 >= 4.0.0 suggests
- curl * suggests
- knitr * suggests
- markdown * suggests
- nanotime * suggests
- rmarkdown * suggests
- xts * suggests
- yaml * suggests
- zoo >= 1.8 suggests
- actions/cache v2 composite
- actions/checkout v2 composite
- actions/upload-artifact main composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v1 composite
- actions/cache v2 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v1 composite
