studentlife

studentlife: Tidy Handling and Navigation of a Valuable Mobile-Health Dataset - Published in JOSS (2019)

https://github.com/frycast/studentlife

Science Score: 93.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 4 DOI reference(s) in README and JOSS metadata
✓
Academic publication links
Links to: zenodo.org
○
Committers with academic emails
○
Institutional organization owner
✓
JOSS paper metadata
Published in Journal of Open Source Software

Scientific Fields

Artificial Intelligence and Machine Learning Computer Science - 40% confidence

Last synced: 6 months ago · JSON representation

Repository

Tidy handling and navigation of the valuable Student-Life mHealth dataset

Basic Info

Host: GitHub
Owner: frycast
License: gpl-3.0
Language: R
Default Branch: master
Homepage: http://studentlife.cs.dartmouth.edu/
Size: 1 MB

Statistics

Stars: 19
Watchers: 1
Forks: 9
Open Issues: 2
Releases: 1

Created almost 7 years ago · Last pushed almost 5 years ago

Metadata Files

Readme Changelog License

studentlife: Tidy Handling and Navigation of a Valuable Mobile-Health Dataset

$CRAN\_Release\_Badge$

This package is available on CRAN.

Use this R package to download, navigate and analyse the Student-Life dataset. The Student-Life dataset contains passive and automatic sensing data from the phones of a class of 48 de-identified Dartmouth college students. It was collected over a 10 week term. Additionally, the dataset contains Ecological Momentary Assessment results along with pre- and post-study mental health surveys, such as the PHQ-9. The intended use is to allow researchers and other interested people to assess mental health, academic performance and behavioral trends. The raw dataset and additional information is available at https://studentlife.cs.dartmouth.edu/.

Installation

``` r

Uncomment to install the package from CRAN

install.packages("studentlife")

Or, uncomment to install the package from GitHub

install.packages("devtools")

devtools::install_github("frycast/studentlife")

```

This studentlife repository includes a small sample dataset that can be used for practice and testing. You can download the sample data within R: library(studentlife) d <- tempdir() download_studentlife(location = d, url = "testdata")

This README will use the studentlife data in its original format. Details on the full original dataset are available here. However, an RData version of the dataset has been hosted on Zenodo as of 09/11/2019 here. You can download and use this RData version, which is faster to read and takes up less space. However, load_SL_tibble is not (yet) designed to work with the RData format (but all the transformation and exploration functions, such as regularise_time, add_block_labels, and vis_NAs still work fine, because they work with any SL_tbl object). If you're working with the RData format, just replace instances of load_SL_tibble in this README with equivalent instances of the following:

```r d <- tempdir() download_studentlife(location = d, url = "rdata")

Choose the schema and table from the list SL_tables:

SL_tables

Example with activity table from sensing schema

schema <- "sensing" table <- "activity" act <- readRDS(paste0(d, "/dataset_rds/", schema, "/", table, ".Rds")) act ```

In this README, we will use the full dataset, so the download size is 5GB.

r download_studentlife(location = d)

Use the interactive menu to browse the tables and schemas of the downloaded dataset:

r tab <- studentlife::load_SL_tibble(location = d)

The object returned by load_SL_tibble, or by loading any table from the RData format, is called a 'StudentLife tibble' (or SL_tbl).

If you're using the load_SL_tibble interactive menu, then restrictions can be placed on the menu options by changing the time_options parameter:

r tab_t <- load_SL_tibble(location = d, time_options = "timestamp", csv_nrows = 10) tab_p <- load_SL_tibble(location = d, time_options = "interval" , csv_nrows = 10) tab_d <- load_SL_tibble(location = d, time_options = "dateonly" , csv_nrows = 10) tab_s <- load_SL_tibble(location = d, time_options = "dateless" , csv_nrows = 10)

The regularise_time function can be used to summarise information within blocks of time, producing an object called a 'regularised StudentLife tibble' (or reg_SL_tbl):

``` r tab <- loadSLtibble( loc = d, schema = "sensing", table = "activity", csv_nrows = 10)

regularisetime( tab, blocks = c("day","weekday"), actinf = max(activityinference), addNAs = FALSE) ```

If you just want to add date, epoch, or other labels, without aggregating, you can use add_block_labels:

``` r tab <- loadSLtibble( loc = d, schema = "sensing", table = "activity", csv_nrows = 10)

btab <- addblocklabels(tab) btab ```

Produce a histogram showing PAM EMA response frequencies over the course of the study:

r tab_PAM <- load_SL_tibble(schema = "EMA", table = "PAM", location = d) response_hour_hist(tab_PAM, break_hours = 10)

A summary will produce details such as EMA question, UIDs of dropped students, schema name, table name, and summary statistics.

r summary(tab_PAM)

``` EMA_questions Refer to: Pollak, J. P., Adams, P., & Gay, G. (2011, May). PAM: a photographic affect meter for frequent, in situ measurement of affect. In Proceedings of the SIGCHI conference on Human factors in computing systems (pp. 725-734). ACM.

dropped_students None

columnnames pictureidx timestamp uid

schema EMA

table PAM

Skim summary statistics n obs: 9040 n variables: 3

-- Variable type:factor -------------------------------------------------------- variable missing complete n nunique topcounts ordered uid 0 9040 9040 49 59: 437, 0: 390, 19: 384, 57: 377 FALSE

-- Variable type:numeric ------------------------------------------------------- variable missing complete n mean sd p0 p25 p50 p75 p100 picture_idx 0 9040 9040 8.85 4.17 1 6 8 12 16
timestamp 0 9040 9040 1.4e+09 1608614.45 1.4e+09 1.4e+09 1.4e+09 1.4e+09 1.4e+09 hist ▂▃▆▇▃▆▅▅ ▇▇▆▅▃▁▁▁ ```

After regularising a StudentLife tibble, we can visualise the missing values in each block for each student:

r reg_PAM <- regularise_time(tab_PAM, blocks = c("day", "epoch"), m = mean(picture_idx, na.rm = TRUE)) vis_NAs(reg_PAM, response = "m")

We can also visualise and compare the total number of responses received from each student over the course of the study:

r vis_response_counts(reg_PAM, response = "m")

Summaries of survey data are formatted with question information and some answer statistics:

r tab_PHQ9 <- load_SL_tibble(loc = d, schema = "survey", table = "PHQ-9") summary(tab_PHQ9)

``` time_info none

survey_questions Q1: Little interest or pleasure in doing things Q2: Feeling down, depressed, hopeless. Q3: Trouble falling or staying asleep, or sleeping too much. Q4: Feeling tired or having little energy Q5: Poor appetite or overeating Q6: Feeling bad about yourself or that you are a failure or have let yourself or your family down Q7: Trouble concentrating on things, such as reading the newspaper or watching television Q8: Moving or speaking so slowly that other people could have noticed. Or the opposite being so figety or restless that you have been moving around a lot more than usual Q9: Thoughts that you would be better off dead, or of hurting yourself Q10: Response

column_names uid type Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10

schema survey

table PHQ-9

skim Skim summary statistics n obs: 84 n variables: 12

-- Variable type:factor -------------------------------------------------------- variable missing complete n nunique topcounts ordered Q1 0 84 84 4 Not: 42, Sev: 28, Mor: 10, Nea: 4 FALSE Q10 5 79 84 4 Som: 37, Not: 36, NA: 5, Ext: 3 FALSE Q2 0 84 84 4 Not: 40, Sev: 34, Mor: 6, Nea: 4 FALSE Q3 0 84 84 4 Not: 41, Sev: 25, Mor: 10, Nea: 8 FALSE Q4 0 84 84 4 Sev: 38, Not: 23, Mor: 17, Nea: 6 FALSE Q5 0 84 84 4 Sev: 35, Not: 34, Mor: 9, Nea: 6 FALSE Q6 0 84 84 4 Not: 45, Sev: 27, Mor: 6, Nea: 6 FALSE Q7 0 84 84 4 Not: 48, Sev: 26, Mor: 7, Nea: 3 FALSE Q8 0 84 84 4 Not: 64, Sev: 15, Mor: 4, Nea: 1 FALSE Q9 0 84 84 3 Not: 75, Sev: 6, Mor: 3, NA: 0 FALSE type 0 84 84 2 pre: 46, pos: 38, NA: 0 FALSE uid 0 84 84 46 0: 2, 1: 2, 2: 2, 3: 2 FALSE ```

Software Testing

This studentlife repository includes many automated software tests implemented via testthat. We use these to check for bugs before releasing new updates. They can be found under the directory tests. Also, we use Travis-CI for continuous integration.

Community Guidelines

Please give your feedback and report bugs at the issues page.
Contributions are welcome! The best way to contribute is to fork the project on GitHub, make a contribution to your fork, and then submit a pull request. A useful guide can be found here.
If you have questions or need support please email Daniel Fryer via the email on his GitHub profile.

Owner

Name: Daniel Vidali Fryer
Login: frycast
Kind: user
Location: Melbourne
Company: The University of Queensland and La Trobe University

Website: www.danielvfryer.com
Repositories: 2
Profile: https://github.com/frycast

JOSS Publication

studentlife: Tidy Handling and Navigation of a Valuable Mobile-Health Dataset

Published

August 21, 2019

DOI

10.21105/joss.01587

Volume 4, Issue 40, Page 1587

Authors

Daniel Fryer

Department of Mathematics and Statistics, La Trobe University, Bundoora 3086, Victoria Australia, School of Mathematics and Physics, University of Queensland, St. Lucia 4072, Queensland Australia

Hien Nguyen

Department of Mathematics and Statistics, La Trobe University, Bundoora 3086, Victoria Australia

Pierre Orban
Department of Psychiatry, University of Montreal, Montreal H3C 3J7, Quebec Canada

Editor

Ariel Rokem

GitHub Events

Total

Watch event: 3
Fork event: 2

Last Year

Watch event: 3
Fork event: 2

Committers

Last synced: 7 months ago

All Time

Total Commits: 203
Total Committers: 3
Avg Commits per committer: 67.667
Development Distribution Score (DDS): 0.034

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Daniel Vidali Fryer	2****t	196
Daniel Vidali Fryer	2****t	5
Kyle Niemeyer	k**r@g**m	2

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 10
Total pull requests: 2
Average time to close issues: 10 days
Average time to close pull requests: 25 minutes
Total issue authors: 4
Total pull request authors: 1
Average comments per issue: 0.2
Average comments per pull request: 1.0
Merged pull requests: 2
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

hiendn (6)
jminnier (2)
gabriela0099 (1)
frycast (1)

Pull Request Authors

kyleniemeyer (2)

Top Labels

Issue Labels

Pull Request Labels

Packages

Total packages: 1
Total downloads:
- cran 229 last-month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 2
Total maintainers: 1

cran.r-project.org: studentlife

Tidy Handling and Navigation of the Student-Life Dataset

Homepage: https://github.com/Frycast/studentlife
Documentation: http://cran.r-project.org/web/packages/studentlife/studentlife.pdf
License: GPL-3
Latest release: 1.1.0
published over 5 years ago

Versions: 2
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 229 Last month

Rankings

Forks count: 10.1%

Stargazers count: 17.9%

Dependent packages count: 29.8%

Average: 31.9%

Dependent repos count: 35.5%

Downloads: 66.2%

Maintainers (1)

d.fryer@latrobe.edu.au

Last synced: 6 months ago

Dependencies

DESCRIPTION cran

R >= 3.4.0 depends
R.utils >= 2.8.0 imports
crayon >= 1.3.4 imports
dplyr >= 0.8.0.1 imports
ggplot2 >= 3.1.1 imports
jsonlite >= 1.6 imports
purrr >= 0.3.2 imports
readr >= 1.3.1 imports
skimr >= 1.0.7 imports
tibble >= 2.0.1 imports
tidyr >= 0.8.3 imports
visdat >= 0.5.3 imports
testthat * suggests

studentlife

Science Score: 93.0%

Scientific Fields

Repository

Basic Info

Statistics

Metadata Files

README.md

studentlife: Tidy Handling and Navigation of a Valuable Mobile-Health Dataset

Installation

Uncomment to install the package from CRAN

install.packages("studentlife")

Or, uncomment to install the package from GitHub

install.packages("devtools")

devtools::install_github("frycast/studentlife")

Choose the schema and table from the list SL_tables:

Example with activity table from sensing schema

Software Testing

Community Guidelines

Owner

JOSS Publication

studentlife: Tidy Handling and Navigation of a Valuable Mobile-Health Dataset

Authors

Editor

Tags

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

cran.r-project.org: studentlife

Rankings

Maintainers (1)

Dependencies