cthist

Clinical Trial Registry History

https://github.com/bgcarlisle/cthist

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: plos.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.0%) to scientific vocabulary
Last synced: 7 months ago

Repository

Clinical Trial Registry History

Basic Info
  • Host: GitHub
  • Owner: bgcarlisle
  • License: agpl-3.0
  • Language: R
  • Default Branch: main
  • Size: 195 KB
Statistics
  • Stars: 8
  • Watchers: 1
  • Forks: 2
  • Open Issues: 1
  • Releases: 18
Created over 4 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog License

README.md

cthist

This package provides functions for mass-downloading and interpreting historical clinical trial registry entry data.

How to install

To install the stable version of cthist through CRAN:

```{r}
install.packages("cthist")
library(cthist)
```

If you want the most recent development version of cthist, you will need to install devtools first, and then install via git:

```{r}
install.packages("devtools")
library(devtools)
install_github("bgcarlisle/cthist")
library(cthist)
```

Functions provided by cthist

This package provides 5 functions for downloading and interpreting historical clinical trial data from ClinicalTrials.gov.

Download clinical trial version dates:

```{r}
## Get all the dates and status updates when the registry entries for
## NCT02110043 and NCT03281616 changed
clinicaltrials_gov_dates(c("NCT02110043", "NCT03281616"))
## # A tibble: 10 × 5
##    nctid       version_number total_versions version_date overall_status
##  1 NCT02110043              0              8 2014-04-08   RECRUITING
##  2 NCT02110043              1              8 2014-09-22   RECRUITING
##  3 NCT02110043              2              8 2014-10-13   RECRUITING
##  4 NCT02110043              3              8 2016-03-15   RECRUITING
##  5 NCT02110043              4              8 2016-12-20   RECRUITING
##  6 NCT02110043              5              8 2017-07-04   RECRUITING
##  7 NCT02110043              6              8 2017-07-26   ACTIVE_NOT_RECRUITING
##  8 NCT02110043              7              8 2021-05-20   COMPLETED
##  9 NCT03281616              0              2 2017-09-11   COMPLETED
## 10 NCT03281616              1              2 2017-09-18   COMPLETED

## Get all the dates when NCT02110043 had a change in overall status
clinicaltrials_gov_dates("NCT02110043", status_change_only=TRUE)
## # A tibble: 3 × 5
##   nctid       version_number total_versions version_date overall_status
## 1 NCT02110043              0              8 2014-04-08   RECRUITING
## 2 NCT02110043              6              8 2017-07-26   ACTIVE_NOT_RECRUITING
## 3 NCT02110043              7              8 2021-05-20   COMPLETED
```
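Once the version dates are available, the gap between successive versions is plain date arithmetic. The sketch below copies a few rows of the example output above by hand, so it runs without network access, and uses base R only. The column name `days_since_previous` is introduced here for illustration and is not part of cthist.

```r
# Rows copied by hand from the example output; nothing is downloaded here
dates <- data.frame(
  nctid = "NCT02110043",
  version_number = 0:3,
  version_date = as.Date(
    c("2014-04-08", "2014-09-22", "2014-10-13", "2016-03-15")
  ),
  stringsAsFactors = FALSE
)

# Days elapsed since the previous version (NA for the first version);
# days_since_previous is an illustrative name, not a cthist column
dates$days_since_previous <- c(NA, as.integer(diff(dates$version_date)))

print(dates)
```

The same idea extends to several trials if the differences are computed separately for each `nctid`.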

Download clinical trial registry entry version data:

```{r}
## Get the 4th version of NCT02110043
version_data <- clinicaltrials_gov_version("NCT02110043", 4)

## Get the 2nd item (enrolment) for that version
version_data$enrol
## [1] 22

## Get the 3rd item (enrolment type) for that version
version_data$enroltype
## [1] "ESTIMATED"
```

Mass-download clinical trial registry entry versions:

```{r}
## Download all data for all versions of NCT02110043 and store in
## variable versions
versions <- clinicaltrials_gov_download("NCT02110043")
```

Mass-download clinical trial registry entry versions for many trials and save to disk:

```{r}
## Download all data for all versions of NCT02110043 and NCT03281616
## and save to versions.csv
clinicaltrials_gov_download(c("NCT02110043", "NCT03281616"), "versions.csv")
```

Extract publications indexed on downloaded trial versions

The function clinicaltrials_gov_download downloads a data frame of versions of a trial's history, with the references column containing a nested JSON-encoded data frame of the publications that were indexed by ClinicalTrials.gov.

The function extract_publications interprets a data frame of the type returned by clinicaltrials_gov_download and returns a new data frame that contains only publications of the type specified ("RESULT", "BACKGROUND", or "DERIVED").

This function will provide one row for every publication of the type specified that was indexed on ClinicalTrials.gov for every version of the trial registry record contained on the data frame provided.

```{r}
## dplyr provides %>% and select()
library(dplyr)

## Download only the latest clinical trial registry entries for the
## specified NCT numbers and extract PMIDs for indexed RESULT
## publications
clinicaltrials_gov_download(
  c("NCT05784103", "NCT05780281"),
  latest=TRUE
) %>%
  extract_publications(type="RESULT") %>%
  select(nctid, pmid)
## # A tibble: 2 × 2
##   nctid       pmid
## 1 NCT05784103 28183823
## 2 NCT05780281 34928698
```
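To illustrate what "nested JSON-encoded data frame" means in practice, here is a minimal sketch of decoding one such cell by hand with jsonlite. The JSON string and its field names (`pmid`, `type`, `citation`) are assumptions made up for this example, not the exact structure cthist stores, and this is not how `extract_publications` is implemented.

```r
library(jsonlite)

# Hypothetical JSON standing in for one cell of the references column;
# the field names and values are assumptions for illustration only
references_json <- '[
  {"pmid": "11111111", "type": "BACKGROUND", "citation": "Example background paper"},
  {"pmid": "22222222", "type": "RESULT", "citation": "Example results paper"}
]'

# fromJSON() simplifies an array of objects into a data frame
refs <- fromJSON(references_json)

# Keep only the publications of the requested type
result_refs <- refs[refs$type == "RESULT", ]
print(result_refs$pmid)
```

In real use, `extract_publications` does this filtering across every version in the downloaded data frame, so hand-decoding is only needed for inspecting a single cell.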

Calculate overall status lengths

The function clinicaltrials_gov_download downloads a data frame of versions of a trial's history, with the overall_status column indicating the status of the trial on the date the entry is updated, which is specified in the version_date column.

The function overall_status_lengths interprets a data frame of the type returned by clinicaltrials_gov_download and returns a new data frame that contains a list of the NCT numbers and all the overall statuses that the trial in question passed through, and for how many days, optionally, within a specified timeframe.

```{r}
## Download the clinical trial registry entries for the specified NCT
## number(s) and calculate the number of days that each registry entry
## spends in a reported overall status within a prescribed time
## interval of interest (in this case, the years 2020-2022, inclusive)
clinicaltrials_gov_download(
  c("NCT04338971", "NCT03461211")
) %>%
  overall_status_lengths(
    start_date = "2020-01-01",
    end_date = "2022-12-31"
  )
## # A tibble: 5 × 3
## # Groups:   nctid [2]
##   nctid       overall_status          days
## 1 NCT03461211 ACTIVE_NOT_RECRUITING   406 days
## 2 NCT03461211 COMPLETED               668 days
## 3 NCT03461211 ENROLLING_BY_INVITATION  21 days
## 4 NCT04338971 COMPLETED               488 days
## 5 NCT04338971 WITHHELD                511 days
```
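One way to reason about this calculation: each status holds from its version date until the next version's date, and each such interval is clipped to the window of interest. The base R sketch below applies that idea to made-up dates. It is not the package's implementation; the helper `status_days`, the inclusive end date, and the choice to let the last status run to the end of the window are all assumptions.

```r
# Sum days per status within [start_date, end_date], assuming each status
# holds from its version date until the next version's date.
# status_days is a hypothetical helper, not a cthist function.
status_days <- function(version_date, overall_status, start_date, end_date) {
  start_date <- as.Date(start_date)
  # Treat the end date as inclusive by using the following day as boundary
  window_end <- as.Date(end_date) + 1
  ord <- order(version_date)
  from <- version_date[ord]
  status <- overall_status[ord]
  # Each interval ends where the next begins; the last runs to the window end
  to <- c(from[-1], window_end)
  # Clip every interval to the window, then drop negative lengths
  from <- pmax(from, start_date)
  to <- pmin(to, window_end)
  days <- pmax(as.integer(to - from), 0L)
  tapply(days, status, sum)
}

# Hypothetical trial history, not real registry data
days_by_status <- status_days(
  version_date = as.Date(c("2019-06-01", "2020-03-01", "2021-01-01")),
  overall_status = c("RECRUITING", "ACTIVE_NOT_RECRUITING", "COMPLETED"),
  start_date = "2020-01-01",
  end_date = "2020-12-31"
)
print(days_by_status)
```

In this made-up history the three totals add up to 366, the number of days in 2020, which is a quick sanity check that the clipping neither drops nor double-counts days.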

What data is extracted?

| Variable                          | Data type          |
|:----------------------------------|-------------------:|
| Version number (0, 1, 2, etc.)    | Double             |
| Version date (ISO-8601)           | Date               |
| Overall status                    | Character          |
| Start date                        | Date               |
| Start date precision              | Character          |
| Primary completion date           | Date               |
| Primary completion date precision | Date               |
| Primary completion date type      | Character          |
| Enrolment                         | Double             |
| Enrolment type                    | Character          |
| Inclusion and exclusion criteria  | Character (HTML)   |
| Outcome measures                  | JSON-encoded table |
| Overall contacts                  | JSON-encoded table |
| Central contacts                  | JSON-encoded table |
| Responsible party                 | JSON-encoded table |
| Lead sponsor                      | JSON-encoded table |
| Collaborators                     | JSON-encoded table |
| Locations                         | JSON-encoded table |
| "Why stopped?"                    | Character          |
| Results posted                    | Logical            |
| References                        | JSON-encoded table |
| Organization study ID             | Character          |
| Secondary IDs                     | JSON-encoded table |

Note regarding ClinicalTrials.gov July 2023 website re-write

For cthist v >= 2.0.0, the method for downloading has been updated to reflect the new version of ClinicalTrials.gov. Because the data on the updated website are presented differently from the way they were scraped from the old version, some output has changed. For example, the overall status field is now in all caps.

DRKS.de functions deprecated

Update as of 2022-12-11

DRKS.de has recently been updated in a manner that makes scraping data more difficult and so the functions related to DRKS.de have been deprecated, at least temporarily while I assess the changes.

Note on use

Please note that this script is provided under AGPL v3, so you may use it for any purpose. However, if you modify it, you must provide access to your modified version, or you are in violation of the terms of the license.

Citing cthist

@Manual{bgcarlisle-cthist,
  title = {Analysis of Clinical Trial Registry Entry Histories Using the Novel {{R}} Package cthist},
  author = {Carlisle, Benjamin Gregory},
  date = {2022-07-01},
  journaltitle = {PLOS ONE},
  shortjournal = {PLOS ONE},
  volume = {17},
  number = {7},
  pages = {e0270909},
  publisher = {{Public Library of Science}},
  issn = {1932-6203},
  doi = {10.1371/journal.pone.0270909},
  url = {https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0270909}
}

Please open an issue in the issue tracker above if you find a bug, need this package to download some historical trial data that it currently does not capture, or if you would like to collaborate on a project that uses this tool.

If you used my package in your research and you found it useful, I would take it as a kindness if you cited it.

Best,

Benjamin Gregory Carlisle PhD

Owner

  • Name: The research fairy
  • Login: bgcarlisle
  • Kind: user

I'm the research fairy, here to make your academic problems disappear!

GitHub Events

Total
  • Push event: 1
  • Create event: 1
Last Year
  • Push event: 1
  • Create event: 1

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 184
  • Total Committers: 3
  • Avg Commits per committer: 61.333
  • Development Distribution Score (DDS): 0.054
Past Year
  • Commits: 44
  • Committers: 1
  • Avg Commits per committer: 44.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Benjamin Gregory Carlisle PhD b****e 174
Benjamin Gregory Carlisle PhD m****h@b****m 9
Carlisle c****g@c****e 1

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 17
  • Total pull requests: 6
  • Average time to close issues: 4 months
  • Average time to close pull requests: less than a minute
  • Total issue authors: 5
  • Total pull request authors: 1
  • Average comments per issue: 3.47
  • Average comments per pull request: 0.0
  • Merged pull requests: 6
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 2
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 2.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • bgcarlisle (9)
  • maia-sh (5)
  • under-score (1)
  • sama9767 (1)
  • YunChiLo (1)
Pull Request Authors
  • bgcarlisle (7)
Top Labels
Issue Labels
  • enhancement (3)
  • wontfix (1)

Packages

  • Total packages: 1
  • Total downloads:
    • cran 464 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 18
  • Total maintainers: 1
cran.r-project.org: cthist

Clinical Trial Registry History

  • Versions: 18
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 464 Last month
Rankings
Stargazers count: 24.2%
Forks count: 28.8%
Downloads: 29.1%
Average: 29.5%
Dependent packages count: 29.8%
Dependent repos count: 35.5%
Maintainers (1)
Last synced: 7 months ago

Dependencies

DESCRIPTION cran
  • RCurl * imports
  • assertthat * imports
  • dplyr * imports
  • httr * imports
  • jsonlite * imports
  • magrittr * imports
  • polite * imports
  • readr * imports
  • rlang * imports
  • rvest * imports
  • selectr * imports
  • stringr * imports
  • tibble * imports
  • testthat >= 3.0.0 suggests
.github/workflows/rhub.yaml actions
  • r-hub/actions/checkout v1 composite
  • r-hub/actions/platform-info v1 composite
  • r-hub/actions/run-check v1 composite
  • r-hub/actions/setup v1 composite
  • r-hub/actions/setup-deps v1 composite
  • r-hub/actions/setup-r v1 composite