tap-clinicaltrials

Singer tap for ClinicalTrials.gov study records data.

https://github.com/edgarrmondragon/tap-clinicaltrials

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.8%) to scientific vocabulary

Keywords

clinical-trials data-mining elt meltano pharma singer-io singer-sdk
Last synced: 6 months ago

Repository

Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 1
  • Open Issues: 2
  • Releases: 14
Created about 2 years ago · Last pushed 6 months ago
Metadata Files
Readme License Citation Codeowners

README.md

tap-clinicaltrials

Singer tap for ClinicalTrials.gov study records data.

Built with the Meltano Tap SDK for Singer Taps.

Capabilities

  • catalog
  • state
  • discover
  • about
  • stream-maps

Settings

| Setting              | Required | Default | Description |
|:---------------------|:--------:|:-------:|:------------|
| start_date           | False    | None    | Earliest datetime to get data from |
| condition            | False    | None    | Conditions or disease query |
| sponsor              | False    | None    | Sponsor query |
| stream_maps          | False    | None    | Config object for stream maps capability. For more information check out Stream Maps. |
| stream_map_config    | False    | None    | User-defined config values to be used within map expressions. |
| flattening_enabled   | False    | None    | 'True' to enable schema flattening and automatically expand nested properties. |
| flattening_max_depth | False    | None    | The max depth to flatten schemas. |
| batch_config         | False    | None    | |

A full list of supported settings and capabilities is available by running `tap-clinicaltrials --about`.
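As a rough illustration, the three filter settings could be written to a JSON file for use with the tap's `--config` option. The file name `config.json` and the helper `write_config` are hypothetical, and the values simply mirror the ones used in the Meltano usage steps; treat this as a sketch rather than the project's prescribed workflow.

```python
import json
from pathlib import Path

# Setting names follow the spelling used by the `meltano config` commands
# in this README; the values are illustrative only.
config = {
    "start_date": "2020-01-01",
    "condition": "COVID-19",
    "sponsor": "Pfizer",
}


def write_config(path: str = "config.json") -> Path:
    """Write the tap settings as JSON and return the file path."""
    target = Path(path)
    target.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return target


if __name__ == "__main__":
    print(write_config())
```

The resulting file could then be passed wherever the README shows `--config CONFIG`.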

Installation

In a Meltano project

Using a direct reference

```bash
meltano add extractor tap-clinicaltrials --from-ref=https://raw.githubusercontent.com/edgarrmondragon/tap-clinicaltrials/main/plugin.yaml
```

Requires Meltano v3.1.0+.

From MeltanoHub

Not yet available.

From PyPI

```bash
python3 -m pip install --upgrade tap-clinicaltrials
```

With pipx

```bash
pipx install tap-clinicaltrials
```

From source

```bash
git clone https://github.com/edgarrmondragon/tap-clinicaltrials
cd tap-clinicaltrials
python3 -m pip install .
```

Usage

You can easily run tap-clinicaltrials by itself or in a pipeline using Meltano.

With Meltano

  1. Clone the repo and cd into it:

```bash
git clone https://github.com/edgarrmondragon/tap-clinicaltrials.git
cd tap-clinicaltrials
```

  2. Make sure you have Meltano installed.

  3. Install all plugins:

```bash
meltano install
```

  4. Configure the tap-clinicaltrials tap:

```bash
meltano config tap-clinicaltrials set start_date '2020-01-01'
meltano config tap-clinicaltrials set condition 'COVID-19'
meltano config tap-clinicaltrials set sponsor 'Pfizer'
```

  5. Run a test tap-clinicaltrials extraction:

```bash
meltano run tap-clinicaltrials target-duckdb
```

  6. That's it! Check the data:

```console
$ duckdb output/warehouse.duckdb -c "select nctid, lastUpdateSubmitDate, protocolsection->>'$.identificationModule.briefTitle' from clinicaltrials.studies limit 5;"
┌─────────────┬──────────────────────┬────────────────────────────────────────────────────────────────────────────────────────────────────┐
│    nctid    │ lastupdatesubmitdate │                     (protocolsection ->> '$.identificationModule.briefTitle')                      │
│   varchar   │       varchar        │                                              varchar                                               │
├─────────────┼──────────────────────┼────────────────────────────────────────────────────────────────────────────────────────────────────┤
│ NCT06156215 │ 2023-12-06           │ PROmotion of COVID-19 BOOSTer VA(X)Ccination in the Emergency Department - PROBOOSTVAXED           │
│ NCT05487040 │ 2023-12-06           │ A Study to Measure the Amount of Study Medicine in Blood in Adult Participants With COVID-19 and … │
│ NCT06163677 │ 2023-12-07           │ A Study to Look at the Health Outcomes of Patients With COVID-19 and Influenza.                    │
│ NCT05032976 │ 2023-12-07           │ Korea Comirnaty Post-marketing Surveillance                                                        │
│ NCT05596734 │ 2023-12-11           │ A Study to Evaluate the Safety, Tolerability, and Immunogenicity of Combined Modified RNA Vaccine… │
└─────────────┴──────────────────────┴────────────────────────────────────────────────────────────────────────────────────────────────────┘
```
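The `->>'$.identificationModule.briefTitle'` expression in the query pulls a nested field out of the JSON stored in the `protocolsection` column. The same lookup can be sketched in Python. The sample row below is hand-built from one line of the output above, and it assumes the column holds a JSON object with an `identificationModule` key, as the query implies; the helper name `brief_title` is hypothetical.

```python
import json
from typing import Optional

# Hand-built sample row; the real column contains many more fields.
row = {
    "nctid": "NCT05032976",
    "protocolsection": json.dumps(
        {
            "identificationModule": {
                "briefTitle": "Korea Comirnaty Post-marketing Surveillance"
            }
        }
    ),
}


def brief_title(record: dict) -> Optional[str]:
    """Return identificationModule.briefTitle from the JSON column, if present."""
    section = json.loads(record["protocolsection"])
    return section.get("identificationModule", {}).get("briefTitle")


if __name__ == "__main__":
    print(brief_title(row))
```

Returning `None` for a missing key keeps the helper usable on records that lack an identification module, rather than raising partway through a batch.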

Executing the Tap Directly

```bash
tap-clinicaltrials --version
tap-clinicaltrials --help
tap-clinicaltrials --config CONFIG --discover > ./catalog.json
```
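When a Singer tap runs a sync, it writes one JSON message per line to standard output, each carrying a `type` such as `SCHEMA`, `RECORD`, or `STATE`. A minimal sketch of consuming that output follows. It assumes a stream named `studies` (the table name in the DuckDB example) and uses hand-written sample lines rather than real tap output; `count_records` is a hypothetical helper, not part of the tap.

```python
import json
from collections import Counter
from typing import Iterable


def count_records(lines: Iterable[str]) -> Counter:
    """Count RECORD messages per stream in Singer-formatted output."""
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between messages
        message = json.loads(line)
        if message.get("type") == "RECORD":
            counts[message["stream"]] += 1
    return counts


# Hand-written sample messages, not real tap output.
sample = [
    '{"type": "SCHEMA", "stream": "studies", "schema": {"properties": {}}, "key_properties": []}',
    '{"type": "RECORD", "stream": "studies", "record": {"nctid": "NCT05032976"}}',
    '{"type": "RECORD", "stream": "studies", "record": {"nctid": "NCT06163677"}}',
    '{"type": "STATE", "value": {}}',
]

if __name__ == "__main__":
    print(count_records(sample))
```

To try it against a real run, `sys.stdin` could be passed in place of `sample` with the tap's output piped into the script.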

Developer Resources

Initialize your Development Environment

```bash
pipx install hatch
```

Create and Run Tests

Run integration tests:

```bash
hatch run test:integration
```

You can also test the tap-clinicaltrials CLI directly:

```bash
hatch run sync:console -- --about --format=json
```
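The JSON printed by `--about --format=json` can be inspected programmatically. The sketch below assumes the payload has a `capabilities` list and a `settings` object with a `properties` mapping; those field names are an assumption and should be checked against the real output. The sample payload is hand-built from the capabilities and settings listed in this README, and `summarize` is a hypothetical helper.

```python
import json
from typing import List, Tuple

# Hand-built sample; field names are assumed, verify against the real output.
about_json = json.dumps(
    {
        "name": "tap-clinicaltrials",
        "capabilities": ["catalog", "state", "discover", "about", "stream-maps"],
        "settings": {
            "properties": {"start_date": {}, "condition": {}, "sponsor": {}}
        },
    }
)


def summarize(about_text: str) -> Tuple[List[str], List[str]]:
    """Return (capabilities, setting names), each sorted alphabetically."""
    about = json.loads(about_text)
    capabilities = sorted(about.get("capabilities", []))
    settings = sorted(about.get("settings", {}).get("properties", {}))
    return capabilities, settings


if __name__ == "__main__":
    caps, names = summarize(about_json)
    print("capabilities:", ", ".join(caps))
    print("settings:", ", ".join(names))
```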

SDK Dev Guide

See the dev guide for more instructions on how to use the SDK to develop your own taps and targets.

Owner

  • Name: Edgar Ramírez Mondragón
  • Login: edgarrmondragon
  • Kind: user
  • Location: Mexico
  • Company: @archdotdev / @meltano

Python / Data / Backend Applications

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Ramírez Mondragón"
  given-names: "Edgar"
  orcid: "https://orcid.org/0000-0002-4182-0385"
title: "Citric"
version: "0.1.0"
doi: 10.5281/zenodo.1234
date-released: 2023-01-24
url: "https://github.com/edgarrmondragon/tap-clinicaltrials"

GitHub Events

Total
  • Create event: 84
  • Issues event: 1
  • Release event: 6
  • Delete event: 75
  • Issue comment event: 9
  • Push event: 103
  • Pull request review event: 26
  • Pull request event: 148
Last Year
  • Create event: 84
  • Issues event: 1
  • Release event: 6
  • Delete event: 75
  • Issue comment event: 9
  • Push event: 103
  • Pull request review event: 26
  • Pull request event: 148

Issues and Pull Requests

All Time
  • Total issues: 2
  • Total pull requests: 219
  • Average time to close issues: N/A
  • Average time to close pull requests: about 6 hours
  • Total issue authors: 2
  • Total pull request authors: 4
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.13
  • Merged pull requests: 194
  • Bot issues: 1
  • Bot pull requests: 187
Past Year
  • Issues: 1
  • Pull requests: 127
  • Average time to close issues: N/A
  • Average time to close pull requests: about 8 hours
  • Issue authors: 1
  • Pull request authors: 4
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.14
  • Merged pull requests: 105
  • Bot issues: 1
  • Bot pull requests: 113
Top Authors
Issue Authors
  • edgarrmondragon (1)
  • renovate[bot] (1)
Pull Request Authors
  • pre-commit-ci[bot] (71)
  • dependabot[bot] (67)
  • renovate[bot] (65)
  • edgarrmondragon (37)
Top Labels
Issue Labels
enhancement (1)
Pull Request Labels
dependencies (130) python (47) github_actions (20)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 28 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 14
  • Total maintainers: 1
pypi.org: tap-clinicaltrials

`tap-clinicaltrials` is a Singer tap for ClinicalTrials.gov, built with the Meltano SDK for Singer Taps.

  • Versions: 14
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 28 Last month
Rankings
Dependent packages count: 10.0%
Average: 38.2%
Dependent repos count: 66.3%