pmidcite

Download "Cited by" data from the NIH for any paper with a PubMed ID

https://github.com/dvklopfenstein/pmidcite

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 24 DOI reference(s) in README
  • Academic publication links
    Links to: pubmed.ncbi, ncbi.nlm.nih.gov, wiley.com, plos.org, zenodo.org
  • Committers with academic emails
    1 of 3 committers (33.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.2%) to scientific vocabulary

Keywords

citation-analysis citation-counts citation-downloader citations command-line-tool google-scholar google-search library literature-review literature-search ncbi nih-citation-data pmid pubmed snowballing
Last synced: 6 months ago

Repository

Download "Cited by" data from the NIH for any paper with a PubMed ID

Basic Info
Statistics
  • Stars: 32
  • Watchers: 5
  • Forks: 7
  • Open Issues: 9
  • Releases: 34
Topics
citation-analysis citation-counts citation-downloader citations command-line-tool google-scholar google-search library literature-review literature-search ncbi nih-citation-data pmid pubmed snowballing
Created about 6 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog Contributing Funding License Code of conduct Citation

README.md

PubMed ID (PMID) Cite

Turbocharge a PubMed literature search with the command, icite, rather than clicking and clicking and clicking on Google Scholar "Cited by N" links.

This open-source project is part of a peer-reviewed commentary that was invited by the editors of Research Synthesis Methods. Please Cite and star on GitHub if you use pmidcite in your research or literature search.

Contact: dvklopfenstein@protonmail.com

PubMed and NIH Citation data

PubMed contains peer-reviewed research papers in biomedicine, biochemistry, chemistry, behavioral science, and other life sciences.
Citation data is downloaded from the National Institutes of Health (NIH) each time icite is run and includes:
* Citation counts of all papers and clinical papers
* Performance of a paper among its peer papers
* Existence of MeSH terms for the human, animal, and molecular/cellular categories

1) Download citation counts and data for a research paper

$ icite -H 26032263
* This paper (PMID 26032263) has 25 citations, 10 references, and 4 authors.
* This paper is performing well (74th percentile in column %) compared to its peers.

NIH percentile

This paper is performing well (74th percentile) compared to its peers (column %).

The NIH percentile grouping (column G) helps to highlight the better performing papers in groups 2, 3, and 4 by sorting the citing papers by group first, then publication year.

The sort places the lower performing papers in groups 0 or 1 at the back.

New papers appear at the beginning of a sorted list, no matter how many citations they have, to help researchers find the latest discoveries.

The grouping of papers by NIH percentile grouping is a novel feature created by dvklopfenstein for this project.
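The ordering described above can be sketched in a few lines of Python. This is an illustration only, not pmidcite's implementation: the records are hypothetical placeholders, the group labels (i, 4, 3, 2, 1, 0) are taken as given rather than computed from percentiles, and newest-first ordering within a group is an assumption.

```python
# Hypothetical records: (NIH group, publication year, placeholder ID).
# These are illustrative values, not real citation data.
papers = [
    ("2", 2019, "PMID_A"),
    ("i", 2024, "PMID_B"),
    ("0", 2021, "PMID_C"),
    ("4", 2017, "PMID_D"),
    ("2", 2022, "PMID_E"),
    ("1", 2023, "PMID_F"),
]

# Assumed display rank: new papers ("i") first, then groups 4 down to 0.
GROUP_RANK = {"i": 0, "4": 1, "3": 2, "2": 3, "1": 4, "0": 5}


def sort_by_group_then_year(records):
    """Order records by group rank, then newest publication year first."""
    return sorted(records, key=lambda rec: (GROUP_RANK[rec[0]], -rec[1]))


ordered = sort_by_group_then_year(papers)
for group, year, pmid in ordered:
    print(group, year, pmid)
```

Sorting on a (group, year) pair keeps a highly ranked older paper ahead of a weaker recent one, while papers too new to have a percentile still come first.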

2) Forward citation search

Also known as following a paper's Cited by links or Forward snowballing

$ icite -H; icite 26032263 --load_citations | sort -k6 -r
or
$ icite -H; icite 26032263 -c | sort -k6 -r

3) Backward citation search

Also known as following links to a paper's references or Backward snowballing

$ icite -H; icite 26032263 --load_references | sort -k6 -r
or
$ icite -H; icite 26032263 -r | sort -k6 -r

4) Summarize a group of citations

Create a file containing numerous PMIDs annotated with icite info:
$ icite 30022098 -c -o goatools_cites.txt
WROTE: goatools_cites.txt

Count the number of lines in the file:
$ wc -l goatools_cites.txt
468 goatools_cites.txt

Summarize the papers in goatools_cites.txt:
```
$ sumpaps goatools_cites.txt
i=026.9% 4=003.0% 3=018.9% 2=028.8% 1=015.9% 0=006.5% 6 years:2018-2024 465 papers goatools_cites.txt
```
* The output is on one line so that many files containing sets of PMIDs can be compared
* The groups run from newest (i) to top-performing (4), great (3), very good (2), and overlooked (1 and 0)
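Because each summary fits on one line, comparing several files comes down to parsing the group=percent fields. The sketch below is a hypothetical helper, not part of pmidcite, and it assumes the line layout shown in the sumpaps output above.

```python
import re

# The summary line quoted in this README for goatools_cites.txt
SUMMARY = (
    "i=026.9% 4=003.0% 3=018.9% 2=028.8% 1=015.9% 0=006.5% "
    "6 years:2018-2024 465 papers goatools_cites.txt"
)


def parse_summary(line):
    """Return {group: percent} for every 'group=NNN.N%' field in the line."""
    fields = re.findall(r"(?<!\S)([i0-4])=(\d+\.\d+)%", line)
    return {group: float(percent) for group, percent in fields}


groups = parse_summary(SUMMARY)
# Share of papers in the better-performing groups 2, 3, and 4
strong = groups["4"] + groups["3"] + groups["2"]
print(groups)
print(f"groups 2-4 combined: {strong:.1f}%")
```

Parsing several such lines into dictionaries makes it straightforward to rank sets of PMIDs by, for example, their share of papers in groups 2 to 4.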

5) Download citations for all papers returned from a PubMed search

  1. Do a search in PubMed
  2. Save all results into a file containing all PMIDs found by the search
  3. Download the list of PMIDs
  4. Run icite to analyze all the PMIDs

1. Do a search in PubMed

2. Save all results into a list of PMIDs

3. Download the list of PMIDs

4. Run icite to analyze all the PMIDs

$ icite -i pmid-HIVANDDNAm-set.txt -o pmid-HIVANDDNAm-icite.txt
$ grep TOP pmid-HIVANDDNAm-icite.txt | sort -k6
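Before passing a downloaded PMID list to icite, it can be useful to check what the file contains. The following sketch assumes the file holds one PMID per line. The function name and demo file are hypothetical and not part of pmidcite.

```python
import os
import tempfile


def read_pmids(path):
    """Return unique PMIDs in first-seen order, skipping blank or non-numeric lines."""
    seen = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            token = line.strip()
            if token.isdigit():
                seen.setdefault(token, None)
    return list(seen)


# Demo file built from PMIDs mentioned in this README, plus a duplicate
# and an invalid line to show the filtering.
with tempfile.TemporaryDirectory() as tmpdir:
    demo = os.path.join(tmpdir, "pmids.txt")
    with open(demo, "w", encoding="utf-8") as handle:
        handle.write("26032263\n30022098\n\n26032263\nnot-a-pmid\n")
    pmids = read_pmids(demo)

print(len(pmids), "unique PMIDs:", pmids)
```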

Command Line Interface (CLI)

A Command-Line Interface (CLI) can be preferable to a Graphical User Interface (GUI) because:
* processing can be automated from a script
* time-consuming mouse clicking is reduced
* more data can be seen at once on a text screen than in a browser, giving the researcher a better overall impression of the full set of information [1]

Researchers who use Linux or Mac already work from the command line. Researchers who use Windows can get that Linux-like command line feeling while still running native Windows programs by downloading Cygwin from https://www.cygwin.com/ [1].
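As an illustration of the scripting point, the sketch below builds one forward-citation command per paper using only the icite arguments shown in this README (a PMID, -c, and -o). The output file names and the DRY_RUN switch are hypothetical choices for this example. Nothing is executed unless DRY_RUN is set to False and icite is installed.

```python
import shutil
import subprocess

DRY_RUN = True  # set to False to actually call icite (downloads data from the NIH)


def icite_forward_cmd(pmid, outfile):
    """Build the argument list for: icite <pmid> -c -o <outfile>"""
    return ["icite", str(pmid), "-c", "-o", outfile]


# PMIDs used as examples elsewhere in this README
pmids = [26032263, 30022098]
commands = [icite_forward_cmd(pmid, f"cites_{pmid}.txt") for pmid in pmids]

for cmd in commands:
    print(" ".join(cmd))
    # Run only when requested and when the icite executable is on the PATH.
    if not DRY_RUN and shutil.which("icite"):
        subprocess.run(cmd, check=True)
```

Passing the command as a list rather than a single string avoids shell quoting problems if file names ever contain spaces.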

PubMed vs Google Scholar

In 2013, Boeker et al. [6] recommended that a scientific search interface contain five integrated search criteria. PubMed implements all five; Google Scholar did not in 2013 and still does not today.

Google's highly popular implementation of the forward citation search, through its ubiquitous "Cited by N" links, is a "Better" experience than PubMed's forward citation search implementation.

But if your research is in the health sciences and you are amenable to working from the command line, you can use PubMed in your browser together with citation data that pmidcite downloads from the NIH on the command line. The NIH's citation data includes a paper's ranking within its co-citation network.

What is in PubMed? Take a quick tour

PubMed is a search interface and toolset used to access over 30.5 million article records from databases such as:
* MEDLINE: a highly selective database started in the 1960s
* PubMed Central (PMC): an open-access database for full-text papers that are free of cost
* Additional content such as books and articles published before the 1960s

Installation

To install from PyPI

$ pip install pmidcite

To install using Bioconda

$ conda install -c bioconda pmidcite

To install locally

$ git clone https://github.com/dvklopfenstein/pmidcite.git
$ cd ./pmidcite
$ pip install .

Setup

Save your literature search in a GitHub repo.

1. Add a pmidcite init file

Add a .pmidciterc init file to a non-git managed directory, such as home (~):
```
$ icite --generate-rcfile | tee ~/.pmidciterc
[pmidcite]
email = myname@email.edu
# To download PubMed search results, get an NCBI API key here:
# https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities
apikey = MYLONGHEXNCBIAPIKEY
tool = myscripts
$ export PMIDCITECONF=~/.pmidciterc
```
Do not put the .pmidciterc file under version control (for example, in a GitHub repo) because it contains your personal email and your private NCBI API key.

2. NCBI E-Utils API key

To download PubMed abstracts and PubMed search results using NCBI's E-Utils, get an NCBI API key using these instructions:
https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities

Set the apikey value in the config file: ~/.pmidciterc
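To confirm that the placeholder API key has been replaced, the init file can be inspected with Python's standard configparser module. This is a minimal sketch that assumes the file is plain INI text with a [pmidcite] section, as in the generated example. It is not how pmidcite itself loads the file, and the helper names are hypothetical.

```python
import configparser
import os
import tempfile

PLACEHOLDER_KEY = "MYLONGHEXNCBIAPIKEY"  # placeholder value in the generated file


def load_pmidcite_settings(path):
    """Read the [pmidcite] section of an init file into a plain dict."""
    parser = configparser.ConfigParser()
    with open(path, encoding="utf-8") as handle:
        parser.read_file(handle)
    return dict(parser["pmidcite"])


def has_real_apikey(settings):
    """Return True if apikey is set to something other than the placeholder."""
    key = settings.get("apikey", "").strip()
    return bool(key) and key != PLACEHOLDER_KEY


# Demo on a temporary copy of the generated example, not on ~/.pmidciterc
with tempfile.TemporaryDirectory() as tmpdir:
    rcfile = os.path.join(tmpdir, ".pmidciterc")
    with open(rcfile, "w", encoding="utf-8") as handle:
        handle.write(
            "[pmidcite]\n"
            "email = myname@email.edu\n"
            "# get an NCBI API key and paste it below\n"
            "apikey = MYLONGHEXNCBIAPIKEY\n"
            "tool = myscripts\n"
        )
    settings = load_pmidcite_settings(rcfile)

print(settings["email"], "| real API key set:", has_real_apikey(settings))
```

To check a real setup, pass the path stored in the PMIDCITECONF environment variable to load_pmidcite_settings instead of the temporary file.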

Contributing

See the contributing guide for detailed instructions on how to get started contributing to the pmidcite project.

Contact

email: dvklopfenstein@protonmail.com
https://orcid.org/0000-0003-0161-7603

How to Cite

If you use pmidcite in your research or literature search, please cite paper 1 (pmidcite) and paper 3 (NIH citation data).

Please also consider reading and citing Gusenbauer's response (paper 2) about improving search for all during the information avalanche of these times:

  1. The pmidcite paper:
    Commentary to Gusenbauer and Haddaway 2020: Evaluating retrieval qualities of Google Scholar and PubMed
    Klopfenstein DV and Dampier W
    2020 | Research Synthesis Methods | PMID: 33031632 | DOI: 10.1002/jrsm.1456 | pdf

  2. Gusenbauer's response to the pmidcite paper:
    What every Researcher should know about Searching – Clarified Concepts, Search Advice, and an Agenda to improve Finding in Academia
    Gusenbauer M and Haddaway N
    2020 | Research Synthesis Methods | PMID: 33031639 | DOI: 10.1002/jrsm.1457 | pdf

  3. The NIH citation data used by pmidcite -- Scientific Influence, Translation, and Citation counts:
    The NIH Open Citation Collection: A public access, broad coverage resource
    Hutchins BI ... Santangelo GM
    2019 | PLoS Biology | PMID: 31600197 | DOI: 10.1371/journal.pbio.3000385

References

Please consider reading and citing the paper [4] which inspired the creation of pmidcite [1] and the authors' response to our paper [2]:

  4. Which Academic Search Systems are Suitable for Systematic Reviews or Meta-Analyses? Evaluating Retrieval Qualities of Google Scholar, PubMed and 26 other Resources
    Gusenbauer M and Haddaway N
    2019 | Research Synthesis Methods | PMID: 31614060 | DOI: 10.1002/jrsm.1378

Mentioned in this README are also these outstanding contributions:

  5. Relative Citation Ratio (RCR): A New Metric That Uses Citation Rates to Measure Influence at the Article Level
    Hutchins BI, Yuan X, Anderson JM, and Santangelo GM
    2016 | PLoS Biology | PMID: 27599104 | DOI: 10.1371/journal.pbio.1002541

  6. Google Scholar as replacement for systematic literature searches: good relative recall and precision are not enough
    Boeker M et al.
    2013 | BMC Medical Research Methodology | PMID: 24160679 | DOI: 10.1186/1471-2288-13-131

  7. Best Match: New relevance search for PubMed
    Fiorini N ... Lu Z
    2018 | PLoS Biology | PMID: 30153250 | DOI: 10.1371/journal.pbio.2005343

Copyright (C) 2019-present pmidcite, DV Klopfenstein, PhD. All rights reserved.

Owner

  • Name: DV Klopfenstein, PhD
  • Login: dvklopfenstein
  • Kind: user
  • Location: Philadelphia, PA, USA

Everyone is greedy for gain; Everyone practices deceit.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this project, please cite it as below."
authors:
- family-names: "Klopfenstein"
  given-names: "DV"
  orcid: "https://orcid.org/0000-0003-0161-7603"
  email: dvklopfenstein@protonmail.com
- family-names: "Dampier"
  given-names: "Will"
title: "pmidcite"
version: 0.0.18
doi: 10.5281/zenodo.5172712
keywords: 
  - "Google Scholar"
  - CitedBy
  - PubMed
  - PMID
  - "forward citation"
  - "backward citation"
  - "forward snowball"
  - "backward snowball"
  - "literature review"
  - "literature search"
  - "citation downloader"
contact:
  - email: dvklopfenstein@protonmail.com
    name: "PubMed ID (PMID) Cite"
date-released: 2020-09-01
url: "https://github.com/dvklopfenstein/pmidcite"
preferred-citation:
  type: article
  authors:
  - family-names: "Klopfenstein"
    given-names: "DV"
    orcid: "https://orcid.org/0000-0003-0161-7603"
    email: dvklopfenstein@protonmail.com
  - family-names: "Dampier"
    given-names: "Will"
  doi: "10.1002/jrsm.1456"
  journal: "Research Synthesis Methods"
  month: 3
  start: 126 # First page number
  end: 135 # Last page number
  title: "Commentary to Gusenbauer and Haddaway 2020: Evaluating retrieval qualities of Google Scholar and PubMed"
  issue: 2
  volume: 12
  year: 2021
  keywords: 
    - CitedBy
    - PubMed
    - PMID
    - "Google Scholar"
    - "forward citation"
    - "backward citation"
    - "forward snowball"
    - "backward snowball"
    - "literature review"
    - "literature search"
    - "citation downloader"
  contact:
    - email: dvklopfenstein@protonmail.com
      name: "PubMed ID (PMID) Cite"
  repository: https://github.com/dvklopfenstein/pmidcite
  identifiers:
    - type: "other"
      value: "pmidcite"

GitHub Events

Total
  • Release event: 1
  • Watch event: 2
  • Issue comment event: 2
  • Push event: 25
  • Pull request event: 11
  • Create event: 3
Last Year
  • Release event: 1
  • Watch event: 2
  • Issue comment event: 2
  • Push event: 25
  • Pull request event: 11
  • Create event: 3

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 867
  • Total Committers: 3
  • Avg Commits per committer: 289.0
  • Development Distribution Score (DDS): 0.002
Past Year
  • Commits: 10
  • Committers: 1
  • Avg Commits per committer: 10.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
dvklopfenstein d****n 865
Manodeep Sinha m****p@g****m 1
Shirley Barrera s****y@n****u 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 16
  • Total pull requests: 60
  • Average time to close issues: 13 days
  • Average time to close pull requests: 3 days
  • Total issue authors: 12
  • Total pull request authors: 3
  • Average comments per issue: 1.88
  • Average comments per pull request: 0.03
  • Merged pull requests: 60
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • aditya-sarkar441 (2)
  • KuechlerO (2)
  • cegunderson (2)
  • dvklopfenstein (2)
  • msbased (1)
  • d-yarmosh (1)
  • pnguyen-biotech (1)
  • Travis-Barton (1)
  • bgriffen (1)
  • scbarrera (1)
  • SVN-PhD (1)
  • Raefcon (1)
Pull Request Authors
  • dvklopfenstein (70)
  • manodeep (1)
  • scbarrera (1)
Top Labels
Issue Labels
  • good first issue (1)
  • enhancement (1)
Pull Request Labels

Dependencies

setup.py pypi
  • docopt *
.github/workflows/build.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/codeql-analysis.yml actions
  • actions/checkout v2 composite
  • github/codeql-action/analyze v1 composite
  • github/codeql-action/autobuild v1 composite
  • github/codeql-action/init v1 composite