pubmed_embedding

Building a pubmed graph, automatically.

https://github.com/lucacappelletti94/pubmed_embedding

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.3%) to scientific vocabulary
Last synced: 7 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: LucaCappelletti94
  • License: MIT
  • Language: Rust
  • Default Branch: main
  • Size: 534 KB
Statistics
  • Stars: 5
  • Watchers: 1
  • Forks: 2
  • Open Issues: 3
  • Releases: 0
Created almost 4 years ago · Last pushed almost 2 years ago
Metadata Files
  • Readme
  • License
  • Citation

README.rst

PubMed embedding
===================================
|pip| |downloads| |paper|

Building PubMed embeddings, automatically.


Install the package
----------------------------------
Install the package from PyPI:

.. code:: shell

    pip install pubmed_embedding


Usage examples
----------------------------------
You can retrieve the embeddings for the PubMed IDs of interest as follows:

BERT
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code:: python

    from pubmed_embedding import get_pubmed_embedding_from_curies

    pubmed_ids = ["PMID:24774509", "PMID:15170967", "PMID:7850793"]

    bert_features = get_pubmed_embedding_from_curies(
        curies=pubmed_ids,
        version="pubmed_bert_30_11_2022"
    )

And the result is:

|BERT|
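
The ``curies`` argument above takes identifiers in CURIE form, that is ``PMID:`` followed by the numeric ID. If your IDs are bare numbers, they can be normalized first. The following is a minimal sketch of a hypothetical helper that is not part of ``pubmed_embedding``; the name ``to_pmid_curie`` and its validation rules are assumptions made for illustration.

.. code:: python

    def to_pmid_curie(identifier):
        """Return a PMID CURIE such as "PMID:24774509" (hypothetical helper)."""
        text = str(identifier).strip()
        # Accept values that already carry the prefix, in any letter case.
        if text.upper().startswith("PMID:"):
            text = text[len("PMID:"):].strip()
        # Assumption: a PubMed ID is a plain positive integer.
        if not text.isdecimal() or int(text) == 0:
            raise ValueError(f"Not a valid PubMed ID: {identifier!r}")
        return f"PMID:{int(text)}"


    raw_ids = [24774509, "15170967", "pmid:7850793"]
    pubmed_ids = [to_pmid_curie(raw_id) for raw_id in raw_ids]

The resulting ``pubmed_ids`` list has the same shape as the one passed to ``get_pubmed_embedding_from_curies`` in the example above.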


SciBERT
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code:: python

    # Reuses the import and the pubmed_ids list from the BERT example.
    scibert_features = get_pubmed_embedding_from_curies(
        curies=pubmed_ids,
        version="pubmed_scibert_30_11_2022"
    )

And the result is:

|SciBERT|

Specter
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code:: python

    # Reuses the import and the pubmed_ids list from the BERT example.
    specter_features = get_pubmed_embedding_from_curies(
        curies=pubmed_ids,
        version="pubmed_specter_30_11_2022"
    )

And the result is:

|Specter|
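
Once embeddings have been retrieved, a common next step is to compare two articles by cosine similarity. The following sketch is independent of the package: it assumes each embedding can be read as a sequence of floats, which this README does not confirm, and the vectors shown are made-up placeholders rather than real PubMed embeddings. The function name ``cosine_similarity`` is chosen for illustration.

.. code:: python

    import math


    def cosine_similarity(first, second):
        """Cosine similarity between two equal-length sequences of floats."""
        if len(first) != len(second):
            raise ValueError("Vectors must have the same length.")
        dot = sum(a * b for a, b in zip(first, second))
        norm_first = math.sqrt(sum(a * a for a in first))
        norm_second = math.sqrt(sum(b * b for b in second))
        if norm_first == 0.0 or norm_second == 0.0:
            raise ValueError("Cosine similarity is undefined for zero vectors.")
        return dot / (norm_first * norm_second)


    # Placeholder vectors for illustration only, not real embeddings.
    vector_a = [1.0, 0.0, 2.0]
    vector_b = [2.0, 0.0, 4.0]
    similarity = cosine_similarity(vector_a, vector_b)

Vectors pointing in the same direction score close to 1, and orthogonal vectors score 0.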

Citing this work
-----------------------------
If you have found these datasets useful, please do cite:

.. code:: bib

    @software{cappellettiPubMed2022,
        author = {Cappelletti, Luca and Fontana, Tommaso and Reese, Justin},
        month = {12},
        title = {{BM25-weighted BERT-based embedding of PubMed}},
        url = {https://github.com/LucaCappelletti94/pubmed_embedding},
        version = {1.0.14},
        year = {2022}
    }



.. |BERT| image:: https://github.com/LucaCappelletti94/pubmed_embedding/blob/main/bert.png?raw=true
.. |SciBERT| image:: https://github.com/LucaCappelletti94/pubmed_embedding/blob/main/scibert.png?raw=true
.. |Specter| image:: https://github.com/LucaCappelletti94/pubmed_embedding/blob/main/specter.png?raw=true

.. |pip| image:: https://badge.fury.io/py/pubmed-embedding.svg
    :target: https://badge.fury.io/py/pubmed-embedding
    :alt: PyPI project

.. |downloads| image:: https://pepy.tech/badge/pubmed-embedding
    :target: https://pepy.tech/badge/pubmed-embedding
    :alt: PyPI total project downloads

.. |paper| image:: https://img.shields.io/badge/Paper-BM25_weighted_BERT_based_embedding_of_PubMed-blue.svg
    :target: https://github.com/LucaCappelletti94/pubmed_embedding/blob/main/BM25_weighted_BERT_based_embedding_of_PubMed.pdf
    :alt: Paper

Owner

  • Name: Luca Cappelletti
  • Login: LucaCappelletti94
  • Kind: user
  • Location: Milano, Italy
  • Company: University of Fribourg

I am but an egg 🥚 - "Stranger in a Strange Land"

Citation (CITATION.cff)

@software{cappellettiPubMed2022,
  author = {Cappelletti, Luca and Fontana, Tommaso and Reese, Justin},
  month = {12},
  title = {{BM25-weighted BERT-based embedding of PubMed}},
  url = {https://github.com/LucaCappelletti94/pubmed_embedding},
  version = {1.0.12},
  year = {2022}
}

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 79
  • Total Committers: 1
  • Avg Commits per committer: 79.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
  • LucaCappelletti94 (c****4@g****m): 79 commits

Issues and Pull Requests

Last synced: 11 months ago

All Time
  • Total issues: 2
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 2
  • Total pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • cthoyt (1)
  • LucaCappelletti94 (1)
Pull Request Authors
  • cthoyt (2)

Dependencies

pubmed_parser/Cargo.lock cargo
  • adler 1.0.2
  • autocfg 1.1.0
  • cfg-if 1.0.0
  • console 0.15.0
  • crc32fast 1.3.2
  • crossbeam-channel 0.5.4
  • crossbeam-deque 0.8.1
  • crossbeam-epoch 0.9.8
  • crossbeam-utils 0.8.8
  • either 1.6.1
  • encode_unicode 0.3.6
  • flate2 1.0.24
  • hermit-abi 0.1.19
  • indicatif 0.16.2
  • lazy_static 1.4.0
  • libc 0.2.126
  • memoffset 0.6.5
  • miniz_oxide 0.5.3
  • num_cpus 1.13.1
  • number_prefix 0.4.0
  • once_cell 1.12.0
  • rayon 1.5.3
  • rayon-core 1.9.3
  • regex 1.5.6
  • regex-syntax 0.6.26
  • scopeguard 1.1.0
  • terminal_size 0.1.17
  • winapi 0.3.9
  • winapi-i686-pc-windows-gnu 0.4.0
  • winapi-x86_64-pc-windows-gnu 0.4.0
pubmed_parser/Cargo.toml cargo
setup.py pypi
  • compress_json *
  • downloaders *
  • numpy *
  • pandas *
  • tqdm *
  • userinput *