LISC

LISC: A Python Package for Scientific Literature Collection and Analysis - Published in JOSS (2019)

https://github.com/lisc-tools/lisc

Science Score: 100.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: pubmed.ncbi, ncbi.nlm.nih.gov, joss.theoj.org
  • Committers with academic emails
    2 of 7 committers (28.6%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

literature-mining literature-review meta-analysis scientific-publications text-mining web-scraping

Scientific Fields

Mathematics Computer Science - 84% confidence
Last synced: 6 months ago · JSON representation ·

Repository

Literature Scanner: Automated collection & analyses of the scientific literature.

Basic Info
Statistics
  • Stars: 111
  • Watchers: 5
  • Forks: 12
  • Open Issues: 2
  • Releases: 6
Topics
literature-mining literature-review meta-analysis scientific-publications text-mining web-scraping
Created over 7 years ago · Last pushed 7 months ago
Metadata Files
Readme Contributing License Code of conduct Citation

README.rst

=========================
LISC - Literature Scanner
=========================

|ProjectStatus| |Version| |BuildStatus| |Coverage| |License| |PythonVersions| |Publication|

.. |ProjectStatus| image:: https://www.repostatus.org/badges/latest/active.svg
   :target: https://www.repostatus.org/#active
   :alt: project status

.. |Version| image:: https://img.shields.io/pypi/v/lisc.svg
   :target: https://pypi.org/project/lisc/
   :alt: version

.. |BuildStatus| image:: https://github.com/lisc-tools/lisc/actions/workflows/build.yml/badge.svg
   :target: https://github.com/lisc-tools/lisc/actions/workflows/build.yml
   :alt: build status

.. |Coverage| image:: https://codecov.io/gh/lisc-tools/lisc/branch/main/graph/badge.svg
   :target: https://codecov.io/gh/lisc-tools/lisc
   :alt: coverage

.. |License| image:: https://img.shields.io/pypi/l/lisc.svg
   :target: https://opensource.org/license/apache-2-0
   :alt: license

.. |PythonVersions| image:: https://img.shields.io/pypi/pyversions/lisc.svg
   :target: https://pypi.org/project/lisc/
   :alt: python versions

.. |Publication| image:: https://joss.theoj.org/papers/10.21105/joss.01674/status.svg
   :target: https://doi.org/10.21105/joss.01674
   :alt: publication

LISC is a package for collecting and analyzing scientific literature.

Overview
--------

LISC is a Python module for collection and analyzing the scientific literature. To do so,
LISC wraps and combines existing APIs, allowing users to collect and analyze scientific articles,
allowing for automated meta-analyses.

A curated list of some projects enabled by LISC is available on the
`projects `_ page.

Supported APIs & Collection Approaches
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Supported APIs and data collection approaches include:

- The `EUtils `_ API, which provides access to literature data,
  including the `Pubmed `_ database, from which counts, co-occurrences,
  text, and meta-data from scientific articles can be collected.
- The `OpenCitations `_ API, which provides access to citation data, from which
  citation and reference information can be collected.

Analysis & Other Functionality
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In addition to connecting to external APIs, LISC also provides:

- A database structure, and save and load utilities for storing collected data
- Custom data objects for managing and preprocessing collected data
- Functions and utilities to analyze collected data
- Data visualization functions for plotting collected data and analysis outputs

Documentation
-------------

Documentation is available on the
`documentation site `_.

This documentation includes:

- `Tutorials `_:
  with a step-by-step guide through the module and how to use it
- `Examples `_:
  demonstrating example analyses and use cases, and other functionality
- `API list `_:
  which lists and describes all the code and functionality available in the module
- `Reference `_:
  with information for how to reference and report on using the module

For a curated list of projects that use LISC, see the
`projects `_ page.

Dependencies
------------

LISC is written in Python 3, and requires Python >= 3.7 to run.

Requirements:

- `numpy `_
- `requests `_
- `lxml `_
- `beautifulsoup4 `_

Optional dependencies, used for plotting, analyses & testing:

- `matplotlib `_
- `seaborn `_
- `scipy `_
- `wordcloud `_
- `pytest `_

Install
-------

Stable releases of LISC are released on the Github
`release page `_, and on
`PYPI `_.

Descriptions of updates and changes across versions are available in the
`changelog `_.

**Stable Release Version**

To install the latest stable release, you can install from pip:

.. code-block:: shell

    $ pip install lisc

LISC can also be installed with conda, from the conda-forge channel:

.. code-block:: shell

    $ conda install -c conda-forge lisc

**Development Version**

To get the development version (updates that are not yet published to pip), you can clone this repository.

.. code-block:: shell

    $ git clone https://github.com/lisc-tools/lisc

To install this cloned copy of LISC, move into the directory you just cloned, and run:

.. code-block:: shell

    $ pip install .

**Editable Version**

If you want to install an editable version, for making contributions, download the development
version as above, and run:

.. code-block:: shell

    $ pip install -e .

Reference
---------

If you use this code in your project, please cite:

.. code-block:: text

    Donoghue, T. (2018)  LISC: A Python Package for Scientific Literature Collection and Analysis.
    Journal of Open Source Software, 4(41), 1674. DOI: 10.21105/joss.01674

Direct Link: https://doi.org/10.21105/joss.01674

More information for how to cite this method can be found on the
`reference page `_.

Contribute
----------

This project welcomes and encourages contributions from the community!

To file bug reports and/or ask questions about this project, please use the
`Github issue tracker `_.

To see and get involved in discussions about the module, check out:

- the `issues board `_
  for topics relating to code updates, bugs, and fixes
- the `development page `_
  for discussion of potential major updates to the module

When interacting with this project, please use the
`contribution guidelines `_
and follow the
`code of conduct `_.

Owner

  • Name: LISC
  • Login: lisc-tools
  • Kind: organization
  • Email: tdonoghue.research@gmail.com

Automated collection & analysis of scientific literature.

JOSS Publication

LISC: A Python Package for Scientific Literature Collection and Analysis
Published
September 26, 2019
Volume 4, Issue 41, Page 1674
Authors
Thomas Donoghue ORCID
Department of Cognitive Science, University of California, San Diego
Editor
Daniel S. Katz ORCID
Tags
python literature analysis text mining web scraping meta science knowledge discovery hypothesis generation automated meta analysis literature based discovery

Citation (CITATION.cff)

cff-version: 1.2.0
message: >-
  If you use this software, please cite it using the metadata from this file.
type: software
title: 'lisc: literature scanner'
authors:
- given-names: 'Thomas'
  family-names: 'Donoghue'
  orcid: 'https://orcid.org/0000-0001-5911-0472'
repository-code: 'https://github.com/lisc-tools/lisc'
url: 'https://lisc-tools.github.io/'
license: Apache-2.0
preferred-citation:
  type: article
  title: 'LISC: A Python Package for Scientific Literature Collection and Analysis'
  authors:
  - given-names: Thomas
    family-names: Donoghue
    orcid: 'https://orcid.org/0000-0001-5911-0472'
  journal: 'Journal of Open Source Software'
  doi: '10.21105/joss.01674'
  issue: 41
  volume: 4
  year: 2019
  start: 1674

GitHub Events

Total
  • Create event: 10
  • Release event: 1
  • Issues event: 5
  • Watch event: 11
  • Delete event: 8
  • Issue comment event: 22
  • Push event: 31
  • Pull request event: 17
  • Fork event: 1
Last Year
  • Create event: 10
  • Release event: 1
  • Issues event: 5
  • Watch event: 11
  • Delete event: 8
  • Issue comment event: 22
  • Push event: 31
  • Pull request event: 17
  • Fork event: 1

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 589
  • Total Committers: 7
  • Avg Commits per committer: 84.143
  • Development Distribution Score (DDS): 0.034
Past Year
  • Commits: 63
  • Committers: 1
  • Avg Commits per committer: 63.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Tom Donoghue t****h@g****m 569
Ryan Hammonds r****s@u****u 14
hahaschool h****g@g****m 2
Kendra Oudyk k****k@g****m 1
Jason Fleischer j****r@g****m 1
Daniel S. Katz d****z@i****g 1
Ben Cipollini 4****i 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 32
  • Total pull requests: 86
  • Average time to close issues: 8 months
  • Average time to close pull requests: 16 days
  • Total issue authors: 9
  • Total pull request authors: 7
  • Average comments per issue: 1.53
  • Average comments per pull request: 0.33
  • Merged pull requests: 78
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 19
  • Average time to close issues: 5 days
  • Average time to close pull requests: 2 days
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 0.33
  • Average comments per pull request: 0.21
  • Merged pull requests: 13
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • TomDonoghue (19)
  • linuxscout (3)
  • skytguuu (3)
  • lakimenon (2)
  • koudyk (1)
  • hahaschool (1)
  • suncyd (1)
  • another-helpful-user (1)
  • guiomar (1)
Pull Request Authors
  • TomDonoghue (80)
  • ryanhammonds (4)
  • jasongfleischer (2)
  • koudyk (1)
  • bcipolli (1)
  • danielskatz (1)
Top Labels
Issue Labels
enhancement (4) 0.2 (4) documentation (3) question (1)
Pull Request Labels

Packages

  • Total packages: 2
  • Total downloads:
    • pypi 91 last-month
  • Total dependent packages: 0
    (may contain duplicates)
  • Total dependent repositories: 2
    (may contain duplicates)
  • Total versions: 7
  • Total maintainers: 1
pypi.org: lisc

Literature Scanner

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Downloads: 91 Last month
Rankings
Stargazers count: 7.7%
Dependent packages count: 10.0%
Forks count: 10.5%
Dependent repos count: 11.6%
Average: 12.9%
Downloads: 24.7%
Maintainers (1)
Last synced: 6 months ago
conda-forge.org: lisc
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Stargazers count: 33.7%
Dependent repos count: 34.0%
Average: 40.3%
Forks count: 42.2%
Dependent packages count: 51.2%
Last synced: 6 months ago

Dependencies

optional-requirements.txt pypi
  • matplotlib *
  • scipy *
  • seaborn *
  • wordcloud *
requirements-docs.txt pypi
  • matplotlib *
  • numpydoc *
  • scipy *
  • seaborn *
  • sphinx *
  • sphinx-copybutton *
  • sphinx_bootstrap_theme *
  • sphinx_gallery *
  • wordcloud *
requirements.txt pypi
  • beautifulsoup4 *
  • lxml *
  • numpy >=1.17.1
  • requests *
.github/workflows/build.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • codecov/codecov-action v3 composite