sosia

Sosia: Automatic author matching in Scopus on-line

https://github.com/sosia-dev/sosia

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.2%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
Statistics
  • Stars: 12
  • Watchers: 2
  • Forks: 4
  • Open Issues: 3
  • Releases: 0
Created over 7 years ago · Last pushed 8 months ago
Metadata Files
Readme Contributing License Citation

README.rst

sosia
=====

Match authors automatically in Scopus on-line

Documentation: https://sosia.readthedocs.io

Development: https://github.com/sosia-dev/sosia

.. image:: https://badge.fury.io/py/sosia.svg
    :target: https://badge.fury.io/py/sosia

.. image:: https://readthedocs.org/projects/sosia/badge/?version=latest
    :target: https://readthedocs.org/projects/sosia/badge/?version=latest

.. image:: https://img.shields.io/pypi/pyversions/sosia.svg
    :target: https://img.shields.io/pypi/pyversions/sosia.svg

.. image:: https://img.shields.io/pypi/l/sosia.svg
    :target: https://img.shields.io/pypi/l/sosia.svg

.. image:: https://api.codeclimate.com/v1/badges/3e10a47fefae831b973a/maintainability
   :target: https://codeclimate.com/github/sosia-dev/sosia/maintainability

Installation
============

Install the stable version from PyPI:

.. code:: bash

    pip install sosia

or the development version from the GitHub repository:

.. code:: bash

    pip install git+https://github.com/sosia-dev/sosia
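
To confirm that the installation succeeded, one option is to ask Python for the installed version. The snippet below is a minimal sketch using only the standard library; the helper name ``installed_version`` is hypothetical and not part of sosia:

.. code-block:: python

    from importlib.metadata import PackageNotFoundError, version


    def installed_version(package):
        """Return the installed version of ``package``, or None if absent."""
        try:
            return version(package)
        except PackageNotFoundError:
            return None


    # Prints a version string if sosia is installed, otherwise None
    print(installed_version("sosia"))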

Functioning
===========

sosia performs a series of queries in the Scopus database using the pybliometrics package.
After configuring your local pybliometrics installation (providing access credentials and, optionally, setting cache directories), you are ready to use sosia:

.. inclusion-marker-start
.. code-block:: python

    >>> import sosia
    >>> 
    >>> # You need the Scopus ID and the year; optionally set a database path
    >>> stefano = sosia.Original(55208373700, 2018)
    >>> # Sources similar to those stefano publishes in
    >>> stefano.define_search_sources()
    >>> # Authors publishing in search sources every 2 years
    >>> stefano.identify_candidates_from_sources(first_year_margin=1,
    ...                                          frequency=2)
    >>> # Find candidates whose characteristics fall within margins
    >>> stefano.filter_candidates(same_discipline=True, pub_margin=0.2,
    ...                           first_year_margin=1, cits_margin=0.2,
    ...                           coauth_margin=0.15)
    >>> print(stefano.matches)
    [55567912500]
    >>> # Optional step to provide additional information
    >>> stefano.inform_matches()
    >>> print(stefano.matches[0])
    Match(ID=55567912500, name='Eling, Katrin', first_name='Katrin',
          surname='Eling', first_year=2013, last_year=2018, num_coauthors=9,
          num_publications=8, num_citations=56, subjects=['BUSI', 'COMP', 'ENGI'],
          affiliation_country='Netherlands', affiliation_id='60032882',
          affiliation_name='Technische Universiteit Eindhoven',
          affiliation_type='univ', language='eng', num_cited_refs=0)

.. inclusion-marker-end
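
The margin arguments passed to ``filter_candidates()`` above mix floats (``pub_margin=0.2``) and integers (``first_year_margin=1``). The sketch below illustrates one plausible reading, assuming a float margin is a share of the original scientist's value (rounded up) and an integer margin is an absolute amount. This is an assumption made for illustration, not a description of sosia's internals; check the documentation for the actual rules. The helper names ``margin_bounds`` and ``within_margin`` are hypothetical and not part of sosia:

.. code-block:: python

    from math import ceil


    def margin_bounds(value, margin):
        """Return the (low, high) interval around ``value``.

        Assumption: a float margin is a share of ``value`` (rounded up),
        an int margin is an absolute amount.
        """
        if isinstance(margin, float):
            width = ceil(value * margin)
        else:
            width = margin
        return max(value - width, 0), value + width


    def within_margin(candidate_value, original_value, margin):
        """Check whether a candidate's value lies inside the interval."""
        low, high = margin_bounds(original_value, margin)
        return low <= candidate_value <= high


    # Illustrative numbers only: an original with 10 publications and
    # pub_margin=0.2 would accept candidates with 8 to 12 publications
    print(margin_bounds(10, 0.2))
    # An integer margin of 1 around a first publication year of 2013
    print(margin_bounds(2013, 1))

Under this reading, a relative margin scales with the original scientist's record, while an absolute margin keeps the window fixed; the latter is the more natural choice for calendar years.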

Change log
==========

Please see `CHANGES.rst <./meta/CHANGES.rst>`_.

Contributing
============

Please see ``CONTRIBUTING.rst``.  For the list of contributors see
`AUTHORS.rst <./meta/AUTHORS.rst>`_.

License
=======

MIT License; see ``LICENSE``.

Owner

  • Name: sosia-dev
  • Login: sosia-dev
  • Kind: organization

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If sosia helped you in your research, please cite it using the following:"
title: "Finding Doppelgängers in Scopus: how to build scientists control groups using sosia"
doi: 10.1007/s11192-025-05298-y
authors:
  - family-names: Rose
    given-names: Michael E.
    orcid: 0000-0002-4128-4236
  - family-names: Baruffaldi
    given-names: Stefano H.
    orcid: 0000-0003-4022-7691
preferred-citation:
  type: article
  authors:
    - family-names: Rose
      given-names: Michael E.
      orcid: 0000-0002-4128-4236
    - family-names: Baruffaldi
      given-names: Stefano H.
      orcid: 0000-0003-4022-7691
  title: "Finding Doppelgängers in Scopus: how to build scientists control groups using sosia"
  journal: "Scientometrics"
  doi: 10.1007/s11192-025-05298-y
  year: 2025

GitHub Events

Total
  • Issues event: 8
  • Watch event: 2
  • Issue comment event: 22
  • Push event: 73
  • Pull request review event: 12
  • Pull request review comment event: 11
  • Pull request event: 7
  • Fork event: 1
  • Create event: 2
Last Year
  • Issues event: 8
  • Watch event: 2
  • Issue comment event: 22
  • Push event: 73
  • Pull request review event: 12
  • Pull request review comment event: 11
  • Pull request event: 7
  • Fork event: 1
  • Create event: 2

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 36
  • Total pull requests: 8
  • Average time to close issues: about 1 year
  • Average time to close pull requests: 6 months
  • Total issue authors: 4
  • Total pull request authors: 2
  • Average comments per issue: 1.14
  • Average comments per pull request: 2.5
  • Merged pull requests: 6
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 7
  • Pull requests: 6
  • Average time to close issues: 21 days
  • Average time to close pull requests: 8 days
  • Issue authors: 2
  • Pull request authors: 1
  • Average comments per issue: 0.86
  • Average comments per pull request: 3.0
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Michael-E-Rose (19)
  • ste210 (14)
  • nils-herrmann (2)
  • quaquel (1)
Pull Request Authors
  • nils-herrmann (12)
  • ste210 (3)
Top Labels
Issue Labels
enhancement (23) bug (8) invalid (1) good first issue (1) info (1)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 21 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 14
  • Total maintainers: 1
pypi.org: sosia

Find control groups for academics in Scopus

  • Versions: 14
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 21 Last month
Rankings
Dependent packages count: 10.1%
Downloads: 16.2%
Stargazers count: 16.6%
Average: 16.7%
Forks count: 19.2%
Dependent repos count: 21.6%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • numpy *
  • pandas >=1.1.5
  • pbr >=5
  • pybliometrics >=3.2.0
  • simplejson *