Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.6%) to scientific vocabulary

Repository

Basic Info
  • Host: GitHub
  • Owner: panos-span
  • License: mit
  • Language: Python
  • Default Branch: main
  • Size: 1.95 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 2
  • Releases: 0
Created over 1 year ago · Last pushed 9 months ago
Metadata Files

README.md

Implementation and analysis of the EigenFactor-adjusted H-Index.

The EigenFactor metric evaluates the quality of journals based on their influence and impact. The implementation includes:

  • Data extraction from the Alexandria3k tool.
  • Calculation of the EigenFactor-adjusted H-Index for top authors.
  • Analysis of citation practices and patterns based on the EigenFactor ranking.
  • Comparison of top authors publishing in lower-tier journals with random authors publishing in top-tier journals.
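
The README does not spell out how the adjustment is computed, so the following is only a minimal sketch under a stated assumption: each paper's citation count is scaled by a weight derived from its journal's EigenFactor score, and the usual h-index rule is then applied to the scaled counts. The names `h_index`, `adjusted_h_index`, and the sample weights are hypothetical and not taken from the repository.

```python
from typing import Iterable


def h_index(citation_counts: Iterable[float]) -> int:
    """Largest h such that h papers each have at least h citations."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def adjusted_h_index(papers: Iterable[tuple[int, float]]) -> int:
    """h-index over citation counts scaled by a per-journal weight.

    Each paper is (citations, journal_weight). The weighting scheme is an
    assumption made for illustration, not the repository's definition.
    """
    return h_index(citations * weight for citations, weight in papers)


# Hypothetical author: (citations, journal weight) per paper.
papers = [(10, 1.5), (8, 0.25), (5, 1.0), (4, 0.25), (3, 2.0)]
print(h_index(c for c, _ in papers))  # plain h-index -> 4
print(adjusted_h_index(papers))       # weighted variant -> 3
```

In this toy data the plain h-index is 4, while the weighted variant drops to 3 because two of the papers appear in journals with low weights.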

Requirements

This project uses the following libraries:

  • alexandria3k for data extraction.
  • rdbunit for database unit testing.

It also uses common data tools such as pandas, numpy, matplotlib, and seaborn.

Exact requirements can be found in the requirements.txt file.

Instructions

Prerequisites

External files needed for the project:

  • get_citation_network.txt: This file is needed to get the citation network from the Alexandria3k tool. It is a copy of the original file, which is not included in this repository. You can find the tables necessary for its generation in the citation_network_(if) folder.
  • get_issn_subject: This file is needed to get the ISSN and subject of the journals from the Alexandria3k tool. It is a copy of the original file, which is not included in this repository. To obtain the ISSN and subject of the journals, run an SQL query on the Alexandria3k database built from the Crossref 2023 data.

Install Crossref 202X

Install at the desired path with the following command:

    aria2c http://dx.doi.org/XXX/XXX

Then for convenience, rename the directory:

    mv 'April 2024 Public Data File from Crossref' Crossref-April-2024

Populate Database

Now we can populate the database with the Crossref data. From the project folder, run the following command:

    make populate

Warning: This will take a while, because it downloads the entire Crossref dataset and populates the database with it.

Additional Note: The database is populated with data for the year of the Crossref release in use (202X). The years are set in the Makefile and can be changed if needed.

Structure

The project is structured as follows:

Data Processing

  • citation_network_(if)/ - Citation network extraction code for eigenfactor calculation
  • common/ - Makefiles for database population and dependency installation
  • base_tables/ - Core tables that form the backbone of the analysis structure
  • top_tables/ - Top tables generation for eigenfactor analysis
  • bottom_tables/ - Bottom tables generation for eigenfactor analysis

Analysis Components

  • simplecitationanalysis/ - Simple citation analysis for eigenfactor-adjusted h-index
  • citationnetworkanalysis/ - Complex citation network analysis for top/bottom authors
  • analysis_files/ - General analysis files for eigenfactor tables
  • orcidh5calculations/ - Base calculations for h5-index
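
For readers unfamiliar with the h5-index, here is a minimal sketch. It assumes the common definition (the h-index restricted to works published in a five-year window) and a hypothetical input of `(publication_year, citations)` pairs; the repository's own calculation and window boundaries may differ.

```python
def h5_index(papers, reference_year):
    """h-index over papers published in the five years ending at reference_year.

    `papers` is an iterable of (publication_year, citations) pairs.
    The window definition is an assumption made for illustration.
    """
    recent = sorted(
        (cites for year, cites in papers
         if reference_year - 4 <= year <= reference_year),
        reverse=True,
    )
    # With counts sorted in descending order, `cites >= rank` holds exactly
    # for the first h ranks, so counting the matches yields h.
    return sum(1 for rank, cites in enumerate(recent, start=1) if cites >= rank)


# Hypothetical publication record; the 2015 paper falls outside the window.
papers = [(2015, 50), (2019, 6), (2020, 4), (2021, 3), (2022, 1), (2023, 9)]
print(h5_index(papers, reference_year=2023))  # -> 3
```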

Comparison tables & Statistics

  • mw_statistics/ - Mann-Whitney U statistics analysis
  • randomauthorstables/ - Random author tables for citation network comparison
  • author_matching/ - Author matching between top and bottom tables
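
As a rough illustration of the Mann-Whitney U statistic used to compare two groups of authors, the sketch below computes U from pooled ranks using only the standard library. It is not the repository's code, which may rely on a statistics package instead; the function name `mann_whitney_u` and the sample values are hypothetical, and no p-value is computed.

```python
from itertools import groupby


def mann_whitney_u(x: list[float], y: list[float]) -> tuple[float, float]:
    """Return (U_x, U_y), using midranks for tied values."""
    # Pool both samples, remembering which sample each value came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    rank_sum_x = 0.0
    position = 0  # number of pooled values already ranked
    for _, group in groupby(pooled, key=lambda item: item[0]):
        members = list(group)
        # Tied values share the average of the ranks they occupy.
        midrank = position + (len(members) + 1) / 2
        rank_sum_x += midrank * sum(1 for _, label in members if label == 0)
        position += len(members)
    u_x = rank_sum_x - len(x) * (len(x) + 1) / 2
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


# Hypothetical index values for two author groups.
top_authors = [12, 15, 9, 20]
random_authors = [7, 9, 5, 11]
print(mann_whitney_u(top_authors, random_authors))
```

U_x counts the pairs in which a value from the first sample exceeds a value from the second, with ties counted as one half, so U_x + U_y always equals `len(x) * len(y)`.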

Output & Testing

  • print/ - Code for table printing and basic statistics
  • tests/ - Test suite using rdbunit for code validation

Owner

  • Name: Panagiotis Spanakis
  • Login: panos-span
  • Kind: user
  • Location: Athens
  • Company: AUEB

Undergraduate Student, Management Science and Technology

Citation (citation-graph-indexes.sql)

-- Indexes required by citation-graph.py

CREATE INDEX IF NOT EXISTS work_references_doi_idx ON work_references(doi);
CREATE INDEX IF NOT EXISTS works_id_idx ON works(id);
CREATE INDEX IF NOT EXISTS works_doi_idx ON works(doi);
CREATE INDEX IF NOT EXISTS work_references_work_id_idx ON work_references(work_id);

SELECT 1;
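
To show what these indexes support, the sketch below applies them to an in-memory SQLite database and resolves citation edges by joining references to works on the DOI. The two-column tables are a hypothetical minimal schema inferred from the index definitions above; the real Alexandria3k tables contain more columns, and citation-graph.py may query them differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical minimal schema; only the indexed columns are modeled.
    CREATE TABLE works (id INTEGER, doi TEXT);
    CREATE TABLE work_references (work_id INTEGER, doi TEXT);

    INSERT INTO works VALUES (1, '10.1/a'), (2, '10.1/b'), (3, '10.1/c');
    INSERT INTO work_references VALUES
        (2, '10.1/a'), (3, '10.1/a'), (3, '10.1/b');

    CREATE INDEX IF NOT EXISTS work_references_doi_idx ON work_references(doi);
    CREATE INDEX IF NOT EXISTS works_id_idx ON works(id);
    CREATE INDEX IF NOT EXISTS works_doi_idx ON works(doi);
    CREATE INDEX IF NOT EXISTS work_references_work_id_idx ON work_references(work_id);
""")

# Citation edges: citing work id -> cited work id, resolved through the DOI.
edges = conn.execute("""
    SELECT work_references.work_id AS citing_id, works.id AS cited_id
    FROM work_references
    JOIN works ON works.doi = work_references.doi
    ORDER BY citing_id, cited_id
""").fetchall()
print(edges)  # -> [(2, 1), (3, 1), (3, 2)]
```

Without an index on the DOI columns, a join like this would scan one of the tables for every row of the other, which matters at the scale of the full Crossref dataset.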

GitHub Events

Total
  • Issues event: 2
  • Push event: 8
  • Create event: 2
Last Year
  • Issues event: 2
  • Push event: 8
  • Create event: 2