csindex

Transparent data about Brazilian scientific production in Computer Science

https://github.com/aserg-ufmg/csindex

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.4%) to scientific vocabulary

Keywords

academic-publishing computer-science scientific-publishing
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: aserg-ufmg
  • License: mit
  • Language: HTML
  • Default Branch: master
  • Homepage: http://csindexbr.org
  • Size: 17.9 MB
Statistics
  • Stars: 36
  • Watchers: 7
  • Forks: 21
  • Open Issues: 0
  • Releases: 7
Topics
academic-publishing computer-science scientific-publishing
Created about 8 years ago · Last pushed 8 months ago
Metadata Files
Readme License Citation Authors

README.md

CSIndexbr

CSIndexbr (https://csindexbr.org) provides transparent data about Brazilian scientific production in Computer Science. We index full research papers published in selected conferences and journals. The papers are retrieved from DBLP.

Dependencies

The CSIndexbr backend scripts are implemented in Python 3.9. The front end uses HTML and plain JavaScript.

We also use:

  • requests: "a simple, yet elegant HTTP library", which is used to retrieve data from DBLP.
  • xmltodict: "a Python module that makes working with XML feel like you are working with JSON", which is used to parse the XML files returned by DBLP.
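As a rough illustration of the parsing step, the sketch below extracts conference and journal records from a DBLP-style XML document. It is not the CSIndexbr code: it uses the standard library's xml.etree.ElementTree instead of xmltodict so that it runs without extra packages, it works on an in-memory string instead of a download made with requests, and the sample XML is hand-written to imitate the shape of a DBLP person file (an assumption). The function name extract_records is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hand-written sample imitating a DBLP person file (assumed shape, invented data).
SAMPLE_XML = """<dblpperson name="Jane Doe">
  <r><inproceedings key="conf/x/Doe20">
    <author>Jane Doe</author>
    <title>A Sample Conference Paper.</title>
    <pages>101-112</pages>
    <year>2020</year>
    <booktitle>ICSE</booktitle>
    <url>db/conf/icse/icse2020.html#Doe20</url>
  </inproceedings></r>
  <r><article key="journals/y/Doe21">
    <author>Jane Doe</author>
    <title>A Sample Journal Paper.</title>
    <year>2021</year>
    <journal>IEEE Trans. Software Eng.</journal>
    <url>db/journals/tse/tse47.html#Doe21</url>
  </article></r>
</dblpperson>"""


def extract_records(xml_text):
    """Return one dict per conference or journal record found in xml_text."""
    root = ET.fromstring(xml_text)
    records = []
    # Conferences carry the venue in "booktitle", journals in "journal".
    for tag, venue_field in (("inproceedings", "booktitle"), ("article", "journal")):
        for node in root.iter(tag):
            records.append({
                "kind": "C" if tag == "inproceedings" else "J",
                "venue": node.findtext(venue_field),
                "title": node.findtext("title"),
                "year": node.findtext("year"),
                "pages": node.findtext("pages"),  # None when no pages are listed
                "url": node.findtext("url"),
            })
    return records


for record in extract_records(SAMPLE_XML):
    print(record["kind"], record["year"], record["venue"])
```

The "booktitle", "journal", and "url" fields read here are the same XML entries that the input files described below refer to.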

Scripts

All these scripts must be run from the "data" folder:

  • ./run se pl chi: update the papers (and related data) for the listed research areas (se, pl, and chi, in the example).

  • ./runall: update the papers (and related data) for all research areas.

  • ./rundblp: download the DBLP files (XML, with papers) for all tracked professors.

  • ./runcitations: update citations for all research areas.

Input files

These files must be placed in the "data" folder:

There are two "global" configuration files:

  • all-researchers.csv: Brazilian CS professors (i.e., from CS departments) whose papers are tracked by CSIndexbr, with three columns:

    • Professor name (do not use "-" or accents in names)
    • University (do not use distinct names for the same university; e.g. PUC-Rio and PUC-RIO)
    • DBLP PID (see in this screenshot how to retrieve PIDs from DBLP profiles)
  • research-areas-config.csv: research areas covered by CSIndexbr, with two columns:

    • research area acronym (e.g., se)
    • minimum size of the conference papers indexed in this area (e.g., 10).
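The naming rules for all-researchers.csv can be checked mechanically before running the scripts. The following is a minimal sketch, not part of CSIndexbr: it assumes the file is comma-separated with no header row, and the function name check_researchers, the sample rows, and the PIDs are invented for illustration.

```python
import csv
import io
from collections import defaultdict

# Invented sample rows: name, university, DBLP PID (comma-separated is an assumption).
SAMPLE = """Jane Doe,PUC-Rio,00/0001
Joao Silva,PUC-RIO,00/0002
José Souza,UFMG,00/0003
"""


def check_researchers(csv_text):
    """Return a list of human-readable problems found in the rows."""
    problems = []
    spellings = defaultdict(set)  # lower-cased university -> spellings seen
    for line_no, row in enumerate(csv.reader(io.StringIO(csv_text)), start=1):
        if not row:
            continue  # ignore blank lines
        if len(row) != 3:
            problems.append(f"line {line_no}: expected 3 columns, got {len(row)}")
            continue
        name, university, _pid = (field.strip() for field in row)
        # Names must not contain "-" or accented (non-ASCII) characters.
        if "-" in name or not name.isascii():
            problems.append(f"line {line_no}: name has '-' or accents: {name}")
        spellings[university.lower()].add(university)
    # The same university must always be written the same way.
    for variants in spellings.values():
        if len(variants) > 1:
            problems.append(
                "university spelled inconsistently: " + ", ".join(sorted(variants))
            )
    return problems


for problem in check_researchers(SAMPLE):
    print(problem)
```

This check only catches spellings that differ in letter case; other variations of a university name would still need a manual review.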

The following files are specific to a given research area (i.e., each area has all the files listed next; in this list, we use "se" as the example):

  • se-confs.csv: conferences and journals indexed in a given research area ("se", in this case), with three columns:

    • venue name at DBLP:
      • for conferences, use the "booktitle" XML entry (see example);
      • for journals, use the "journal" XML entry (see example)
    • venue name in the charts and tables generated by CSIndexbr
    • venue type, as follows:
      • 1: top conference
      • 2: not used anymore
      • 3: "regular" conference (i.e., non-top)
      • 4: top journal
      • 5: "regular" journal (i.e., non-top)
      • 6: magazine or journal that accepts short papers (>= 6 pages)
      • 7: journal with a low normalized h5-index (see the FAQ for details)
  • se-black-list.txt: list of papers that must not be indexed, even though they meet the basic indexing criteria. For example, papers published in tracks other than the main research track of a conference. Each line contains the "url" XML field of the paper (see example).

  • se-white-list.txt: list of papers that must be indexed even though they would otherwise be skipped. For example, papers that do not have page numbers in their DBLP metadata (see example).
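One way to picture how the minimum paper size and the two lists interact is the sketch below. It is an illustration under stated assumptions, not the CSIndexbr implementation: the precedence (black list first, then white list, then page count) is assumed, page ranges are assumed to look like "101-112", and the names VENUE_TYPES, page_count, and should_index_conf_paper are hypothetical.

```python
# Venue type codes as described in the list above.
VENUE_TYPES = {
    1: "top conference",
    2: "not used anymore",
    3: "regular conference",
    4: "top journal",
    5: "regular journal",
    6: "magazine or journal accepting short papers",
    7: "journal with low normalized h5-index",
}


def page_count(pages):
    """Return the page count for a 'first-last' range, or None if unknown."""
    if not pages or "-" not in pages:
        return None
    first, last = pages.split("-", 1)
    if not (first.isdigit() and last.isdigit()):
        return None  # other formats are treated as unknown in this sketch
    return int(last) - int(first) + 1


def should_index_conf_paper(url, pages, min_pages, black_list, white_list):
    """Decide whether a conference paper is indexed (assumed precedence)."""
    if url in black_list:
        return False  # explicitly excluded, e.g. not from the main track
    if url in white_list:
        return True   # explicitly included, e.g. missing page numbers
    count = page_count(pages)
    return count is not None and count >= min_pages


# Usage with invented URLs: a 12-page paper passes a minimum size of 10.
print(should_index_conf_paper("db/conf/x/a.html#P1", "101-112", 10, set(), set()))
```

In this sketch a paper without page numbers is never indexed unless its URL is on the white list, which matches the example given for se-white-list.txt.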

Output files

These files are generated in the "data" folder:

Examples, assuming the "se" research area:

  • se-out-confs.csv: number of papers in indexed conferences
  • se-out-journals.csv: number of papers in indexed journals
  • se-out-profs-list.csv: professors with indexed papers in the area (and their departments)
  • se-out-profs.csv: number of professors with indexed papers (in the area) per department
  • se-out-scores.csv: department scores (see formula in the FAQ)
  • se-out-papers.csv: metadata about indexed papers: year, venue, title, departments, authors, doi, top or null (otherwise), journal (J) or conference (C), arxiv url or no_arxiv (otherwise), and number of citations
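For downstream analysis, the rows of se-out-papers.csv could be loaded into typed records. The sketch below follows the column order listed above, but it rests on assumptions that the README does not state: comma-separated values, quoted fields where needed, and no header row. The Paper class, the parse_papers function, and the sample row are invented for illustration.

```python
import csv
import io
from dataclasses import dataclass
from typing import Optional


@dataclass
class Paper:
    year: int
    venue: str
    title: str
    departments: str
    authors: str
    doi: str
    is_top: bool              # True when the column holds "top"
    kind: str                 # "J" (journal) or "C" (conference)
    arxiv_url: Optional[str]  # None when the column holds "no_arxiv"
    citations: int


# Invented sample row, in the column order described above.
SAMPLE_ROWS = (
    '2020,ICSE,"A Sample Paper, With a Comma",UFMG,Jane Doe,'
    "10.0000/example,top,C,no_arxiv,7\n"
)


def parse_papers(csv_text):
    """Parse CSV text into Paper records, skipping malformed lines."""
    papers = []
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) != 10:
            continue  # blank or malformed line
        year, venue, title, depts, authors, doi, top, kind, arxiv, cites = row
        papers.append(Paper(
            year=int(year), venue=venue, title=title, departments=depts,
            authors=authors, doi=doi, is_top=(top == "top"), kind=kind,
            arxiv_url=None if arxiv == "no_arxiv" else arxiv,
            citations=int(cites),
        ))
    return papers


for paper in parse_papers(SAMPLE_ROWS):
    print(paper.year, paper.venue, paper.title, paper.citations)
```

To read a real file, the same function can be given the text returned by open("se-out-papers.csv", encoding="utf-8").read(), provided the assumptions about the delimiter and header hold.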

License:

MIT (for the source code) and CC BY-NC-SA 4.0 (for the data).

Owner

  • Name: Applied Software Engineering Research Group
  • Login: aserg-ufmg
  • Kind: organization
  • Location: Belo Horizonte, Brazil

Department of Computer Science, Federal University of Minas Gerais

GitHub Events

Total
  • Create event: 1
  • Release event: 1
  • Issues event: 2
  • Watch event: 3
  • Push event: 36
Last Year
  • Create event: 1
  • Release event: 1
  • Issues event: 2
  • Watch event: 3
  • Push event: 36

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 1,669
  • Total Committers: 2
  • Avg Commits per committer: 834.5
  • Development Distribution Score (DDS): 0.013
Past Year
  • Commits: 53
  • Committers: 1
  • Avg Commits per committer: 53.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Marco Tulio Valente m****e@g****m 1,648
Klerisson Paixao k****o@g****m 21

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 10
  • Total pull requests: 16
  • Average time to close issues: 15 days
  • Average time to close pull requests: 6 days
  • Total issue authors: 8
  • Total pull request authors: 3
  • Average comments per issue: 1.7
  • Average comments per pull request: 0.19
  • Merged pull requests: 13
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: 5 months
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • marciomribeiro (2)
  • gustavopinto (2)
  • mauricioaniche (1)
  • klerisson (1)
  • faustocarva (1)
  • damorim (1)
  • rsmbarros (1)
  • denilsonalvespereira (1)
Pull Request Authors
  • klerisson (14)
  • mafra456 (1)
  • meneguzzi (1)