https://github.com/5uperpalo/surname_heritage_classifier

https://github.com/5uperpalo/surname_heritage_classifier

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: 5uperpalo
  • License: apache-2.0
  • Language: Jupyter Notebook
  • Default Branch: master
  • Size: 866 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 2 years ago · Last pushed 10 months ago
Metadata Files
Readme License

README.MD

SUrname HEritage Classifier

Documentation: https://5uperpalo.github.io/surnameheritageclassifier/

An old hobby project to classify surnames to countries and areas of the world. An attempt for an open source alternative to paid services:

  • https://nationalize.io/our-data
  • https://namsor.app/
  • https://forebears.io/onograph/
  • https://census.name/
    • ~1000e for their database

used data:

aggregated data:

code based on:

  • https://www.kaggle.com/code/yonatankpl/surname-classification-with-bert

other ideas:

  • query names and origin countries somehow from wiki https://opendata.stackexchange.com/a/13199
    • maybe somehow get more surnames from here: https://en.wiktionary.org/wiki/Appendix:Names
  • rerun data gathering from wiki-nationality-estimate

Owner

  • Name: Pavol Mulinka
  • Login: 5uperpalo
  • Kind: user
  • Location: Barcelona, ES
  • Company: CTTC

Data Scientist / Machine learning Enthusiast & former network engineer

GitHub Events

Total
  • Push event: 1
  • Pull request event: 2
  • Create event: 1
Last Year
  • Push event: 1
  • Pull request event: 2
  • Create event: 1

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: 3 minutes
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: 3 minutes
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • 5uperpalo (1)
Top Labels
Issue Labels
Pull Request Labels