https://github.com/5uperpalo/surname_heritage_classifier
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.9%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
Basic Info
- Host: GitHub
- Owner: 5uperpalo
- License: apache-2.0
- Language: Jupyter Notebook
- Default Branch: master
- Size: 866 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Created about 2 years ago
· Last pushed 10 months ago
Metadata Files
Readme
License
README.MD
SUrname HEritage Classifier
Documentation: https://5uperpalo.github.io/surnameheritageclassifier/
An old hobby project to classify surnames to countries and areas of the world. An attempt for an open source alternative to paid services:
- https://nationalize.io/our-data
- https://namsor.app/
- https://forebears.io/onograph/
- https://census.name/
- ~1000e for their database
used data:
- data/name_dataset
- data/annotatednamesNamePrism.tsv
- kaggle surname-dataset-classification
- data/finalallnames_code.csv
- data/name2lang.txt
aggregated data:
code based on:
- https://www.kaggle.com/code/yonatankpl/surname-classification-with-bert
other ideas:
- query names and origin countries somehow from wiki https://opendata.stackexchange.com/a/13199
- maybe somehow get more surnames from here: https://en.wiktionary.org/wiki/Appendix:Names
- rerun data gathering from wiki-nationality-estimate
Owner
- Name: Pavol Mulinka
- Login: 5uperpalo
- Kind: user
- Location: Barcelona, ES
- Company: CTTC
- Website: https://5uperpalo.github.io/online-cv/
- Repositories: 18
- Profile: https://github.com/5uperpalo
Data Scientist / Machine learning Enthusiast & former network engineer
GitHub Events
Total
- Push event: 1
- Pull request event: 2
- Create event: 1
Last Year
- Push event: 1
- Pull request event: 2
- Create event: 1
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: 3 minutes
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: 3 minutes
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- 5uperpalo (1)