word2vec-russian-novels
Inspired by word2vec-pride-vis: the words in the text of the most valuable Russian novels are replaced with their closest words in a word2vec model. By Boris Orekhov 📚
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: found CITATION.cff file
- ✓ codemeta.json file: found codemeta.json file
- ✓ .zenodo.json file: found .zenodo.json file
- ✓ DOI references: found 3 DOI reference(s) in README
- ✓ Academic publication links: links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (5.0%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: nevmenandr
- Language: Jupyter Notebook
- Default Branch: master
- Homepage: https://nevmenandr.github.io/novel2vec/
- Size: 9.29 MB
Statistics
- Stars: 48
- Watchers: 2
- Forks: 12
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
word2vec-russian-novels 📖
A fun digital humanities project by Boris Orekhov.
Inspired by this work (word2vec-pride-vis), the project replaces the words in the text of the most valuable Russian novels with their closest words in a word2vec model.
I used a model (ruwikiruscorpora) from the RusVectōrēs project.
Other dependencies:
* gensim
* pymorphy2
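The replacement idea can be sketched roughly as follows. This is a minimal illustration and not the notebook code from this repository: the `TOY_VECTORS` table, the helper names `nearest_word` and `replace_words`, and the English example words are all assumptions made for the example. It uses only the standard library, stands in a tiny hand-made vector table for the ruwikiruscorpora model, and skips the morphological step that pymorphy2 would handle.

```python
import math
import re

# Hypothetical 2-d vectors standing in for a trained word2vec model.
TOY_VECTORS = {
    "king": (0.9, 0.1),
    "monarch": (0.88, 0.15),
    "river": (0.1, 0.9),
    "stream": (0.12, 0.85),
}


def cosine(a, b):
    """Cosine similarity between two 2-d vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def nearest_word(word, vectors):
    """Return the most similar *other* word, or None if the word is unknown."""
    if word not in vectors:
        return None
    target = vectors[word]
    candidates = [
        (cosine(target, vec), other)
        for other, vec in vectors.items()
        if other != word
    ]
    return max(candidates)[1] if candidates else None


def replace_words(text, vectors):
    """Swap every token that has a known vector for its nearest neighbour."""
    def swap(match):
        token = match.group(0)
        neighbour = nearest_word(token.lower(), vectors)
        # Tokens missing from the vocabulary are left untouched.
        return neighbour if neighbour is not None else token

    return re.sub(r"\w+", swap, text)


print(replace_words("The king crossed the river.", TOY_VECTORS))
```

With a real model loaded through gensim, the nearest-neighbour lookup would presumably go through the library's own similarity query (for example `KeyedVectors.most_similar`) rather than a hand-written cosine loop, and the replacement word would still need to be inflected to match the original form.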
Possible applications:
* Fun
* A source of tasks for so-called "olympic" competitions in literature
* A basis for literary studies that address the principal question "why this word, and not another?"
Owner
- Name: Boris Orekhov
- Login: nevmenandr
- Kind: user
- Location: Moscow
- Website: https://nevmenandr.github.io
- Twitter: nevmenandr
- Repositories: 42
- Profile: https://github.com/nevmenandr
Digital humanities researcher
Citation (CITATION.cff)
cff-version: 1.2.0
title: word2vec Russian novels
message: >-
  If you use this dataset, please cite it using the metadata
  from this file.
type: dataset
authors:
  - given-names: Boris
    family-names: Orekhov
    email: nevmenandr@gmail.com
    affiliation: HSE University
    orcid: 'https://orcid.org/0000-0002-9099-0436'
identifiers:
  - type: doi
    value: 10.5281/zenodo.12814086
repository-code: 'https://github.com/nevmenandr/word2vec-russian-novels'
url: 'https://nevmenandr.github.io/novel2vec/'
abstract: >-
  Inspired by word2vec-pride-vis the replacement of words of
  Russian most valuable novels text with closest word2vec
  model words.
keywords:
  - word2vec
  - Russian literature
  - Digital Humanities
license: GPL-3.0
commit: 87ddfbdb3737f0494da17d7048ff96b15f7012ba
version: 1.0.0
date-released: '2019-10-20'
GitHub Events
Total
- Watch event: 3
Last Year
- Watch event: 3
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 1
- Total pull requests: 0
- Average time to close issues: about 13 hours
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 2.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- GenTxt (1)