word2vec-russian-novels

Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris Orekhov 📚

https://github.com/nevmenandr/word2vec-russian-novels

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.0%) to scientific vocabulary

Keywords

digital-humanities russian-literature word2vec word2vec-russian-novels
Last synced: 6 months ago · JSON representation ·

Repository

Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris Orekhov 📚

Basic Info
Statistics
  • Stars: 48
  • Watchers: 2
  • Forks: 12
  • Open Issues: 0
  • Releases: 1
Topics
digital-humanities russian-literature word2vec word2vec-russian-novels
Created almost 9 years ago · Last pushed over 1 year ago
Metadata Files
Readme Citation

README.md

DOI

Jupyter Notebook

word2vec-russian-novels 📖

Fun digital humanities project by Boris Orekhov

Inspired by this work the replacement of words of Russian most valuable novels text with closest word2vec model words.

I used a model (ruwikiruscorpora) from RusVectōrēs project.

Other dependencies: * gensim * pymorphy2

Possible applications: * Fun * Source for tests for so called "olympic" competitions in literature * Base for literary studies that include the principle question "why this word, not the other?"

Owner

  • Name: Boris Orekhov
  • Login: nevmenandr
  • Kind: user
  • Location: Moscow

Digital humanities researcher

Citation (CITATION.cff)

cff-version: 1.2.0
title: word2vec Russian novels
message: >-
  If you use this dataset, please cite it using the metadata
  from this file.
type: dataset
authors:
  - given-names: Boris
    family-names: Orekhov
    email: nevmenandr@gmail.com
    affiliation: HSE University
    orcid: 'https://orcid.org/0000-0002-9099-0436'
identifiers:
  - type: doi
    value: 10.5281/zenodo.12814086
repository-code: 'https://github.com/nevmenandr/word2vec-russian-novels'
url: 'https://nevmenandr.github.io/novel2vec/'
abstract: >-
  Inspired by word2vec-pride-vis the replacement of words of
  Russian most valuable novels text with closest word2vec
  model words.
keywords:
  - word2vec
  - Russian literature
  - Digital Humanities
license: GPL-3.0
commit: 87ddfbdb3737f0494da17d7048ff96b15f7012ba
version: 1.0.0
date-released: '2019-10-20'

GitHub Events

Total
  • Watch event: 3
Last Year
  • Watch event: 3

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 1
  • Total pull requests: 0
  • Average time to close issues: about 13 hours
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 2.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • GenTxt (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels