video-game-text-dataset
Collection of videogame text datasets from Library of Codexes. Text data from 30+ different videogame series.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (7.7%) to scientific vocabulary
Repository
Statistics
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 1
- Releases: 0
Metadata Files
README.MD
video-game-text-dataset
In-game text such as notes, letters, codex entries, and audio recordings, collected into JSON format.
Datasets
The data used to create these datasets was collected from a variety of sources (wikis, manual transcription, and in-game files containing the text) for LibraryofCodexes. There are bound to be some mistakes, but I've tried to sanitize the text as best I can.
All datasets are in JSON format.
Please refer to the individual series folder for more information regarding each series.
- Assassin's Creed
- Baldur's Gate
- Battlefield
- Crysis
- Dead Space
- Destiny
- Deus Ex
- Diablo
- Dishonored
- Doom
- Dragon Age
- Dying Light
- Fable
- Fallout
- Gears of War
- Horizon Zero Dawn
- Kingdoms of Amalur
- Mass Effect
- Metroid Prime
- Middle-Earth
- Nier
- Red Dead Redemption
- Resident Evil
- Star Wars: The Old Republic
- System Shock
- The Division
- The Elder Scrolls
- The Last of Us
- The Witcher
- Tomb Raider
- Watch Dogs
- World of Warcraft
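Since every dataset is JSON and each series has its own folder, loading one series can be sketched roughly as follows. This is a minimal sketch, not part of the repository: the function name `load_series` is hypothetical, the exact folder names and the structure of each JSON file are not specified here, and UTF-8 encoding is an assumption. The code therefore returns whatever each file contains without interpreting it.

```python
import json
from pathlib import Path


def load_series(repo_root, series_folder):
    """Load every JSON file under one series folder.

    Returns a dict mapping each file's path (relative to the series
    folder, with forward slashes) to its parsed JSON content.
    The JSON structure itself is not assumed; values are returned as-is.
    """
    folder = Path(repo_root) / series_folder
    if not folder.is_dir():
        raise FileNotFoundError(f"No such series folder: {folder}")

    datasets = {}
    # rglob also picks up JSON files in nested subfolders, if there are any.
    for path in sorted(folder.rglob("*.json")):
        # Assumption: the files are UTF-8 encoded.
        with path.open(encoding="utf-8") as handle:
            datasets[path.relative_to(folder).as_posix()] = json.load(handle)
    return datasets
```

For example, `load_series(".", "some-series-folder")` run from a local clone would return one entry per JSON file in that folder. The folder name here is a placeholder, so check the repository for the actual names.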
Scientific paper
This repository does not currently have a paper written for it. If you use the data, please use the 'Cite this repository' option in the About section.
Games
The datasets were extracted from the following commercial video games. The games and the game assets are copyright the respective game publishers and game developers. If you use the datasets, don't forget to cite the games.
```
@misc{game:starwarsknightsoftheoldrepublic,
  title = {\emph{Star Wars: Knights of the Old Republic}},
  year = {2003},
  organization = {LucasArts},
  publisher = {LucasArts},
  author = {{BioWare}},
  Howpublished = {Game [PC]},
  Note = {LucasArts, San Francisco, US},
}

@misc{gamesseries:tes,
  title = {\emph{The Elder Scrolls I-V} and \emph{The Elder Scrolls Online}},
  date = {1994/2014},
  year = {1994--2014},
  organization = {Bethesda Softworks},
  publisher = {Bethesda Softworks},
  author = {{Bethesda Softworks}},
  Howpublished = {Game series [PC]},
  Note = {Bethesda Softworks, Rockville, Maryland, US},
}
```
Owner
- Login: Davis24
- Kind: user
- Location: United States
- Website: davis24.github.io
- Repositories: 1
- Profile: https://github.com/Davis24
Citation (CITATION.cff)
abstract: This is a collection of video game text data originally generated for Library of Codexes.
authors:
  - family-names: Davis
    given-names: Megan
cff-version: 1.2.0
date-released: "2021-08-27"
message: If you use this software, please cite it using these metadata.
repository-code: "https://github.com/Davis24/video-game-text-dataset"
title: Video Game Text Dataset
GitHub Events
Total
- Watch event: 3
Last Year
- Watch event: 3