video-game-text-dataset

Collection of videogame text datasets from Library of Codexes. Text data from 30+ different videogame series.

https://github.com/davis24/video-game-text-dataset

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.7%) to scientific vocabulary

Keywords

dataset text video-games
Last synced: 6 months ago · JSON representation ·

Repository

Collection of videogame text datasets from Library of Codexes. Text data from 30+ different videogame series.

Basic Info
  • Host: GitHub
  • Owner: Davis24
  • Default Branch: master
  • Homepage:
  • Size: 10.8 MB
Statistics
  • Stars: 4
  • Watchers: 1
  • Forks: 1
  • Open Issues: 1
  • Releases: 0
Topics
dataset text video-games
Created over 4 years ago · Last pushed over 4 years ago
Metadata Files
Readme Citation

README.MD

video-game-text-dataset

Collected in-game text like notes, letters, codex entries, and audio recordings into JSON format.

Datasets

The data used to created these datasets was collected from a variety of sources (wikis, transcribing, finding in-game files w/ the data) for LibraryofCodexes. There are bound to be some mistakes but I've tried to sanatize the text the best I can.

All datasets are in JSON format.

Please refer to the individual series folder for more information regarding each series.

  • Assassin's Creed
  • Baldur's Gate
  • Battlefield
  • Crysis
  • Dead Space
  • Destiny
  • Deus Ex
  • Diablo
  • Dishonored
  • Doom
  • Dragon Age
  • Dying Light
  • Fable
  • Fallout
  • Gears of War
  • Horizon Zero Dawn
  • Kingdoms of Amalur
  • Mass Effect
  • Metroid Prime
  • Middle-Earth
  • Nier
  • Red Dead Redepmtion
  • Resident Evil
  • Star Wars: The Old Republic
  • System Shock
  • The Divison
  • The Elder Scrolls
  • The Last of Us
  • The Witcher
  • Tomb Raider
  • Watch Dogs
  • World of Warcraft

Scientific paper

This repository does not currently have a paper written for it. If you use the data, please use the 'Cite this repository' in the about section.

Games

The datasets were extracted from the following commercial video games. The games and the game assets are copyright the respective game publishers and game developers. If you use the datasets, don't forget to cite the games. ``` @misc{game:starwarsknightsoftheoldrepublic, title = {\emph{Star Wars: Knights of the Old Republic}}, year = {2003}, organization = {LucasArts}, publisher = {LucasArts}, author = {{BioWare}}, Howpublished = {Game [PC]}, Note = {LucasArts, San Francisco, US}, }

@misc{gamesseries:tes, title = {\emph{The Elder Scrolls I-V} and \emph{The Elder Scrolls Online}}, date = {1994/2014}, year = {1994--2014}, organization = {Bethesda Softworks}, publisher = {Bethesda Softworks}, author = {{Bethesda Softworks}}, Howpublished = {Game series [PC]}, Note = {Bethesda Softworks, Rockville, Maryland, US}, } ```

Owner

  • Login: Davis24
  • Kind: user
  • Location: United States

Citation (CITATION.cff)

abstract: This is a collection of video game text data originally generated for Library of Codexes.
authors:
  - family-names: Davis
    given-names: Megan
cff-version: 1.2.0
date-released: "2021-08-27"
message: If you use this software, please cite it using these metadata.
repository-code: "https://github.com/Davis24/video-game-text-dataset"
title: Video Game Text Dataset 

GitHub Events

Total
  • Watch event: 3
Last Year
  • Watch event: 3