diskmags

Repository for the Diskmags Catalog

https://github.com/zpd-digital-editions/diskmags

Basic Info
  • Host: GitHub
  • Owner: zpd-digital-editions
  • License: cc-by-4.0
  • Default Branch: main
  • Homepage: https://www.diskmags.de
  • Size: 24.2 MB
Statistics
  • Stars: 4
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 3 years ago · Last pushed about 1 year ago

README.md

The Diskmag Catalog

This project aims to collect as much data as possible about disk magazines (diskmags). This repository contains datasets with information on diskmags gathered from various internet sources. The dataset mw_import_magazines.csv contains the merged data from all of these sources. The sections below explain how to obtain and organize the data for this project.

We also invite you to visit our project site: diskmags.de

1. Data collection

During data collection, the metadata for each magazine found is extracted and stored in a JSON or CSV file. Ideally, this is done with the scripting language Python and libraries such as Scrapy, Selenium or Beautiful Soup. You are free to decide which data models to use for collecting and storing your data.

1.1 Source

Once you have found a page that deals with diskmags, first determine whether metadata can be extracted from it. If the necessary information is there, think about how to collect it. If you want to automate this process, inspect the structure of the page with your browser's developer tools. Note the HTML elements that are of interest to you and copy the path to each of them. Usually these are XPath expressions.
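As a rough illustration of this step, the sketch below applies XPath-style paths to a small, well-formed HTML fragment using Python's standard-library `xml.etree.ElementTree`, which supports only a limited XPath subset. The fragment, its class names, and the function name `extract_magazines` are assumptions made up for this example. Real pages are rarely well-formed XML and usually need a more tolerant parser such as Beautiful Soup or Scrapy's selectors.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment; real sources will look different.
SAMPLE_HTML = """
<html>
  <body>
    <div class="magazine">
      <h2>Example Mag</h2>
      <span class="platform">Amiga</span>
    </div>
    <div class="magazine">
      <h2>Another Mag</h2>
      <span class="platform">C64</span>
    </div>
  </body>
</html>
"""


def extract_magazines(html_text):
    """Return one dict per magazine block found in the fragment."""
    root = ET.fromstring(html_text.strip())
    records = []
    # Path noted from the page structure (assumed for this example).
    for node in root.findall(".//div[@class='magazine']"):
        records.append(
            {
                "title": node.findtext("h2"),
                "platform": node.findtext("span[@class='platform']"),
            }
        )
    return records


if __name__ == "__main__":
    for record in extract_magazines(SAMPLE_HTML):
        print(record)
```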

Currently, data is collected from the following resources:

a) Web:
* Demozoo: https://demozoo.org
* Internet Archive: https://archive.org
* Kultboy: https://www.kultboy.com
* Pouët: https://www.pouet.net
* The C-64 Scene Database (CSDB): https://csdb.dk
* ZX Press: https://zxpress.ru

b) Books:
* Volko, Claus-Dieter (2013). Enzyklopädie der Diskmags, ISBN: 3656295387

1.2 Scrape the data

Now you can start collecting data. If you do not want to collect the data manually, you can use the libraries mentioned above. Write code that extracts the information from the HTML elements you have noted. Extract at least some of the following metadata:
* title
* platform
* origin/country
* language
* publication date
* link
* download link

Save this data in a JSON or CSV file. You can use the files already in the repository as a guide to choosing an appropriate data model. Then upload your file to this repository, naming it as follows: diskmags_[sourcename].[fileformat].
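The saving step could be sketched as follows, using only Python's standard library. The field names in `FIELDS` are one possible spelling of the metadata listed above, and `output_name` and `save_records` are hypothetical helpers written for this example; they are not part of the repository.

```python
import csv
import json
from pathlib import Path

# Assumed field names, derived from the metadata list above.
FIELDS = [
    "title",
    "platform",
    "origin",
    "language",
    "publication_date",
    "link",
    "download_link",
]


def output_name(source_name, file_format):
    """Build a file name following diskmags_[sourcename].[fileformat]."""
    return f"diskmags_{source_name}.{file_format}"


def save_records(records, source_name, file_format, directory="."):
    """Write the records as JSON or CSV and return the resulting path."""
    path = Path(directory) / output_name(source_name, file_format)
    if file_format == "json":
        with open(path, "w", encoding="utf-8") as handle:
            json.dump(records, handle, ensure_ascii=False, indent=2)
    elif file_format == "csv":
        with open(path, "w", encoding="utf-8", newline="") as handle:
            # Fields missing from a record are written as empty strings.
            writer = csv.DictWriter(handle, fieldnames=FIELDS, restval="")
            writer.writeheader()
            writer.writerows(records)
    else:
        raise ValueError(f"unsupported file format: {file_format}")
    return path
```

For instance, `save_records(records, "example", "csv")` would write a file named `diskmags_example.csv` in the current directory.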

2. Data merging

During data merging, you update the main dataset with the information you have collected. You can do this manually or automate the process. The file mw_import_magazines.csv has the following structure:

| Title | Magazine[Language] | Magazine[Origin] | Magazine[Start date] | Magazine[End date] | Magazine[Systems] | Magazine[Issues] |
|-------|--------------------|------------------|----------------------|--------------------|-------------------|------------------|

2.1 Case 1: Data are already present in the dataset

If you have collected information on a diskmag that is already present in the dataset, update the corresponding row. If a diskmag has the same name but a different platform (column Magazine[Systems]), it is most likely a completely different diskmag.

2.2 Case 2: Data are not present in the dataset

If the diskmags you have found are not present in the dataset, create a new entry for each of them. Make sure that all issues of the same magazine are added as a single entry.
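If you automate the merge, the two cases above could be handled roughly as in the following sketch. It assumes that an entry is identified by its title together with the Magazine[Systems] value, compared case-insensitively and as whole strings; this matching rule and the function names are assumptions for illustration, and real data may need looser matching (for example when one magazine lists several systems).

```python
COLUMNS = [
    "Title",
    "Magazine[Language]",
    "Magazine[Origin]",
    "Magazine[Start date]",
    "Magazine[End date]",
    "Magazine[Systems]",
    "Magazine[Issues]",
]
KEY_COLUMNS = ("Title", "Magazine[Systems]")


def entry_key(row):
    """Identify an entry by title and system, ignoring case (assumed rule)."""
    return tuple(row.get(column, "").casefold() for column in KEY_COLUMNS)


def merge_record(dataset, new_row):
    """Update a matching row (case 1) or append a new one (case 2)."""
    for row in dataset:
        if entry_key(row) == entry_key(new_row):
            # Case 1: overwrite only with non-empty values, keep the key as is.
            for column in COLUMNS:
                if column not in KEY_COLUMNS and new_row.get(column):
                    row[column] = new_row[column]
            return "updated"
    # Case 2: same title on another system counts as a different diskmag.
    dataset.append({column: new_row.get(column, "") for column in COLUMNS})
    return "added"
```

The return value only reports which case applied, which can be handy when writing the commit description.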

2.3 Final steps

Once you have finished merging the data, sort the dataset alphabetically by title (column Title). Use a case-insensitive sort so that upper and lower case do not affect the order. Upload the updated file to this repository and explain in the commit description what you have changed.
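A case-insensitive sort by title can be sketched in a few lines of Python. The function name `sort_by_title` is hypothetical, and the rows are assumed to be dictionaries keyed by the column names shown above (for example as read with `csv.DictReader`).

```python
def sort_by_title(rows):
    """Return the rows sorted by Title, ignoring upper and lower case."""
    # str.casefold() normalizes case more thoroughly than str.lower().
    return sorted(rows, key=lambda row: row["Title"].casefold())


if __name__ == "__main__":
    sample = [{"Title": "beta"}, {"Title": "Alpha"}, {"Title": "Gamma"}]
    print([row["Title"] for row in sort_by_title(sample)])
```

A plain `sorted` on the raw titles would place all titles starting with an uppercase letter before those starting with a lowercase one, which is the problem the case-insensitive key avoids.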

Mediawiki

The structure for the MediaWiki: [image not included]

The correlated data will be published here soon.

Owner

  • Name: Digital Editions Department - Centre for Philology and Digitality (ZPD), University of Würzburg
  • Login: zpd-digital-editions
  • Kind: organization

Citation (CITATION.cff)

cff-version: 1.2.0
title: Diskmags
message: If you use this software, please cite it using these metadata.
type: dataset
authors:
  - given-names: Torsten 
    family-names: Roeder
    name-particle: Dr.
    email: torsten.roeder@uni-wuerzburg.de
    affiliation: University of Würzburg
  - given-names: Johannes
    family-names: Leitgeb
    email: johannes.leitgeb@stud-mail.uni-wuerzburg.de
    affiliation: University of Würzburg
    orcid: 'https://orcid.org/0009-0006-2058-9133'
  - given-names: Yannik
    family-names: Herbst
    email: yannik.herbst@uni-wuerzburg.de
    affiliation: University of Würzburg
    orcid: 'https://orcid.org/0000-0002-6547-9599'
  - given-names: Madlin
    family-names: Marenec
    email: madlin@mrsmuseum.de
    affiliation: University of Würzburg
    orcid: 'https://orcid.org/0009-0003-7434-827X'
  - given-names: Tomash
    family-names: Shtohryn
    email: tomash.shtohryn@stud-mail.uni-wuerzburg.de
    affiliation: University of Würzburg
    orcid: 'https://orcid.org/0009-0000-4597-603X'
repository-code: 'https://github.com/zpd-digital-editions/Diskmags.git'
url: 'https://www.diskmags.de'
abstract: |
  Diskmags is a dataset of computer magazines from the 1980s and 1990s. It is a project of the Center for Philology and Digitality at the University of Würzburg. The project is funded by the German Research Foundation (DFG) and the Free State of Bavaria. Diskmags is a collaborative project of the Chair of Digital Humanities and the Chair of Medieval German Literature at the University of Würzburg. The project is part of the research focus "Digital Philology" at the University of Würzburg.
keywords:
  - dataset
  - computer magazines
  - diskmags
  - retrocomputing
version: 1.0
date-released: '2022-01-01'
