Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (3.8%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: zhaofz635
  • License: MIT
  • Language: Python
  • Default Branch: main
  • Size: 6.19 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 7 months ago · Last pushed 7 months ago
Metadata Files
Readme License Citation

README.md

A-difficulty-annotated-textbook-corpus


A structured, difficulty-annotated dataset extracted from the open Introduction to Computer Science textbook, designed for educational technology research and NLP applications.

Dataset Highlights

📊 Structured Content
- 1,530 textbook paragraphs with context
- 31 educational diagrams (with image paths)
- 50 mathematical formulas (LaTeX format)
- 5 pedagogical tables

📈 Difficulty Annotations
Every entry is rated from 1 to 10 by CS educators:
1-3 = Beginner | 4-6 = Intermediate | 7-10 = Advanced
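
The banding above can be expressed as a small lookup. This is a minimal sketch, not code from the repository: the function name `difficulty_level` is hypothetical, and it assumes ratings are stored as integers, which the README does not confirm.

```python
def difficulty_level(rating: int) -> str:
    """Map a 1-10 difficulty rating to the band named in the README.

    Hypothetical helper: the dataset's own tooling and field names are
    not documented, so integer ratings are an assumption.
    """
    # Reject anything outside the documented 1-10 scale (bool is excluded
    # explicitly because it is a subclass of int in Python).
    if isinstance(rating, bool) or not isinstance(rating, int):
        raise ValueError(f"rating must be an integer, got {rating!r}")
    if not 1 <= rating <= 10:
        raise ValueError(f"rating must be between 1 and 10, got {rating}")

    if rating <= 3:
        return "Beginner"
    if rating <= 6:
        return "Intermediate"
    return "Advanced"


# Example: label a few illustrative ratings (made-up values, not dataset entries).
for r in (2, 5, 9):
    print(r, difficulty_level(r))
```

Rejecting out-of-range values, rather than clamping them, is a design choice made here so that annotation errors surface early.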

File Structure

Owner

  • Login: zhaofz635
  • Kind: user
  • Location: Japan
  • Company: Kobe University

Citation

@dataset{opentextcs_2023,
  title = {OpenTextCS: A Difficulty-Annotated Computer Science Textbook Dataset},
  author = {F.ZHAO},
  year = {2025},
  url = {https://github.com/yourusername/opentextcs-dataset}
}

GitHub Events

Total
  • Watch event: 1
  • Push event: 5
  • Create event: 1
Last Year
  • Watch event: 1
  • Push event: 5
  • Create event: 1