analyse_data

Analyse PTA data with Python

https://github.com/patristictextarchive/analyse_data

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.6%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Analyse PTA data with Python

Basic Info
  • Host: GitHub
  • Owner: PatristicTextArchive
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 66.9 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 1
  • Releases: 0
Created about 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme Citation

README.md

Analyse PTA with Jupyter Notebooks

This repository contains several Jupyter Notebooks to explore the Patristic Text Archive beyond the tools provided in the web frontend.

Open In Colab

This repository contains the following notebooks

  • Open In Colab pta_nlp_cld.ipynb using CLD
  • Open In Colab pta_nlp_spacy.ipynb using grcproielsm (cf. greCy. Ancient Greek models for spaCy)
  • Open In Colab keywords-in-context.ipynb
  • Open In Colab collocations.ipynb
  • Open In Colab analyse_corpus.ipynb (using TF-IDF)
  • Open In Colab biblical_quotations.ipynb (using severian_quotes.json and pta_metadata repository)
  • convert_pta_totext.ipynb Helper notebook to convert PTA-XML to a csv file
  • lemmatize_all.ipynb Helper notebook to lemmatize all text generated by convertptatotext.ipynb

Data for use in notebooks

in folder data

  • DejaVuSans.ttf (to be able to use Greek Extended in wordclouds)
  • severian_plaintext.csv as generated by convert_pta_totext.ipynb
  • severian_plaintext_lemmatized.csv as generated by lemmatize_all.ipynb
  • severian_quotes.json as generated by convert_pta_totext.ipynb

in folder assets

  • SpaCy model grc_proiel_sm-3.7-py3-none-any.whl (downloaded from https://huggingface.co/Jacobo/grcproielsm and filename adjusted to adhere to format convention of pip > 24.1)

Owner

  • Name: Patristic Text Archive
  • Login: PatristicTextArchive
  • Kind: organization
  • Email: annette.von_stockhausen@bbaw.de
  • Location: Berlin

Open access digital archive of ancient Christian texts

Citation (citation.cff)

cff-version: 1.2.0
title: >-
  Analyse PTA with Jupyter Notebooks
message: >-
  If you use these files, please cite it using the
  metadata from this file.
type: dataset
authors:
  - given-names: Annette
    name-particle: von
    family-names: Stockhausen
    email: annette.von_stockhausen@bbaw.de
    affiliation: >-
      Berlin-Brandenburgische Akademie der
      Wissenschaften
    orcid: 'https://orcid.org/0000-0001-5382-6322'
license: CC-BY-SA-4.0

GitHub Events

Total
  • Issues event: 1
  • Push event: 2
  • Create event: 2
Last Year
  • Issues event: 1
  • Push event: 2
  • Create event: 2

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 1
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • pharos-alexandria (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Dependencies

requirements.txt pypi