https://github.com/alex-ip/geolocation-tools-workshop

Geolocation tools for text analysis and a related workshop notebook

https://github.com/alex-ip/geolocation-tools-workshop

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.0%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Geolocation tools for text analysis and a related workshop notebook

Basic Info
  • Host: GitHub
  • Owner: alex-ip
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 2.22 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Fork of Australian-Text-Analytics-Platform/geolocation-tools-workshop
Created over 3 years ago · Last pushed over 3 years ago

https://github.com/alex-ip/geolocation-tools-workshop/blob/main/

# geolocation-tools-workshop

_Tools for geolocation analysis of textual data with related workshop notebook_

This repository relates to a workshop that was presented at the 2022 ASA National Annual Conference for the Australian Society of Archivists (ASA) on 20 Oct 2022. The workshop related to how to use software to recognise placenames in historical documents and then use online gazetteers to determine what known locations the placenames correspond to, and gather related geolocation data like coordinates.

This site provides a combination of slides and Jupytr notebooks, along with audio presentations and explanations. 

* _Geolocating Australian Historical Resources: Finding placenames and locations with gazetteers_ (Workshop slides)
* _[Introduction to Named Entity Recognition with spaCy](https://github.com/Australian-Text-Analytics-Platform/geolocation-tools-workshop/blob/abb322e16ca6187a74669ebc7722a7fa08c52c1b/notebooks/spacy_ner_introduction.ipynb)_ (Jupytr Python notebook)  [![Binder](https://mybinder.org/badge_logo.svg)](https://binderhub.atap-binder.cloud.edu.au/v2/gh/Australian-Text-Analytics-Platform/geolocation-tools-workshop/HEAD?labpath=notebooks%2Fspacy_ner_introduction.ipynb)
* _[ATAP Notebook for the Geolocation project](https://github.com/Australian-Text-Analytics-Platform/geolocation-tools-workshop/blob/abb322e16ca6187a74669ebc7722a7fa08c52c1b/notebooks/atap_geolocation_workshop.ipynb)_ (Jupytr Python notebook)  [![Binder](https://mybinder.org/badge_logo.svg)](https://binderhub.atap-binder.cloud.edu.au/v2/gh/Australian-Text-Analytics-Platform/geolocation-tools-workshop/HEAD?labpath=notebooks%2Fatap_geolocation_workshop.ipynb)

This was developed as part of the [ATAP project](#section-atap).

## Using notebooks on cloud services  

The above links to the [Binder](https://mybinder.org/) service enable you to load the notebook in a online cloud environment, rather than having to install the software on your own computer (it might take a little while to load). This is a free service, but note that cloud sessions will close if you stop using the notebooks, and no data will be saved. Make sure you download any changed notebooks or harvested data that you want to save.

To execute each stage of a notebook, click on the "run" triangle symbol to execute the command cell you have selected. The cells are sequential, so you must have previously executed all previous cells in the correct order.

## Australian Text Analytics Platform 

The [Australian Text Analytics Platform (ATAP)](https://www.atap.edu.au) is an open source environment that provides researchers with tools and training for analysing, processing, and exploring text. This includes using a [range of resources](https://www.atap.edu.au/resources), including the [Language Technology and Data Analysis Laboratory (LADAL)](https://slcladal.github.io/) which aims to help develop computational and digital skills by providing information and practical, hands-on tutorials on data and text analytics as well as on statistical methods relevant for language research. 

The ATAP projects [received investment](https://doi.org/10.47486/PL074) from the Australian Research Data Commons (ARDC). The ARDC is funded by the National Collaborative Research Infrastructure Strategy (NCRIS).

![Logos of ARDC and NCRIS](https://user-images.githubusercontent.com/12245823/192428197-a7cd7d8c-2da4-42be-9bf9-c22b4e767af4.png)

Owner

  • Name: Alex Ip
  • Login: alex-ip
  • Kind: user
  • Location: Canberra
  • Company: @aarnet

GitHub Events

Total
Last Year