https://github.com/chrisgrieser/apuz-gender-analyzer
Analysis of the genders of the authors at the German journal "Aus Politik und Zeitgeschichte" (APuZ)
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.4%) to scientific vocabulary
Keywords
Repository
Analysis of the genders of the authors at the German journal "Aus Politik und Zeitgeschichte" (APuZ)
Basic Info
Statistics
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Name Gender analyzer
Analysis of names by gender.
Table of Content
Usage
Setup
- Clone the repo.
- Run
make initin the terminal to set up the virtual environment and install the dependencies - You might need to restart your editor.
Technical procedure
- Scrap names from
APuZ. - Get list of names.
- Parse names into required format.
- Assign gender to every name. https://genderize.io/
- Calculate percentage of genders.
- Display results.
Technical Implementation
- Background-Service: Genderize to identify gender of names
- API rate limit without subscription: 100 calls per day, 10 names per call

To-do
Short list
- [ ] Reduce number of API calls by caching first name data
- [ ] Analyze all Annual Registers (instead of small sample)
- [ ] Analyze data by year
- [ ] Plot data in a time series graph
- [ ] Detect and factor in co-authorship
- [ ] Investigate correlation between topics and gender of the author
Roadmap (long term)
- [ ] Scrape all data before 2011
- [ ] Publish new data periodically via Mastodon
- [ ] Describe project results in a blog post
Data
Articles
txt
https://www.bpb.de/shop/zeitschriften/apuz/1953-2023/520827/stalins-tod-und-das-ende-der-allmacht/
https://www.bpb.de/shop/zeitschriften/apuz/1953-2023/520821/editorial/
Issues
```txt https://www.bpb.de/shop/zeitschriften/apuz/archiv/524414/apuz-47-1953/ https://www.bpb.de/shop/zeitschriften/apuz/archiv/524422/apuz-49-1953/ https://www.bpb.de/shop/zeitschriften/apuz/archiv/524430/apuz-50-1953/ https://www.bpb.de/shop/zeitschriften/apuz/archiv/524436/apuz-51-1953/ https://www.bpb.de/shop/zeitschriften/apuz/archiv/524443/apuz-1-1954/ https://www.bpb.de/shop/zeitschriften/apuz/archiv/524449/apuz-2-1954/ https://www.bpb.de/shop/zeitschriften/apuz/archiv/525715/aus-politik-und-zeitgeschichte-1959/
https://www.bpb.de/shop/zeitschriften/apuz/archiv/531460/apuz-2-1980/
2010 https://www.bpb.de/shop/zeitschriften/apuz/32275/armut-in-deutschland/
2012 https://www.bpb.de/shop/zeitschriften/apuz/151293/weltuntergang/
https://www.bpb.de/shop/zeitschriften/apuz/diskurskultur-2023/ ```
Annual Registers
only available for 2011-2022
txt
https://www.bpb.de/system/files/dokument_pdf/APuZ_2011-Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2012-Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2013-Jahresregister_online_0.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2014_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2015_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2016_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2017_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2018_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2019_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2020_Jahresregister_online.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2021_Jahresregister_Umbruch_v3.pdf
https://www.bpb.de/system/files/dokument_pdf/APuZ_2022-Jahresregister_online.pdf
Sitemaps
Are apparently incomplete, therefore not using this.
bash
curl "https://www.bpb.de/sitemap.xml?page=1" > sitemap1.xml
curl "https://www.bpb.de/sitemap.xml?page=2" > sitemap2.xml
Owner
- Name: Chris Grieser
- Login: chrisgrieser
- Kind: user
- Location: Berlin, Germany
- Company: Technical University of Berlin
- Website: https://chris-grieser.de/
- Repositories: 189
- Profile: https://github.com/chrisgrieser
Researcher in sociology & software developer
GitHub Events
Total
- Push event: 2
Last Year
- Push event: 2
Committers
Last synced: 11 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Chris Grieser | 7****r | 33 |
| jk | h****e | 3 |
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- genderize *