phd-students-data-analysis
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.2%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: ShaddAhmed14
- License: mit
- Language: Jupyter Notebook
- Default Branch: main
- Size: 223 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
The number of PhD student in Germany Analysis using Python
RSE Summer 2024 Self-Project # 1.
The Project will tackle the following questions regarding the number of PhD students in Germany. - How do the number of students change over the last 4 years? - How does this change based on nationality, gender and type of courses?
Index
Problem Description?
Being a Masters in Data Science student myself, I was interested in the statistics of higher education students. Analysing this data will provide us with several insighs which we can use to dive deeper and research questions such as:
- Which courses do foreigners usually opt for and how to market them better.
- Which courses are male/female dominant.
- Why are some courses more popular than the others? Do we need better marketing? or is it a language barrier issue?
How to get started and Requirements
Download the repository and have ipython as well as matplotlib and pandas installed. Test it using Python 3.11.3 - I do not give gurantees that it works with older versions.
Running the Program
Copy the ipyton/ Jupyter notebook and open on your preferable editor. Run the notebook to view results.
Program files are stored in the src folder. Further documentation can be found in the description.txt
Dataset
The Dataset is from Genesis, a statistical data service provided by the German government. Dataset used is GENESIS-Tabelle: 21352-0003, Statistics of doctoral students. I have created 2 csv for this: The first has course groups only while the 2nd has all the courses. - Data is recorded from 2019 - 2022. - Main header has Year, Course Group/ Course, German, Foreigner and Total Columns. Where German, Foreigner and Total refer to the number of students. - 2nd header has Male and Female column for the German, Foreigner and Total Columns.
MIT License
Citation
Owner
- Login: ShaddAhmed14
- Kind: user
- Repositories: 2
- Profile: https://github.com/ShaddAhmed14
Citation (CITATION.cff)
cff-version: 1.0.0 message: "If you use this software, please cite it as below." authors: - family-names: "Shaikh" given-names: "Sharjeel Ahmed" title: "The number of PhD student in Germany Analysis using Python" version: 1.0 date-released: 2024-05-12 url: "https://gitup.uni-potsdam.de/shaikh1/rse_24_825121"