introduction-to-python

Complete introductory Python course for data scientists

https://github.com/tinajakob/introduction-to-python

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.9%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Complete introductory Python course for data scientists

Basic Info
  • Host: GitHub
  • Owner: tinajakob
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 9.98 MB
Statistics
  • Stars: 2
  • Watchers: 3
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed over 2 years ago
Metadata Files
Readme Citation

README.md

Welcome to this introductory Python course for data scientists!

Content

This is a complete Python course based on interactive Jupyter notebooks. It is suitable for self-study or teaching and covers the following topics:

  • Part 1: Introduction to the Python syntax (3 lectures)
    • Basics and object types
    • Control flow
    • Functions and methods
  • Part 2: Python for data analysis (5 lectures)
    • Introduction to numpy and pandas
    • Importing, inspecting and cleaning data with pandas
    • Data wrangling with pandas
    • OLS regression, plotting with matplotlib
    • Advanced plotting (e.g., maps with geopandas)
  • Part 3: Web scraping (3 lectures)
    • Basics of HTML, scraping static web pages with requests and BeautifulSoup
    • Scraping dynamic web pages with selenium
    • Working with APIs and JSON
  • Part 4: Working with text (2 lectures)
    • String methods and regular expressions
    • Text analysis

Each lecture consists of a tutorial and an exercise. Please contact me for solutions.

Setup

If you add this folder to Google Drive, you can run everything with Google Colab, and you won't even need to install Python on your computer. For more details, see the file "Setup-of-programming-environment.pdf" provided above.

Acknowledgements

I am grateful to Ben Jann, Sebastian Heinrich, Rudolf Farys, and Nadja Vögtle for their helpful additions, comments, and corrections. The first 3 lectures of this course are inspired by "A Whirlwind Tour of Python" .

License

All materials are distributed under the MIT License.

Owner

  • Login: tinajakob
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.1.0
message: "If you use these materials, please cite it as below."
authors:
  - family-names: Jakob
    given-names: Martina
    orcid: 0000-0002-1052-804X
  - family-names: Jann
    given-names: Ben
    orcid: 0000-0001-9855-1967
title: "Introduction to Python for Data Scientists"
version: 1.0.0
date-released: 2023-11-07

GitHub Events

Total
Last Year