Basics for the use of Python in Official Statistics- 2025 | Eurostat CROS

Basics for the use of Python in Official Statistics
Course Leader	Christian Kauth
Target Group	Statistical production units and methodologists of NSIs.
Entry Qualifications	Sound command of English. Participants should be able to make short interventions and to actively participate in discussions
Objective(s)	The main objectives of the course are: Introducing the participants to the Python language and ecosystem Make the participants able to read and write basic Python programs for common data processing tasks (data analysis, exploration and visualization)
Contents	FIRST PART – Theory (3 days) Python fundamentals through examples • Why Python? History, needs, advantages, disadvantages, … • Python data structures o data types and variables, mutable and immutable, o strings: formatting, slicing, concatenation, repetition, … o lists, tuples: indexing, slicing, concatenation, repetition, shallow and deep copy, … o dictionaries, sets: accessing, merging, iterating, … o args and kwargs. • Basics programming o core syntax and semantics, o comments and documentation, o flow control: conditional statements, iterative constructs (loops), sequences and enumeration… o arithmetic and comparison operators, True and False, o iterators and generators, o list comprehensions, o functions, parameter passing (arbitrary, optional, keyword parameters), global and local variables, returning values o file management, o lambda, filter, reduce, map and zip operators. • Brief tour of standard libraries. Programming in Python through examples • Using online resources (e.g., python.org, stackoverflow, realpython.com, etc…) • Running a Python script o interpreter and compiler, understanding modules and packages, o importing modules, o interactive shell, executable and script files. • Programming in Python o procedural vs. modular programming, o namespaces and scopes, o memorization and decoration, o general introduction to Object Oriented programming in Python: inheritance, polymorphism, encapsulation, o class and instances, methods, instance attributes and properties, o memory management and garbage collection, o error and exception handling, o testing, debugging and logging, o duck typing and monkey patching. Advanced Python programming through projects • Virtual environments and packages o managing packages with pip, o creating virtual environments with pipenv. • Python and Jupyter notebooks • Introduction to data analytics o basics with numpy and scipy: math, matrices, arrays, o data fetching from open API (e.g., Eurostat REST API), o data handling with pandas: times series, dataframes, … o data visualisation with matplotlib, o basic statistical analysis and machine learning methods with scikit-learn. Dealing with databases: SQLite SECOND PART – Practice (2 days):* • Quick recap of the first part • Question and answers • Bring your own project
Expected Outcome	The participants should have a good understanding of Python language basics and its ecosystem in order to proficiently use it for Official Statistics purposes. Familiarity with the syntax of Python Knowledge about the individual aspects of a data processing pipeline: reading a file, processing data, modelling, aggregation, visualization and saving results. Experience with creation, manipulation and conversion of common data structures Experience with writing functions and using (pre-existent) functions Basic knowledge of important packages like Numpy, Pandas, Matplotlib, Seaborn, Scikit-learn, Geopandas
Training Methods	The course consists of alternately: Interactive presentations to introduce topics Exercises (learning by doing) For the practical hands-on parts of the course Jupyter notebook will be used and there will be a discussion regarding possible solutions to the exercises that will be assigned to participants. The participants will be stimulated to write Python code from scratch under the tutors supervision.
Required Reading	None
Suggested Reading	Python Introduction https://www.w3schools.com/python/python_intro.asp Python official site https://www.python.org/about/gettingstarted/ Python 3 Installation & Setup https://realpython.com/installing-python/ Jupyter Notebook: An Introduction https://realpython.com/jupyter-notebook-introduction/ Python IDEs and Code Editors https://realpython.com/python-ides-code-editors-guide/ Python Statistics Fundamentals: How to Describe Your Data https://realpython.com/python-statistics/ Using Pandas and Python to Explore Your Dataset https://realpython.com/pandas-python-explore-dataset/
Required Preparation	Register for free to the Google Colaboratory https://colab.research.google.com/ (with personal login)
Trainer(s)/ Lecturer(s)	Christian Kauth (Independent expert)

Practical Information

When

Duration

Where

Organiser

Application via National Contact Point

24 – 27 March 2025

4 days

Cologne, Germany

ICON-INSTITUT Public Sector

Deadline:

02.12.2024