Python for Data Science – A crash course

Deadline for application

09 August 2024

Objective

This course provides an introduction to the application of Python for Data Science and Machine Learning. In recent years, Data science and Machine Learning have had a significant impact on the way Central Banks perform tasks such as analysing markets, assessing risk, anti-money laundering and banking supervision. Python, as one of the leading programming languages for Data Science, plays a crucial role in this area.

In this course, participants will learn how to use Python to analyse data, create machine learning models and support business decisions. A fairly basic introduction to Python will be provided, and following an introduction to the principles, we will move on quite quickly to the application of Python for Data Science and Machine Learning. NumPy, Pandas, Scikit-learn and Matplotlib are the main libraries used in the course, with others appearing  more sporadically. 

Programming tasks and smaller projects will be carried out in practical exercises, independently or in groups, and participants will be encouraged to collaborate with others.

At the end of the course, participants should be able to use Python to analyse data, build models and autonomously extend the knowledge gained so that it can be transferred to tasks and issues in their central banks. To this end, they will also learn where to find resources on the internet that will allow them to independently discover and apply further approaches and methods.

Contents

Introduction to Python

  • Basics and fundamental concepts
  • Data structures in Python
  • Data visualisation with Python

Data preparation and analysis

  • Data retrieval and cleaning
  • Descriptive statistics in Python

Machine Learning

  • Regression and Classification
  • Artificial Neural Networks
  • Machine learning for central bank tasks

Case studies and projects

  • Practical application of data science in central banks
  • Project work and programming exercises

Starting on the second day, there will be a Q & A slot before every session, during which participants will have the opportunity to discuss the content of the previous day and the challenges they encountered with the practical exercises.

Target group

The course is designed for staff working in information technology, statistics or research departments in central banks and regulatory and supervisory authorities with an interest in applying machine learning methods with Python. It is not aimed at those who already have extensive and advanced knowledge of Python.

Indeed, little prior knowledge is expected, and the necessary Python skills are covered at the beginning, making the course suitable for previously inexperienced staff. It may be interesting for those who wish to switch from the programming language R to Python. 

Please note that we will not discuss the deeper statistical backgrounds of the methods used in any great depth.

Technical requirements

Computer with microphone, camera, speakers, or headphones; an up-to-date internet browser.

Participants should be able to work on their own computers with an existing Python installation (e.g. Anaconda). If necessary, assistance with the installation will be given in advance. Two screens will make participation and coursework much easier.

Registration deadline expired.