An Introduction to Responsible Machine Learning#

by Hilde Weerts

Last updated: 2025-02-17

Note

This book is still very much a work in progress. If you have suggestions, questions, or comments, please feel free to open an issue on the Github repository.

With the advent of large-scale data collection, the toolkit of a data scientist has proven to be a powerful way to make products and processes faster, cheaper, and better. Many data science applications make use of machine learning algorithms: algorithms that build mathematical models by ‘learning’ from data. Nowadays, machine learning models are integrated into many computer systems: from music recommendations to automated fraud detection, facial recognition systems, and personalized medicine assistants. These systems can provide benefits but are not without risks.

A responsible data scientist understands the potential harm of machine learning models and how to mitigate the risks. This online book provides a practical introduction to the nascent field of responsible machine learning. The goal of this book is to provide a practical approach, building a bridge between philosophical, social, and technical perspectives.

Fairness

Data-driven systems can inherit the existing prejudices embedded in society, resulting in systematic discrimination or other harms.

learn more

Code examples

The code examples in this book are generated with the following package versions.

numpy==1.24.4
pandas==2.0.3
sklearn==1.2.1
matplotlib==3.8.3
fairlearn==0.10.0
torch==2.2.1

Citing this book

To cite this book, please use the following bibtex entry:

@book{weerts2024,
  title = {An Introduction to Responsible Machine Learning},
  author = {Hilde Weerts},
  year = {2024},
  url = {https://hildeweerts.github.io/responsiblemachinelearning/}
}

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.