Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare. / Ratner, Helene Friis; Thylstrup, Nanna Bonde.

I: AI and Society, 2024.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Ratner, HF & Thylstrup, NB 2024, 'Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare', AI and Society. https://doi.org/10.1007/s00146-024-01920-4

APA

Ratner, H. F., & Thylstrup, N. B. (Accepteret/In press). Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare. AI and Society. https://doi.org/10.1007/s00146-024-01920-4

Vancouver

Ratner HF, Thylstrup NB. Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare. AI and Society. 2024. https://doi.org/10.1007/s00146-024-01920-4

Author

Ratner, Helene Friis ; Thylstrup, Nanna Bonde. / Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare. I: AI and Society. 2024.

Bibtex

@article{c33acf0ef85d44e7a308bde656352f25,
title = "Citizens{\textquoteright} data afterlives: Practices of dataset inclusion in machine learning for public welfare",
abstract = "Public sector adoption of AI techniques in welfare systems recasts historic national data as resource for machine learning. In this paper, we examine how the use of register data for development of predictive models produces new {\textquoteleft}afterlives{\textquoteright} for citizen data. First, we document a Danish research project{\textquoteright}s practical efforts to develop an algorithmic decision-support model for social workers to classify children{\textquoteright}s risk of maltreatment. Second, we outline the tensions emerging from project members{\textquoteright} negotiations about which datasets to include. Third, we identify three types of afterlives for citizen data in machine learning projects: (1) data afterlives for training and testing the algorithm, acting as {\textquoteleft}ground truth{\textquoteright} for inferring futures, (2) data afterlives for validating the algorithmic model, acting as markers of robustness, and (3) data afterlives for improving the model{\textquoteright}s fairness, valuated for reasons of data ethics. We conclude by discussing how, on one hand, these afterlives engender new ethical relations between state and citizens; and how they, on the other hand, also articulate an alternative view on the value of datasets, posing interesting contrasts between machine learning projects developed within the context of the Danish welfare state and mainstream corporate AI discourses of the bigger, the better.",
keywords = "Data afterlives, Dataset negotiations, Machine learning, Welfare state",
author = "Ratner, {Helene Friis} and Thylstrup, {Nanna Bonde}",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
doi = "10.1007/s00146-024-01920-4",
language = "English",
journal = "AI and Society",
issn = "0951-5666",
publisher = "Springer",

}

RIS

TY - JOUR

T1 - Citizens’ data afterlives: Practices of dataset inclusion in machine learning for public welfare

AU - Ratner, Helene Friis

AU - Thylstrup, Nanna Bonde

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024

Y1 - 2024

N2 - Public sector adoption of AI techniques in welfare systems recasts historic national data as resource for machine learning. In this paper, we examine how the use of register data for development of predictive models produces new ‘afterlives’ for citizen data. First, we document a Danish research project’s practical efforts to develop an algorithmic decision-support model for social workers to classify children’s risk of maltreatment. Second, we outline the tensions emerging from project members’ negotiations about which datasets to include. Third, we identify three types of afterlives for citizen data in machine learning projects: (1) data afterlives for training and testing the algorithm, acting as ‘ground truth’ for inferring futures, (2) data afterlives for validating the algorithmic model, acting as markers of robustness, and (3) data afterlives for improving the model’s fairness, valuated for reasons of data ethics. We conclude by discussing how, on one hand, these afterlives engender new ethical relations between state and citizens; and how they, on the other hand, also articulate an alternative view on the value of datasets, posing interesting contrasts between machine learning projects developed within the context of the Danish welfare state and mainstream corporate AI discourses of the bigger, the better.

AB - Public sector adoption of AI techniques in welfare systems recasts historic national data as resource for machine learning. In this paper, we examine how the use of register data for development of predictive models produces new ‘afterlives’ for citizen data. First, we document a Danish research project’s practical efforts to develop an algorithmic decision-support model for social workers to classify children’s risk of maltreatment. Second, we outline the tensions emerging from project members’ negotiations about which datasets to include. Third, we identify three types of afterlives for citizen data in machine learning projects: (1) data afterlives for training and testing the algorithm, acting as ‘ground truth’ for inferring futures, (2) data afterlives for validating the algorithmic model, acting as markers of robustness, and (3) data afterlives for improving the model’s fairness, valuated for reasons of data ethics. We conclude by discussing how, on one hand, these afterlives engender new ethical relations between state and citizens; and how they, on the other hand, also articulate an alternative view on the value of datasets, posing interesting contrasts between machine learning projects developed within the context of the Danish welfare state and mainstream corporate AI discourses of the bigger, the better.

KW - Data afterlives

KW - Dataset negotiations

KW - Machine learning

KW - Welfare state

UR - http://www.scopus.com/inward/record.url?scp=85190286084&partnerID=8YFLogxK

U2 - 10.1007/s00146-024-01920-4

DO - 10.1007/s00146-024-01920-4

M3 - Journal article

AN - SCOPUS:85190286084

JO - AI and Society

JF - AI and Society

SN - 0951-5666

ER -

ID: 389845152