Politics of data reuse in machine learning systems

Politics of data reuse in machine learning systems: Theorizing reuse entanglements

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Standard

Politics of data reuse in machine learning systems : Theorizing reuse entanglements. / Thylstrup, Nanna Bonde; Hansen, Kristian Bondo; Flyverbom, Mikkel; Amoore, Louise.

I: Big Data and Society, Bind 9, Nr. 2, 2022.

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Harvard

Thylstrup, NB, Hansen, KB, Flyverbom, M & Amoore, L 2022, 'Politics of data reuse in machine learning systems: Theorizing reuse entanglements', Big Data and Society, bind 9, nr. 2. https://doi.org/10.1177/20539517221139785

APA

Thylstrup, N. B., Hansen, K. B., Flyverbom, M., & Amoore, L. (2022). Politics of data reuse in machine learning systems: Theorizing reuse entanglements. Big Data and Society, 9(2). https://doi.org/10.1177/20539517221139785

Vancouver

Thylstrup NB, Hansen KB, Flyverbom M, Amoore L. Politics of data reuse in machine learning systems: Theorizing reuse entanglements. Big Data and Society. 2022;9(2). https://doi.org/10.1177/20539517221139785

Author

Thylstrup, Nanna Bonde ; Hansen, Kristian Bondo ; Flyverbom, Mikkel ; Amoore, Louise. / Politics of data reuse in machine learning systems : Theorizing reuse entanglements. I: Big Data and Society. 2022 ; Bind 9, Nr. 2.

Bibtex

@article{2e3d697ad64c41a496a960ac176d199b,

title = "Politics of data reuse in machine learning systems: Theorizing reuse entanglements",

abstract = "Policy discussions and corporate strategies on machine learning are increasingly championing data reuse as a key element in digital transformations. These aspirations are often coupled with a focus on responsibility, ethics and transparency, as well as emergent forms of regulation that seek to set demands for corporate conduct and the protection of civic rights. And the Protective measures include methods of traceability and assessments of {\textquoteleft}good{\textquoteright} and {\textquoteleft}bad{\textquoteright} datasets and algorithms that are considered to be traceable, stable and contained. However, these ways of thinking about both technology and ethics obscure a fundamental issue, namely that machine learning systems entangle data, algorithms and more-than-human environments in ways that challenge a well-defined separation. This article investigates the fundamental fallacy of most data reuse strategies as well as their regulation and mitigation strategies that data can somehow be followed, contained and controlled in machine learning processes. Instead, the article argues that we need to understand the reuse of data as an inherently entangled phenomenon. To examine this tension between the discursive regimes and the realities of data reuse, we advance the notion of reuse entanglements as an analytical lens. The main contribution of the article is the conceptualization of reuse that places entanglements at its core and the articulation of its relevance using empirical illustrations. This is important, we argue, for our understanding of the nature of data and algorithms, for the practical uses of data and algorithms and our attitudes regarding ethics, responsibility and regulation.",

keywords = "algorithms, Data reuse, datasets, entanglements, ethics, machine learning",

author = "Thylstrup, {Nanna Bonde} and Hansen, {Kristian Bondo} and Mikkel Flyverbom and Louise Amoore",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2022.",

year = "2022",

doi = "10.1177/20539517221139785",

language = "English",

volume = "9",

journal = "Big Data & Society",

issn = "2053-9517",

publisher = "SAGE Publications",

number = "2",

}

RIS

TY - JOUR

T1 - Politics of data reuse in machine learning systems

T2 - Theorizing reuse entanglements

AU - Thylstrup, Nanna Bonde

AU - Hansen, Kristian Bondo

AU - Flyverbom, Mikkel

AU - Amoore, Louise

N1 - Publisher Copyright: © The Author(s) 2022.

PY - 2022

Y1 - 2022

N2 - Policy discussions and corporate strategies on machine learning are increasingly championing data reuse as a key element in digital transformations. These aspirations are often coupled with a focus on responsibility, ethics and transparency, as well as emergent forms of regulation that seek to set demands for corporate conduct and the protection of civic rights. And the Protective measures include methods of traceability and assessments of ‘good’ and ‘bad’ datasets and algorithms that are considered to be traceable, stable and contained. However, these ways of thinking about both technology and ethics obscure a fundamental issue, namely that machine learning systems entangle data, algorithms and more-than-human environments in ways that challenge a well-defined separation. This article investigates the fundamental fallacy of most data reuse strategies as well as their regulation and mitigation strategies that data can somehow be followed, contained and controlled in machine learning processes. Instead, the article argues that we need to understand the reuse of data as an inherently entangled phenomenon. To examine this tension between the discursive regimes and the realities of data reuse, we advance the notion of reuse entanglements as an analytical lens. The main contribution of the article is the conceptualization of reuse that places entanglements at its core and the articulation of its relevance using empirical illustrations. This is important, we argue, for our understanding of the nature of data and algorithms, for the practical uses of data and algorithms and our attitudes regarding ethics, responsibility and regulation.

AB - Policy discussions and corporate strategies on machine learning are increasingly championing data reuse as a key element in digital transformations. These aspirations are often coupled with a focus on responsibility, ethics and transparency, as well as emergent forms of regulation that seek to set demands for corporate conduct and the protection of civic rights. And the Protective measures include methods of traceability and assessments of ‘good’ and ‘bad’ datasets and algorithms that are considered to be traceable, stable and contained. However, these ways of thinking about both technology and ethics obscure a fundamental issue, namely that machine learning systems entangle data, algorithms and more-than-human environments in ways that challenge a well-defined separation. This article investigates the fundamental fallacy of most data reuse strategies as well as their regulation and mitigation strategies that data can somehow be followed, contained and controlled in machine learning processes. Instead, the article argues that we need to understand the reuse of data as an inherently entangled phenomenon. To examine this tension between the discursive regimes and the realities of data reuse, we advance the notion of reuse entanglements as an analytical lens. The main contribution of the article is the conceptualization of reuse that places entanglements at its core and the articulation of its relevance using empirical illustrations. This is important, we argue, for our understanding of the nature of data and algorithms, for the practical uses of data and algorithms and our attitudes regarding ethics, responsibility and regulation.

KW - algorithms

KW - Data reuse

KW - datasets

KW - entanglements

KW - ethics

KW - machine learning

UR - http://www.scopus.com/inward/record.url?scp=85144219501&partnerID=8YFLogxK

U2 - 10.1177/20539517221139785

DO - 10.1177/20539517221139785

M3 - Journal article

AN - SCOPUS:85144219501

VL - 9

JO - Big Data & Society

JF - Big Data & Society

SN - 2053-9517

IS - 2

ER -

ID: 356953330

Institut for Kunst- og Kulturvidenskab