47 min

Julian Posada, The Coloniality Of Data Work For Machine Learning Ethics of AI in Context

    • Technology

Many research and industry organizations outsource data generation, annotation, and algorithmic verification—or data work—to workers worldwide through digital platforms. A subset of the gig economy, these platforms consider workers independent users with no employment rights, pay them per task, and control them with automated algorithmic managers. This talk explores how the coloniality of data work is characterized by an extractivist method of generating data that privileges profit and the epistemic dominance of those in power. Social inequalities are reproduced through the data production process, and local worker communities mitigate these power imbalances by relying on family members, neighbours, and colleagues online. Furthermore, management in outsourced data production ensures that workers’ voices are suppressed in the data annotation process through algorithmic control and surveillance, resulting in datasets generated exclusively by clients, with their worldviews encoded in algorithms through training.

Julian Posada
Faculty of Information
University of Toronto

Many research and industry organizations outsource data generation, annotation, and algorithmic verification—or data work—to workers worldwide through digital platforms. A subset of the gig economy, these platforms consider workers independent users with no employment rights, pay them per task, and control them with automated algorithmic managers. This talk explores how the coloniality of data work is characterized by an extractivist method of generating data that privileges profit and the epistemic dominance of those in power. Social inequalities are reproduced through the data production process, and local worker communities mitigate these power imbalances by relying on family members, neighbours, and colleagues online. Furthermore, management in outsourced data production ensures that workers’ voices are suppressed in the data annotation process through algorithmic control and surveillance, resulting in datasets generated exclusively by clients, with their worldviews encoded in algorithms through training.

Julian Posada
Faculty of Information
University of Toronto

47 min

Top Podcasts In Technology

nFactorial Podcast
nFactorial school
Lex Fridman Podcast
Lex Fridman
GEMBA PODCAST
Маргулан Сейсембаев
Запуск завтра
libo/libo
Радио-Т
Umputun, Bobuk, Gray, Ksenks, Alek.sys
Podlodka Podcast
Егор Толстой, Стас Цыганов, Екатерина Петрова и Евгений Кателла