Visual Exploration of Indirect Bias in Language Models

Louis-Alexandre, Judith; Waldner, Manuela

Visual Exploration of Indirect Bias in Language Models

Files

001-005.pdf (512.94 KB)

1026-mm.mp4 (49.06 MB)

1026-mm.pdf (122.02 KB)

Date

2023

Authors

Louis-Alexandre, Judith
Waldner, Manuela

Publisher

The Eurographics Association

Abstract

Language models are trained on large text corpora that often include stereotypes. This can lead to direct or indirect bias in downstream applications. In this work, we present a method for interactive visual exploration of indirect multiclass bias learned by contextual word embeddings. We introduce a new indirect bias quantification score and present two interactive visualizations to explore interactions between multiple non-sensitive concepts (such as sports, occupations, and beverages) and sensitive attributes (such as gender or year of birth) based on this score.

CCS Concepts: Human-centered computing -> Visual analytics; Computing methodologies -> Natural language processing

        @inproceedings{10.2312:evs.20231034
,
booktitle = {EuroVis 2023 - Short Papers
},
editor = {Hoellt, Thomas and 
Aigner, Wolfgang and 
Wang, Bei
},
title = {{Visual Exploration of Indirect Bias in Language Models
}},
author = {Louis-Alexandre, Judith and 
Waldner, Manuela
},
year = {2023
},
publisher = {The Eurographics Association
},
ISBN = {978-3-03868-219-6
},
DOI = {10.2312/evs.20231034
}
}

URI

https://doi.org/10.2312/evs.20231034
https://diglib.eg.org:443/handle/10.2312/evs20231034

Collections

EuroVisShort2023

Full item page