Taking disagreements into consideration: human annotation variability in privacy policy analysis

Authors

  • Tian Wang University of Illinois Urbana-Champaign
  • Yuanye Ma University of Illinois Discovery Partners Institute
  • Catherine Blake University of Illinois Urbana-Champaign
  • Masooda Bashir University of Illinois Urbana-Champaign
  • Ryan Wang University of Illinois Urbana-Champaign

DOI:

https://doi.org/10.47989/ir30iConf47581

Keywords:

privacy policy, annotator disagreement, natural language processing, machine learning, human label variation

Abstract

Introduction. Privacy policies inform users about data practices but are often complex and difficult to interpret. Human annotation plays a key role in understanding privacy policies, yet annotation disagreements highlight the complexity of these texts. Traditional machine learning models prioritize consensus, overlooking annotation variability and its impact on accuracy.

Method. This study examines how annotation disagreements affect machine learning performance using the OPP-115 corpus. It compares majority vote and union methods with alternative strategies to assess their impact on policy classification.

Analysis. The study evaluates whether increasing annotator consensus improves model effectiveness and if disagreement-aware approaches yield more reliable results.

Results. Higher agreement levels improve model performance across most categories. Complete agreement yields the best F1-scores, especially for First Party Collection/Use and Third-Party Sharing/Collection. Annotation disagreements significantly impact classification outcomes, underscoring the need for understanding annotation disagreements.

Conclusions. Ignoring annotation disagreements can misrepresent model accuracy. This study proposes new evaluation strategies that account for annotation variability, offering a more realistic approach to privacy policy analysis. Future work should explore the causes of annotation disagreements to improve machine learning transparency and reliability.

Downloads

Published

2025-03-11

How to Cite

Wang, T., Ma, Y., Blake, C., Bashir, M., & Wang, R. (2025). Taking disagreements into consideration: human annotation variability in privacy policy analysis. Information Research an International Electronic Journal, 30(iConf), 81–92. https://doi.org/10.47989/ir30iConf47581

Issue

Section

Peer-reviewed papers