Taking disagreements into consideration: human annotation variability in privacy policy analysis

Tian Wang; Yuanye Ma; Catherine Blake; Masooda Bashir; Ryan Wang

doi:10.47989/ir30iConf47581

Authors

Tian Wang University of Illinois Urbana-Champaign
Yuanye Ma University of Illinois Discovery Partners Institute
Catherine Blake University of Illinois Urbana-Champaign
Masooda Bashir University of Illinois Urbana-Champaign
Ryan Wang University of Illinois Urbana-Champaign

DOI:

https://doi.org/10.47989/ir30iConf47581

Keywords:

privacy policy, annotator disagreement, natural language processing, machine learning, human label variation

Abstract

Introduction. Privacy policies inform users about data practices but are often complex and difficult to interpret. Human annotation plays a key role in understanding privacy policies, yet annotation disagreements highlight the complexity of these texts. Traditional machine learning models prioritize consensus, overlooking annotation variability and its impact on accuracy.

Method. This study examines how annotation disagreements affect machine learning performance using the OPP-115 corpus. It compares majority vote and union methods with alternative strategies to assess their impact on policy classification.

Analysis. The study evaluates whether increasing annotator consensus improves model effectiveness and if disagreement-aware approaches yield more reliable results.

Results. Higher agreement levels improve model performance across most categories. Complete agreement yields the best F1-scores, especially for First Party Collection/Use and Third-Party Sharing/Collection. Annotation disagreements significantly impact classification outcomes, underscoring the need for understanding annotation disagreements.

Conclusions. Ignoring annotation disagreements can misrepresent model accuracy. This study proposes new evaluation strategies that account for annotation variability, offering a more realistic approach to privacy policy analysis. Future work should explore the causes of annotation disagreements to improve machine learning transparency and reliability.

Taking disagreements into consideration: human annotation variability in privacy policy analysis

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

About the Journal

Make a Submission

Information