A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

Zhai, Wanyue; Rusert, Jonathan; Shafiq, Zubair; Srinivasan, Padmini

Computer Science > Computation and Language

arXiv:2203.11849 (cs)

[Submitted on 22 Mar 2022]

Title:A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

Authors:Wanyue Zhai, Jonathan Rusert, Zubair Shafiq, Padmini Srinivasan

View PDF

Abstract:Recent advances in natural language processing have enabled powerful privacy-invasive authorship attribution. To counter authorship attribution, researchers have proposed a variety of rule-based and learning-based text obfuscation approaches. However, existing authorship obfuscation approaches do not consider the adversarial threat model. Specifically, they are not evaluated against adversarially trained authorship attributors that are aware of potential obfuscation. To fill this gap, we investigate the problem of adversarial authorship attribution for deobfuscation. We show that adversarially trained authorship attributors are able to degrade the effectiveness of existing obfuscators from 20-30% to 5-10%. We also evaluate the effectiveness of adversarial training when the attributor makes incorrect assumptions about whether and which obfuscator was used. While there is a a clear degradation in attribution accuracy, it is noteworthy that this degradation is still at or above the attribution accuracy of the attributor that is not adversarially trained at all. Our results underline the need for stronger obfuscation approaches that are resistant to deobfuscation

Comments:	9 pages, 7 figures, 3 tables, ACL 2022
Subjects:	Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2203.11849 [cs.CL]
	(or arXiv:2203.11849v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2203.11849

Submission history

From: Wanyue Zhai [view email]
[v1] Tue, 22 Mar 2022 16:26:09 UTC (656 KB)

Computer Science > Computation and Language

Title:A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Girl Has A Name, And It's ... Adversarial Authorship Attribution for Deobfuscation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators