Who Wrote When? Author Diarization in Social Media Discussions

Boenninghoff, Benedikt; Hosseini, Henry; Nickel, Robert M.; Kolossa, Dorothea

Forschungsartikel in Sammelband (Konferenz) | Peer reviewed

Zusammenfassung

We are proposing a novel framework for author diarization, i.e. attributing comments in online discussions to individual authors. We consider an innovative approach that merges pre-trained neural representations of writing style with author-conditional encoder-decoder diarization, enhanced by a Conditional Random Field with Viterbi decoding for alignment refinement. Additionally, we introduce two new large-scale German language datasets, one for authorship verification and the other for author diarization. We evaluate the performance of our diarization framework on these datasets, offering insights into the strengths and limitations of this approach.

Details zur Publikation

Herausgeber*innenAl-Onaizan, Yaser; Bansal, Mohit; Chen, Yun-Nung
BuchtitelFindings of the Association for Computational Linguistics: EMNLP 2024
Seitenbereich15721-15734
VerlagSelbstverlag / Eigenverlag
ErscheinungsortMiami, Florida, USA
StatusVeröffentlicht
Veröffentlichungsjahr2024
Sprache, in der die Publikation verfasst istEnglisch
KonferenzEmpirical Methods in Natural Language Processing (EMNLP), Miami, Florida, Vereinigte Staaten
Link zum Volltexthttps://aclanthology.org/2024.findings-emnlp.922
StichwörterNLP; Deep Learning; Author Diarization; Social Media

Autor*innen der Universität Münster

Hosseini, Henry Simon
Institut für Wirtschaftsinformatik (WI)