Publications

View publication

Title AU-TextAlign: A Pipeline for Word-Level Facial Expression Tagging in LLMs
Authors Jose Guillen, Valentín Barriere, Mauricio Araya
Publication date 2025
Abstract We present AU-TextAlign, a method for embedding facial
expressions into text by aligning speech transcriptions with Action Units
(AUs) from the Facial Action Coding System. The approach links detected AUs
with individual spoken words, producing structured annotations that can be
directly processed by Large Language Models (LLMs). We test whether LLMs can
infer facial expressions from text alone and find that, while current models
fail to recover most expressions without explicit cues, they respond more
affectively when AU tags are provided. This negative result highlights a gap
in multimodal understanding and motivates the integration of structured
non-verbal signals into language interfaces. Our work opens a pathway toward
socially aware systems that bridge verbal and non-verbal
communication.
Conference name International Conference on Pattern Recognition Systems
Publisher IEEE-xplore
Reference URL View reference page