U-papers :: View publication

View publication

Title	AU-TextAlign: A Pipeline for Word-Level Facial Expression Tagging in LLMs
Authors	Jose Guillen, Valentín Barriere, Mauricio Araya
Publication date	2025
Abstract	We present AU-TextAlign, a method for embedding facial expressions into text by aligning speech transcriptions with Action Units (AUs) from the Facial Action Coding System. The approach links detected AUs with individual spoken words, producing structured annotations that can be directly processed by Large Language Models (LLMs). We test whether LLMs can infer facial expressions from text alone and find that, while current models fail to recover most expressions without explicit cues, they respond more affectively when AU tags are provided. This negative result highlights a gap in multimodal understanding and motivates the integration of structured non-verbal signals into language interfaces. Our work opens a pathway toward socially aware systems that bridge verbal and non-verbal communication.
Conference name	International Conference on Pattern Recognition Systems
Publisher	IEEE-xplore
Reference URL