Sparse landmarks for facial action unit detection using vision transformer and perceiver

Çakır, Duygu; Yılmaz, Görkem; Arıca, Nafiz

dc.contributor.author	Çakır, Duygu
dc.contributor.author	Yılmaz, Görkem
dc.contributor.author	Arıca, Nafiz
dc.date.accessioned	2024-10-09T06:42:48Z
dc.date.available	2024-10-09T06:42:48Z
dc.date.issued	2024	en_US
dc.identifier.citation	Cakir, D., Yilmaz, G., & Arica, N. (2024). Sparse landmarks for facial action unit detection using vision transformer and perceiver. International Journal of Computational Science and Engineering, 27(5), 607-620.	en_US
dc.identifier.issn	1742-7185
dc.identifier.uri	https://hdl.handle.net/20.500.12960/1679
dc.description.abstract	The ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.	en_US
dc.language.iso	eng	en_US
dc.publisher	Inderscience Publishers	en_US
dc.relation.ispartof	International Journal of Computational Science and Engineering	en_US
dc.relation.isversionof	10.1504/IJCSE.2024.141343	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Action unit detection	en_US
dc.subject	Perceiver	en_US
dc.subject	Sparse learning	en_US
dc.subject	Vision transformer	en_US
dc.title	Sparse landmarks for facial action unit detection using vision transformer and perceiver	en_US
dc.type	article	en_US
dc.authorid	0000-0002-3810-5866	en_US
dc.department	Mühendislik Fakültesi, Bilişim Sistemleri Mühendisliği	en_US
dc.contributor.institutionauthor	Arıca, Nafiz
dc.identifier.volume	27	en_US
dc.identifier.issue	5	en_US
dc.identifier.startpage	607	en_US
dc.identifier.endpage	620	en_US
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US

Bu öğenin dosyaları:

Dosyalar	Boyut	Biçim	Göster
Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Bilişim Sistemleri Mühendisliği Koleksiyonu [27]
Scopus İndeksli Yayınlar Koleksiyonu [1313]
Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu [1369]
WoS Indexed Publications Collection

Basit öğe kaydını göster