Gelişmiş Arama

Basit öğe kaydını göster

dc.contributor.authorÇakır, Duygu
dc.contributor.authorYılmaz, Görkem
dc.contributor.authorArıca, Nafiz
dc.date.accessioned2024-10-09T06:42:48Z
dc.date.available2024-10-09T06:42:48Z
dc.date.issued2024en_US
dc.identifier.citationCakir, D., Yilmaz, G., & Arica, N. (2024). Sparse landmarks for facial action unit detection using vision transformer and perceiver. International Journal of Computational Science and Engineering, 27(5), 607-620.en_US
dc.identifier.issn1742-7185
dc.identifier.urihttps://hdl.handle.net/20.500.12960/1679
dc.description.abstractThe ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.en_US
dc.language.isoengen_US
dc.publisherInderscience Publishersen_US
dc.relation.ispartofInternational Journal of Computational Science and Engineeringen_US
dc.relation.isversionof10.1504/IJCSE.2024.141343en_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectAction unit detectionen_US
dc.subjectPerceiveren_US
dc.subjectSparse learningen_US
dc.subjectVision transformeren_US
dc.titleSparse landmarks for facial action unit detection using vision transformer and perceiveren_US
dc.typearticleen_US
dc.authorid0000-0002-3810-5866en_US
dc.departmentMühendislik Fakültesi, Bilişim Sistemleri Mühendisliğien_US
dc.contributor.institutionauthorArıca, Nafiz
dc.identifier.volume27en_US
dc.identifier.issue5en_US
dc.identifier.startpage607en_US
dc.identifier.endpage620en_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US


Bu öğenin dosyaları:

DosyalarBoyutBiçimGöster

Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster