| dc.contributor.author | Çakır, Duygu | |
| dc.contributor.author | Yılmaz, Görkem | |
| dc.contributor.author | Arıca, Nafiz | |
| dc.date.accessioned | 2025-04-11T11:24:16Z | |
| dc.date.available | 2025-04-11T11:24:16Z | |
| dc.date.issued | 2025 | en_US |
| dc.identifier.citation | Cakir, D., Yilmaz, G., & Arica, N. (2025). Enhanced facial action unit detection with adaptable patch sizes on representative landmarks. Neural Computing and Applications, 37(5), 3777-3791. | en_US |
| dc.identifier.issn | 0941-0643 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12960/1766 | |
| dc.description.abstract | The human face displays expressions through the contraction of various facial muscles. The Facial Action Coding System (FACS) is a widely accepted taxonomy that describes all visible changes in the face in terms of action units (AUs). In this study, AUs are examined by finding the most active landmarks of the face and then examining the most representative patch sizes of each landmark for the AU detection task. Sparse learning is employed to learn the most active landmarks for each AU, and then the active landmark patches are fed to ViT and Perceiver mechanisms independently. Experiments indicate that using active landmark patches with their most representative size improves the results when compared to using all the landmarks, especially when it is used on more challenging datasets as a support for the attention mechanism of the classifier. The results demonstrate that the proposed method improves the performance of the employed models and are further supported by experiments conducted across different datasets. | en_US |
| dc.language.iso | eng | en_US |
| dc.publisher | Springer Science and Business Media Deutschland GmbH | en_US |
| dc.relation.ispartof | Neural Computing and Applications | en_US |
| dc.relation.isversionof | 10.1007/s00521-024-10836-5 | en_US |
| dc.rights | info:eu-repo/semantics/embargoedAccess | en_US |
| dc.subject | Facial action unit detection | en_US |
| dc.subject | Perceiver | en_US |
| dc.subject | Sparse landmarks | en_US |
| dc.subject | Vision transformer | en_US |
| dc.title | Enhanced facial action unit detection with adaptable patch sizes on representative landmarks | en_US |
| dc.type | article | en_US |
| dc.authorid | 0000-0002-3810-5866 | en_US |
| dc.department | Mühendislik Fakültesi, Bilişim Sistemleri Mühendisliği | en_US |
| dc.contributor.institutionauthor | Arıca, Nafiz | |
| dc.identifier.volume | 37 | en_US |
| dc.identifier.issue | 5 | en_US |
| dc.identifier.startpage | 377 | en_US |
| dc.identifier.endpage | 3791 | en_US |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |