Publication
Fine-grained fish classification from small to large datasets with vision transformers
| dc.contributor.author | Veiga, Ricardo | |
| dc.contributor.author | Rodrigues, Joao | |
| dc.date.accessioned | 2024-11-02T13:23:58Z | |
| dc.date.available | 2024-11-02T13:23:58Z | |
| dc.date.issued | 2024 | |
| dc.description.abstract | Fish species Fine-Grained Visual Classification (FGVC) is important for ecological research, environmental management, and biodiversity monitoring, as accurate fish species identification is crucial for assessing the health of marine ecosystems, monitoring changes in biodiversity, and converting conservation plans into action. Although Convolutional Neural Network (CNN)s have been the conventional approach for FGVC, their effectiveness in differentiating visually similar species is not always satisfactory. The advent of Vision Transformer (ViT)s, in particular the Shifted window (Swin) Transformer, has demonstrated potential in addressing these issues by using sophisticated self-attention and feature extraction techniques. This paper proposes a method of combining the FGVC Plug-in Module (FGVC-PIM) and the Swin Transformer. The FGVC-PIM improves classification by concentrating on the most discriminative image regions, while the Swin Transformer acts as the framework and provides strong hierarchical feature extraction. The performance of the method was assessed on 14 different datasets, which included 19 distinct subsets with varying environmental conditions and image quality. With the proposed method it was achieved state-of-the-art results in 13 of these subsets, exhibiting better accuracy and robustness than previous methods, in 2 subsets (not yet explored by other authors) new baseline results are presented, and in the remaining 4 it was achieved results always above 83%. | eng |
| dc.description.sponsorship | UIDP/04516/2020; 2022.11602.BD | |
| dc.identifier.doi | 10.1109/access.2024.3443654 | |
| dc.identifier.issn | 2169-3536 | |
| dc.identifier.uri | http://hdl.handle.net/10400.1/26198 | |
| dc.language.iso | eng | |
| dc.peerreviewed | yes | |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
| dc.relation | NOVA Laboratory for Computer Science and Informatics | |
| dc.relation.ispartof | IEEE Access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Transformers | |
| dc.subject | Computer vision | |
| dc.subject | Feature extraction | |
| dc.subject | Task analysis | |
| dc.subject | Convolutional neural networks | |
| dc.subject | Biological system modeling | |
| dc.subject | Biodiversity | |
| dc.subject | Marine ecosystems | |
| dc.subject | Monitoring | |
| dc.subject | Fish | |
| dc.subject | Fine-grained visual classification | |
| dc.subject | Marine biodiversity monitoring | |
| dc.subject | Swin transformer | |
| dc.title | Fine-grained fish classification from small to large datasets with vision transformers | eng |
| dc.type | journal article | |
| dspace.entity.type | Publication | |
| oaire.awardTitle | NOVA Laboratory for Computer Science and Informatics | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F04516%2F2020/PT | |
| oaire.citation.endPage | 113660 | |
| oaire.citation.startPage | 113642 | |
| oaire.citation.title | IEEE Access | |
| oaire.citation.volume | 12 | |
| oaire.fundingStream | 6817 - DCRRNI ID | |
| oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |
| person.familyName | Veiga | |
| person.familyName | Rodrigues | |
| person.givenName | Ricardo | |
| person.givenName | Joao | |
| person.identifier | 1603578 | |
| person.identifier.ciencia-id | D212-85A6-C85A | |
| person.identifier.ciencia-id | 8A19-98F7-9914 | |
| person.identifier.orcid | 0000-0002-7557-8304 | |
| person.identifier.orcid | 0000-0002-3562-6025 | |
| person.identifier.scopus-author-id | 57203130604 | |
| person.identifier.scopus-author-id | 55807461600 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| relation.isAuthorOfPublication | 60cb9fb6-94b2-4657-935e-3e7eea696a49 | |
| relation.isAuthorOfPublication | 683ba85b-459c-4789-a4ff-a4e2a904b295 | |
| relation.isAuthorOfPublication.latestForDiscovery | 60cb9fb6-94b2-4657-935e-3e7eea696a49 | |
| relation.isProjectOfPublication | 1122b3d4-9740-4ad7-9abf-86bb7a3615da | |
| relation.isProjectOfPublication.latestForDiscovery | 1122b3d4-9740-4ad7-9abf-86bb7a3615da |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Fine-Grained_Fish_Classification_From_Small_to_Large_Datasets_With_Vision_Transformers.pdf
- Size:
- 3.97 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 3.46 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
