Logo do repositório
 
Publicação

From words to visuals: a transformer-based multi-modal framework for emotion-driven tourism analytics

datacite.subject.sdg09:Indústria, Inovação e Infraestruturas
datacite.subject.sdg08:Trabalho Digno e Crescimento Económico
datacite.subject.sdg11:Cidades e Comunidades Sustentáveis
dc.contributor.authorCalderón-Fajardo, Víctor
dc.contributor.authorRodríguez-Rodríguez, Ignacio
dc.contributor.authorPuig-Cabrera, Miguel
dc.date.accessioned2026-03-09T09:57:03Z
dc.date.available2026-03-09T09:57:03Z
dc.date.issued2025-07-22
dc.description.abstractTraditional tourism analytics have primarily relied on isolated sentiment analysis and image processing techniques, often failing to capture the subtle interaction between textual expressions and visual aesthetics inherent in tourist experiences. This study addresses these limitations by proposing a novel multi-modal framework that transforms textual reviews into AI-generated images using standardized prompts, thereby converting affective signals into explicit visual features. Leveraging stateof-the-art models—such as Distilled Bidirectional Encoder Representations from Transformers (DistilBERT) for fine-grained emotion recognition and Contrastive Language–Image Pre training (CLIP) for semantic extraction of visual attributes— our approach maps complex sentiments onto interpretable visual characteristics, integrating explainable features to uncover the underlying structure in tourist perceptions. This approach enhances classification performance and provides a transparent mechanism for understanding how distinct emotional states correspond to specific visual cues. Experimental evaluations on a dataset encompassing four diverse tourist destinations—Berlin, Dublin, Cairo, and Málaga—demonstrate high classification accuracy and robust correlations between text-derived emotions and image-based features, close to more powerful embedding methods. Significant correlations were observed between emotions and visual features, e.g., brightness and contentment, as well as between entropy and shame, indicating that our method efficiently captures the affective resonance between visual and textual modalities. Our findings underscore the transformative potential of converting textual sentiment into visual representations to facilitate more accurate, interpretable, and actionable analytics in the tourism sector. This framework suggests promising avenues for dynamic destination characterization, informed marketing strategies, and enhanced urban planning initiatives, laying the foundation for future advancements in multimodal tourism analytics.eng
dc.description.sponsorshipRYC2023-045296-I; MICIU/AEI/10.13039/501100011033
dc.identifier.doi10.1007/s40558-025-00334-2
dc.identifier.eissn1943-4294
dc.identifier.issn1098-3058
dc.identifier.urihttp://hdl.handle.net/10400.1/28375
dc.language.isoeng
dc.peerreviewedyes
dc.publisherSpringer
dc.relationResearch Centre for Tourism, Sustainability and Well-being
dc.relation.ispartofInformation Technology & Tourism
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectMultimodal tourism analytics
dc.subjectTransformer models
dc.subjectText-toImage generation
dc.subjectAffective sentiment analysis
dc.subjectExplainable AI
dc.subjectDestination classification
dc.titleFrom words to visuals: a transformer-based multi-modal framework for emotion-driven tourism analyticseng
dc.typejournal article
dspace.entity.typePublication
oaire.awardNumberUIDB/04020/2020
oaire.awardTitleResearch Centre for Tourism, Sustainability and Well-being
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F04020%2F2020/PT
oaire.citation.issue4
oaire.citation.titleInformation Technology and Tourism
oaire.citation.volume27
oaire.fundingStream6817 - DCRRNI ID
oaire.versionhttp://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyNamePuig-Cabrera
person.givenNameMiguel
person.identifier.ciencia-id4816-E98C-E353
person.identifier.orcid0000-0003-4524-9830
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
relation.isAuthorOfPublicatione926f262-ecb5-44df-9de7-454755dac26e
relation.isAuthorOfPublication.latestForDiscoverye926f262-ecb5-44df-9de7-454755dac26e
relation.isProjectOfPublicationfa579efb-63c0-486e-b05d-859542b73647
relation.isProjectOfPublication.latestForDiscoveryfa579efb-63c0-486e-b05d-859542b73647

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
s40558-025-00334-2.pdf
Tamanho:
3.72 MB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
3.46 KB
Formato:
Item-specific license agreed upon to submission
Descrição: