Exploring few-shot approaches to automatic text complexity assessment in european portuguese

Ribeiro, Eugénio; Antunes, David; Mamede, Nuno; Baptista, Jorge

Publicação

Exploring few-shot approaches to automatic text complexity assessment in european portuguese

2025-08-21Artigo científico

datacite.subject.sdg	04:Educação de Qualidade
datacite.subject.sdg	09:Indústria, Inovação e Infraestruturas
datacite.subject.sdg	10:Reduzir as Desigualdades
dc.contributor.author	Ribeiro, Eugénio
dc.contributor.author	Antunes, David
dc.contributor.author	Mamede, Nuno
dc.contributor.author	Baptista, Jorge
dc.date.accessioned	2026-04-29T10:17:02Z
dc.date.available	2026-04-29T10:17:02Z
dc.date.issued	2025-08-21
dc.description.abstract	The automatic assessment of text complexity has an important role to play in the context of language education. In this study, we shift the focus from L2 learners to adult native speakers with low literacy by exploring the new iRead4Skills dataset in European Portuguese. Furthermore, instead of relying on classical machine learning approaches or fine-tuning a pre-trained language model, we leverage the capabilities of prompt-based Large Language Models (LLMs), with a special focus on few-shot prompting approaches. We explore prompts with varying degrees of information, as well as different example selection approaches. Overall, the results of our experiments reveal that even a single example significantly increases the performance of the model and that few-shot approaches generalize better than fine-tuned models. However, automatic complexity assessment is a difficult and highly subjective task that is still far from solved.	eng
dc.identifier.doi	10.5753/jbcs.2025.5820
dc.identifier.issn	1678-4804
dc.identifier.uri	http://hdl.handle.net/10400.1/28799
dc.language.iso	eng
dc.peerreviewed	yes
dc.publisher	Brazilian Computer Society (SBC)
dc.relation	Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa
dc.relation.ispartof	Journal of the Brazilian Computer Society
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	Text complexity
dc.subject	Readability
dc.subject	Few-shot prompting
dc.subject	Large language models
dc.title	Exploring few-shot approaches to automatic text complexity assessment in european portuguese	eng
dc.type	journal article
dspace.entity.type	Publication
oaire.awardNumber	UIDB/50021/2020
oaire.awardTitle	Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa
oaire.awardURI	info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F50021%2F2020/PT
oaire.citation.endPage	710
oaire.citation.issue	1
oaire.citation.startPage	690
oaire.citation.title	Journal of the Brazilian Computer Society
oaire.citation.volume	31
oaire.fundingStream	6817 - DCRRNI ID
oaire.version	http://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyName	Baptista
person.givenName	Jorge
person.identifier.ciencia-id	7010-5366-22C5
person.identifier.orcid	0000-0003-4603-4364
person.identifier.rid	H-7699-2013
person.identifier.scopus-author-id	14035269500
project.funder.identifier	http://doi.org/10.13039/501100001871
project.funder.name	Fundação para a Ciência e a Tecnologia
relation.isAuthorOfPublication	e817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isAuthorOfPublication.latestForDiscovery	e817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isProjectOfPublication	0b14d63a-8f78-4e31-8a86-b72e1f07871f
relation.isProjectOfPublication.latestForDiscovery	0b14d63a-8f78-4e31-8a86-b72e1f07871f

Ficheiros

Principais

A mostrar 1 - 1 de 1

Nome:: 5820-Article Text-31156-1-10-20250821.pdf
Tamanho:: 265.59 KB
Formato:: Adobe Portable Document Format

Ver/Abrir

Licença

A mostrar 1 - 1 de 1

Nome:: license.txt
Tamanho:: 3.46 KB
Formato:: Item-specific license agreed upon to submission
Descrição:

Ver/Abrir

Coleções

FCH2-Artigos (em revistas ou actas indexadas)