Logo do repositório
 
Publicação

The role of adverbs in language variety identification: the case of Portuguese multi-word adverbs

datacite.subject.sdg04:Educação de Qualidade
datacite.subject.sdg09:Indústria, Inovação e Infraestruturas
datacite.subject.sdg10:Reduzir as Desigualdades
dc.contributor.authorMeira Grein Muller, Izabela
dc.contributor.authorBaptista, Jorge
dc.contributor.authorMamede, Nuno
dc.date.accessioned2026-04-08T10:06:36Z
dc.date.available2026-04-08T10:06:36Z
dc.date.issued2024-06-20
dc.description.abstractThis paper aims to assess the role of multi-word compound adverbs in distinguishing Brazilian Portuguese (PT-BR) from European Portuguese (PT-PT). For this study, a large lexicon of Portuguese multi-word adverbs (3,665) was annotated with diatopic information regarding language variety, which has not been available so far. The paper then investigates the distribution of this category in the DSL (Dialect and Similar Language) corpus of journalistic texts, representing Brazilian (PT-BR) and European Portuguese (PT-PT). Results indicate a substantial similarity between the two varieties, with a considerable overlap in the lexicon of multiword adverbs. Additionally, specific adverbs unique to each language variety were identified. Lexical entries recognized in the corpus represent 18.2% (PT-BR) to 19.5% (PT-PT) of the lexicon, and approximately 5,700 matches in each partition. While many of the matches are spurious due to ambiguity with otherwise nonidiomatic, free strings, occurrences of adverbs marked as exclusive to one variety in texts from the other variety are rare.eng
dc.identifier.doi10.18653/v1/2024.vardial-1.8
dc.identifier.urihttp://hdl.handle.net/10400.1/28617
dc.language.isoeng
dc.peerreviewedyes
dc.publisherAssociation for Computational Linguistics
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.titleThe role of adverbs in language variety identification: the case of Portuguese multi-word adverbseng
dc.typeconference object
dspace.entity.typePublication
oaire.citation.conferenceDate2024
oaire.citation.conferencePlaceMexico City, Mexico
oaire.citation.endPage106
oaire.citation.startPage99
oaire.citation.titleProceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties, and Dialects (VarDial 2024)
oaire.versionhttp://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyNameMeira Grein Muller
person.familyNameBaptista
person.givenNameIzabela
person.givenNameJorge
person.identifier.ciencia-id7010-5366-22C5
person.identifier.orcid0000-0002-1826-3787
person.identifier.orcid0000-0003-4603-4364
person.identifier.ridH-7699-2013
person.identifier.scopus-author-id14035269500
relation.isAuthorOfPublication4bace0e3-0ae2-4ef3-83a7-9e0780b9a0a9
relation.isAuthorOfPublicatione817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isAuthorOfPublication.latestForDiscoverye817fa28-a005-40e2-9ba4-03fdaedd7df3

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
2024.vardial-1.8.pdf
Tamanho:
172.04 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
3.46 KB
Formato:
Item-specific license agreed upon to submission
Descrição: