Publication
Mapping, filtering and measuring impact of ambiguous words in Portuguese
dc.contributor.author | Baptista, Jorge | |
dc.contributor.author | Faísca, Luís | |
dc.date.accessioned | 2014-07-31T11:10:58Z | |
dc.date.available | 2014-07-31T11:10:58Z | |
dc.date.issued | 2007 | |
dc.date.updated | 2014-07-30T10:39:29Z | |
dc.description.abstract | This paper deals with ambiguous simple words of Portuguese. The Portuguese dictionary of simple inflected words contains (DELAF) 936.215 entries, from which there are 889.986 different inflected forms. It is possible to obtain the full list of ambiguous inflected forms (43.126), that is, word forms belonging to different categories and/or lemmas: capital,A/N/N (capital). We may consider A/N/N an ambiguity class. There are 137 ambiguity classes. Each ambiguity class presents a certain level of ambiguity (Amb) that corresponds to the number of lexical entries associated to each ambiguous form (again, for class A/N/N Amb=3). Based on this information it is possible to map how ambiguity affects the lexicon. Using the frequency information associated to the list of tokens of a large corpus (the CETEMPÚBLICO corpus, with 200 million words), it is possible to calculate how ambiguity affects real texts. Combining the two types of information, it is possible to devise and evaluate different strategies to reduce lexical ambiguity. | por |
dc.identifier.citation | Baptista, Jorge; Faísca, Luís. Mapping, filtering and measuring impact of ambiguous words in Portuguese, In Formaliser les langues avec l’ordinateur: de INTEX à Nooj, 305-324, ISBN: 978-2-84867-189-5. Besançon: Presses Universitaires de Franche-Comté, 2007. | por |
dc.identifier.isbn | 978-2-84867-189-5 | |
dc.identifier.other | AUT: JBA00689; LFA00717; | |
dc.identifier.uri | http://hdl.handle.net/10400.1/4884 | |
dc.language.iso | eng | por |
dc.peerreviewed | yes | por |
dc.publisher | Presses Universitaires de Franche-Comté | por |
dc.subject | Processamento Computacional de Linguagem Natural | por |
dc.subject | Línguística de corpora | por |
dc.title | Mapping, filtering and measuring impact of ambiguous words in Portuguese | por |
dc.type | conference object | |
dspace.entity.type | Publication | |
oaire.citation.conferencePlace | Sofia | por |
oaire.citation.endPage | 324 | por |
oaire.citation.startPage | 305 | por |
oaire.citation.title | 6th INTEX Workshop | por |
person.familyName | Baptista | |
person.familyName | Faísca | |
person.givenName | Jorge | |
person.givenName | Luís | |
person.identifier | A-4633-2013 | |
person.identifier.ciencia-id | 7010-5366-22C5 | |
person.identifier.ciencia-id | 5719-6727-C596 | |
person.identifier.orcid | 0000-0003-4603-4364 | |
person.identifier.orcid | 0000-0003-4859-8817 | |
person.identifier.rid | H-7699-2013 | |
person.identifier.scopus-author-id | 14035269500 | |
person.identifier.scopus-author-id | 6503944802 | |
rcaap.rights | openAccess | por |
rcaap.type | conferenceObject | por |
relation.isAuthorOfPublication | e817fa28-a005-40e2-9ba4-03fdaedd7df3 | |
relation.isAuthorOfPublication | e21a01b7-3ea3-45b1-97db-f6553ead69a1 | |
relation.isAuthorOfPublication.latestForDiscovery | e817fa28-a005-40e2-9ba4-03fdaedd7df3 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Mapping, filtering and measuring impact of ambiguous words in Portuguese.pdf
- Size:
- 347.94 KB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.61 KB
- Format:
- Item-specific license agreed upon to submission
- Description: