Repository logo
 
Publication

Mapping, filtering and measuring impact of ambiguous words in Portuguese

dc.contributor.authorBaptista, Jorge
dc.contributor.authorFaísca, Luís
dc.date.accessioned2014-07-31T11:10:58Z
dc.date.available2014-07-31T11:10:58Z
dc.date.issued2007
dc.date.updated2014-07-30T10:39:29Z
dc.description.abstractThis paper deals with ambiguous simple words of Portuguese. The Portuguese dictionary of simple inflected words contains (DELAF) 936.215 entries, from which there are 889.986 different inflected forms. It is possible to obtain the full list of ambiguous inflected forms (43.126), that is, word forms belonging to different categories and/or lemmas: capital,A/N/N (capital). We may consider A/N/N an ambiguity class. There are 137 ambiguity classes. Each ambiguity class presents a certain level of ambiguity (Amb) that corresponds to the number of lexical entries associated to each ambiguous form (again, for class A/N/N Amb=3). Based on this information it is possible to map how ambiguity affects the lexicon. Using the frequency information associated to the list of tokens of a large corpus (the CETEMPÚBLICO corpus, with 200 million words), it is possible to calculate how ambiguity affects real texts. Combining the two types of information, it is possible to devise and evaluate different strategies to reduce lexical ambiguity.por
dc.identifier.citationBaptista, Jorge; Faísca, Luís. Mapping, filtering and measuring impact of ambiguous words in Portuguese, In Formaliser les langues avec l’ordinateur: de INTEX à Nooj, 305-324, ISBN: 978-2-84867-189-5. Besançon: Presses Universitaires de Franche-Comté, 2007.por
dc.identifier.isbn978-2-84867-189-5
dc.identifier.otherAUT: JBA00689; LFA00717;
dc.identifier.urihttp://hdl.handle.net/10400.1/4884
dc.language.isoengpor
dc.peerreviewedyespor
dc.publisherPresses Universitaires de Franche-Comtépor
dc.subjectProcessamento Computacional de Linguagem Naturalpor
dc.subjectLínguística de corporapor
dc.titleMapping, filtering and measuring impact of ambiguous words in Portuguesepor
dc.typeconference object
dspace.entity.typePublication
oaire.citation.conferencePlaceSofiapor
oaire.citation.endPage324por
oaire.citation.startPage305por
oaire.citation.title6th INTEX Workshoppor
person.familyNameBaptista
person.familyNameFaísca
person.givenNameJorge
person.givenNameLuís
person.identifierA-4633-2013
person.identifier.ciencia-id7010-5366-22C5
person.identifier.ciencia-id5719-6727-C596
person.identifier.orcid0000-0003-4603-4364
person.identifier.orcid0000-0003-4859-8817
person.identifier.ridH-7699-2013
person.identifier.scopus-author-id14035269500
person.identifier.scopus-author-id6503944802
rcaap.rightsopenAccesspor
rcaap.typeconferenceObjectpor
relation.isAuthorOfPublicatione817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isAuthorOfPublicatione21a01b7-3ea3-45b1-97db-f6553ead69a1
relation.isAuthorOfPublication.latestForDiscoverye817fa28-a005-40e2-9ba4-03fdaedd7df3

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mapping, filtering and measuring impact of ambiguous words in Portuguese.pdf
Size:
347.94 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: