Publication
Automated anonymization of text documents
dc.contributor.author | Dias, Francisco | |
dc.contributor.author | Mamede, Nuno | |
dc.contributor.author | Baptista, Jorge | |
dc.date.accessioned | 2017-04-07T15:57:20Z | |
dc.date.available | 2017-04-07T15:57:20Z | |
dc.date.issued | 2016 | |
dc.description.abstract | Sharing data in the form of text is important for a wide range of activities, but it also raises privacy concerns when the data could be sensitive. Automated text anonymization is a solution for removing all the sensitive information from documents. However, this is a challenging task due to the unstructured form of textual data and the ambiguity of natural language. In this work, we present our implementation of an automated anonymization system, built in a modular structure, for documents written in Portuguese. Four different methods of anonymization are evaluated and compared. Two methods replace the sensitive information with artificial labels: suppression and tagging. The other two methods replace the information with textual expressions: random substitution and generalization. Evaluation showed that the tagging and generalization methods make an anonymized text easier to read while preventing some of the semantic drift caused by the removal of the original information. | |
dc.identifier.isbn | 978-1-5090-0622-9 | |
dc.identifier.other | AUT: JBA00689; | |
dc.identifier.uri | http://hdl.handle.net/10400.1/9683 | |
dc.language.iso | eng | |
dc.peerreviewed | yes | |
dc.publisher | IEEE | |
dc.relation | Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa | |
dc.relation.isbasedon | WOS:000390749101060 | |
dc.title | Automated anonymization of text documents | |
dc.type | journal article | |
dspace.entity.type | Publication | |
oaire.awardTitle | Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa | |
oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UID%2FCEC%2F50021%2F2013/PT | |
oaire.citation.conferencePlace | Vancouver, Canada | |
oaire.citation.endPage | 1294 | |
oaire.citation.startPage | 1287 | |
oaire.citation.title | 2016 IEEE Congress on Evolutionary Computation | |
oaire.fundingStream | 6817 - DCRRNI ID | |
person.familyName | Baptista | |
person.givenName | Jorge | |
person.identifier.ciencia-id | 7010-5366-22C5 | |
person.identifier.orcid | 0000-0003-4603-4364 | |
person.identifier.rid | H-7699-2013 | |
person.identifier.scopus-author-id | 14035269500 | |
project.funder.identifier | http://doi.org/10.13039/501100001871 | |
project.funder.name | Fundação para a Ciência e a Tecnologia | |
rcaap.rights | restrictedAccess | |
rcaap.type | article | |
relation.isAuthorOfPublication | e817fa28-a005-40e2-9ba4-03fdaedd7df3 | |
relation.isAuthorOfPublication.latestForDiscovery | e817fa28-a005-40e2-9ba4-03fdaedd7df3 | |
relation.isProjectOfPublication | 2a35928a-08d0-4ab5-b856-f7e98ac2783f | |
relation.isProjectOfPublication.latestForDiscovery | 2a35928a-08d0-4ab5-b856-f7e98ac2783f |
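The four anonymization methods named in the abstract (suppression, tagging, random substitution, generalization) can be illustrated with a minimal sketch. This is not the authors' system: the function `anonymize`, the `SUBSTITUTES` and `GENERAL` lookup tables, the `XXXX` marker, the tag format, and the hand-written entity spans are all hypothetical, and entity detection is assumed to have already happened.

```python
import random

# Hypothetical lookup tables; a real system would rely on entity
# recognition output and a knowledge base instead of these literals.
SUBSTITUTES = {"PERSON": ["Ana Silva", "Rui Costa"], "LOCATION": ["Porto", "Faro"]}
GENERAL = {"Maria Santos": "uma pessoa", "Lisboa": "uma cidade"}


def anonymize(text, entities, method, seed=0):
    """Replace each (start, end, label) character span using one method.

    Spans are assumed to be non-overlapping and already detected.
    """
    rng = random.Random(seed)
    tag_ids = {}  # one stable tag per distinct surface form
    pieces, cursor = [], 0
    for start, end, label in sorted(entities):
        original = text[start:end]
        if method == "suppression":
            # Artificial label: the same neutral marker for every entity.
            replacement = "XXXX"
        elif method == "tagging":
            # Artificial label: entity class plus a running identifier.
            if original not in tag_ids:
                tag_ids[original] = f"[{label}{len(tag_ids) + 1}]"
            replacement = tag_ids[original]
        elif method == "random_substitution":
            # Textual expression: another entity of the same class.
            replacement = rng.choice(SUBSTITUTES[label])
        elif method == "generalization":
            # Textual expression: a broader term, falling back to the class name.
            replacement = GENERAL.get(original, label.lower())
        else:
            raise ValueError(f"unknown method: {method}")
        pieces.append(text[cursor:start])
        pieces.append(replacement)
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


text = "Maria Santos vive em Lisboa."
entities = [(0, 12, "PERSON"), (21, 27, "LOCATION")]

print(anonymize(text, entities, "suppression"))     # XXXX vive em XXXX.
print(anonymize(text, entities, "tagging"))         # [PERSON1] vive em [LOCATION2].
print(anonymize(text, entities, "generalization"))  # uma pessoa vive em uma cidade.
print(anonymize(text, entities, "random_substitution"))
```

In this sketch, suppression and tagging both produce artificial labels, but only tagging keeps distinct entities distinguishable, while the two textual methods keep the sentence readable as ordinary prose.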
Files
Original bundle
- Name: 9683 feito.pdf
- Size: 177.54 KB
- Format: Adobe Portable Document Format