Repository logo
 
Publication

Automated anonymization of text documents

dc.contributor.authorDias, Francisco
dc.contributor.authorMamede, Nuno
dc.contributor.authorBaptista, Jorge
dc.date.accessioned2017-04-07T15:57:20Z
dc.date.available2017-04-07T15:57:20Z
dc.date.issued2016
dc.description.abstractSharing data in the form of text is important for a wide range of activities but it also raises a concern about privacy when sharing data that could be sensitive. Automated text anonymization is a solution for removing all the sensitive information from documents. However, this is a challenging task due to the unstructured form of textual data and the ambiguity of natural language. In this work, we present our implementation of an automated anonymization system, built in a modular structure, for documents written in Portuguese. Four different methods of anonymization are evaluated and compared. Two methods replace the sensitive information by artificial labels: suppression and tagging. The other two methods replace the information by textual expressions: random substitution and generalization. Evaluation showed that the use of the tagging and the generalization methods facilitates the reading of an anonymized text while preventing some semantic drifts caused by the remotion of the original information.
dc.identifier.isbn978-1-5090-0622-9
dc.identifier.otherAUT: JBA00689;
dc.identifier.urihttp://hdl.handle.net/10400.1/9683
dc.language.isoeng
dc.peerreviewedyes
dc.publisherIEEE
dc.relationInstituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa
dc.relation.isbasedonWOS:000390749101060
dc.titleAutomated anonymization of text documents
dc.typejournal article
dspace.entity.typePublication
oaire.awardTitleInstituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UID%2FCEC%2F50021%2F2013/PT
oaire.citation.conferencePlaceVancouver, Canada
oaire.citation.endPage1294
oaire.citation.startPage1287
oaire.citation.title2016 IEEE Congress on Evolutionary Computation
oaire.fundingStream6817 - DCRRNI ID
person.familyNameBaptista
person.givenNameJorge
person.identifier.ciencia-id7010-5366-22C5
person.identifier.orcid0000-0003-4603-4364
person.identifier.ridH-7699-2013
person.identifier.scopus-author-id14035269500
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsrestrictedAccess
rcaap.typearticle
relation.isAuthorOfPublicatione817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isAuthorOfPublication.latestForDiscoverye817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isProjectOfPublication2a35928a-08d0-4ab5-b856-f7e98ac2783f
relation.isProjectOfPublication.latestForDiscovery2a35928a-08d0-4ab5-b856-f7e98ac2783f

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
9683 feito.pdf
Size:
177.54 KB
Format:
Adobe Portable Document Format