Publication

Assisting European Portuguese teaching: linguistic features extraction and automatic readability classifier

dc.contributor.authorCurto, Pedro
dc.contributor.authorMamede, Nuno
dc.contributor.authorBaptista, Jorge
dc.date.accessioned2017-04-07T15:57:37Z
dc.date.available2017-04-07T15:57:37Z
dc.date.issued2016
dc.description.abstractThis paper describes two automatic systems: a linguistic features extractor and a text readability classifier for European Portuguese texts. Its main goal is to assist the selection of adequate reading materials to support Portuguese teaching, especially as a second language. To extract features from texts, the system uses several Natural Language Processing (NLP) tools. Currently, 52 features are extracted: parts-of-speech (POS), syllables, words, chunks and phrases, averages and frequencies, among others. A classifier was created using these features and a corpus previously annotated with readability levels, adopting the official five-level language classification standard for Portuguese as a Second Language. In a five-level scenario (A1 to C1), the best-performing learning algorithm (LogitBoost) achieved an accuracy of 75.11% with a root mean square error (RMSE) of 0.269. In a three-level scenario (A, B and C), the best-performing learning algorithm (C4.5 grafted) achieved 81.44% accuracy, with an RMSE of 0.346.
dc.identifier.doi10.1007/978-3-319-29585-5_5
dc.identifier.isbn978-3-319-29585-5; 978-3-319-29584-8
dc.identifier.issn1865-0929
dc.identifier.otherAUT: JBA00689;
dc.identifier.urihttp://hdl.handle.net/10400.1/9766
dc.language.isoeng
dc.peerreviewedyes
dc.publisherInst Syst & Technologies Informat, Control & Commun; Int Soc Engn Educ
dc.relation.isbasedonWOS:000371386000005
dc.titleAssisting European Portuguese teaching: linguistic features extraction and automatic readability classifier
dc.typejournal article
dspace.entity.typePublication
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/5876/UID%2FCEC%2F50021%2F2013/PT
oaire.citation.conferencePlaceLisbon, Portugal
oaire.citation.endPage96
oaire.citation.startPage81
oaire.citation.titleInternational Conference on Computer Supported Education
oaire.citation.titleCSEDU 2015: Computer Supported Education
oaire.fundingStream5876
person.familyNameBaptista
person.givenNameJorge
person.identifier.ciencia-id7010-5366-22C5
person.identifier.orcid0000-0003-4603-4364
person.identifier.ridH-7699-2013
person.identifier.scopus-author-id14035269500
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccess
rcaap.typearticle
relation.isAuthorOfPublicatione817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isAuthorOfPublication.latestForDiscoverye817fa28-a005-40e2-9ba4-03fdaedd7df3
relation.isProjectOfPublication4b33c456-e2db-4613-a2ef-db1484b29ab7
relation.isProjectOfPublication.latestForDiscovery4b33c456-e2db-4613-a2ef-db1484b29ab7

Files

Original bundle
Name: 9766.pdf
Size: 193.62 KB
Format: Adobe Portable Document Format