Publication
Assisting European Portuguese teaching: linguistic features extraction and automatic readability classifier
dc.contributor.author | Curto, Pedro | |
dc.contributor.author | Mamede, Nuno | |
dc.contributor.author | Baptista, Jorge | |
dc.date.accessioned | 2017-04-07T15:57:37Z | |
dc.date.available | 2017-04-07T15:57:37Z | |
dc.date.issued | 2016 | |
dc.description.abstract | This paper describes two automatic systems: a linguistic features extractor and a text readability classifier for European Portuguese texts. Its main goal is to assist the selection of adequate reading materials to support Portuguese teaching, especially as a second language. To the feature extraction from texts, the system uses several Natural Language Processing (NLP) tools. Currently, 52 features are extracted: parts-of-speech (POS), syllables, words, chunks and phrases, averages and frequencies, among others. A classifier was created using these features and a corpus, previously annotated readability level, adopting the five-levels language classification official standard for Portuguese as Second Language. In a five-levels (from A1 to C1) scenario, the best-performing learning algorithm (LogitBoost) achieved an accuracy of 75.11% with a root mean square error (RMSE) of 0.269. In a three-levels (A, B and C) scenario, the best-performing learning algorithm (C4.5 grafted) achieved 81.44% accuracy, with a RMSE of 0.346. | |
dc.identifier.doi | 10.1007/978-3-319-29585-5_5 | |
dc.identifier.isbn | 978-3-319-29585-5; 978-3-319-29584-8 | |
dc.identifier.issn | 1865-0929 | |
dc.identifier.other | AUT: JBA00689; | |
dc.identifier.uri | http://hdl.handle.net/10400.1/9766 | |
dc.language.iso | eng | |
dc.peerreviewed | yes | |
dc.publisher | Inst Syst & Technologies Informat, Control & Commun; Int Soc Engn EducInst Syst & Technologies Informat, Control & Commun; Int Soc Engn Educ | |
dc.relation.isbasedon | WOS:000371386000005 | |
dc.title | Assisting European Portuguese teaching: linguistic features extraction and automatic readability classifier | |
dc.type | journal article | |
dspace.entity.type | Publication | |
oaire.awardURI | info:eu-repo/grantAgreement/FCT/5876/UID%2FCEC%2F50021%2F2013/PT | |
oaire.citation.conferencePlace | Lisbon, Portugal | |
oaire.citation.endPage | 96 | |
oaire.citation.startPage | 81 | |
oaire.citation.title | International Conference on Computer Supported Education | |
oaire.citation.title | CSEDU 2015: Computer Supported Education | |
oaire.fundingStream | 5876 | |
person.familyName | Baptista | |
person.givenName | Jorge | |
person.identifier.ciencia-id | 7010-5366-22C5 | |
person.identifier.orcid | 0000-0003-4603-4364 | |
person.identifier.rid | H-7699-2013 | |
person.identifier.scopus-author-id | 14035269500 | |
project.funder.identifier | http://doi.org/10.13039/501100001871 | |
project.funder.name | Fundação para a Ciência e a Tecnologia | |
rcaap.rights | openAccess | |
rcaap.type | article | |
relation.isAuthorOfPublication | e817fa28-a005-40e2-9ba4-03fdaedd7df3 | |
relation.isAuthorOfPublication.latestForDiscovery | e817fa28-a005-40e2-9ba4-03fdaedd7df3 | |
relation.isProjectOfPublication | 4b33c456-e2db-4613-a2ef-db1484b29ab7 | |
relation.isProjectOfPublication.latestForDiscovery | 4b33c456-e2db-4613-a2ef-db1484b29ab7 |
Files
Original bundle
1 - 1 of 1