Browsing by Author "Batista, Fernando"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
- Early experiments on automatic annotation of Portuguese medieval textsPublication . Bico, Maria Inês; Baptista, Jorge; Batista, Fernando; Cardeira, EsperançaThis paper presents the challenges and solutions adopted to the lemmatization and part-of-speech (PoS) tagging of a corpus of Old Portuguese texts (up to 1525), to pave the way to the implementation of an automatic annotation of these Medieval texts. A highly granular tagset, previously devised for Modern Portuguese, was adapted to this end. A large text (similar to 155 thousand words) was manually annotated for PoS and lemmata and used to train an initial PoS-tagger model. When applied to two other texts, the resulting model attained 91.2% precision with a textual variant of the same text, and 67.4% with a new, unseen text. A second model was then trained with the data provided by the previous three texts and applied to two other unseen texts. The new model achieved a precision of 77.3% and 82.4%, respectively.
- Examining Airbnb guest satisfaction tendencies: a text mining approachPublication . Cavique, Mariana; Ribeiro, Ricardo; Batista, Fernando; Correia, AntóniaGiven Airbnb's changes since its inception and the dynamism of customer preferences, a study that sheds light on how customer satisfaction is evolving is relevant. An automated method is proposed for identifying these satisfaction tendencies at a large scale. This study follows a text mining approach to analyse 590,070 reviews posted between 2010 and 2019 on the Airbnb platform in Lisbon. Topic Modelling is employed in order to identify the main topics discussed in the reviews, and Sentiment Analysis to understand the topics that compose guest's satisfaction in the context of Airbnb services. Three major topics are extracted from Airbnb reviews: 'host's service', 'physical aspects', and 'location'. Although a positivity bias in guest reviews is confirmed, the satisfaction level seems to be decreasing over the years. The results also reveal that 'physical aspects' is the predominant topic when considering the negative guest reviews. This research considers big data the base to create knowledge, data spanning over the years, offering consistency to the research.