Mille, Simon (Date of defense: 2014-07-25)
The present Ph.D. thesis addresses the problem of deep data-driven Natural Language Generation (NLG), and in particular the role of proper corpus annotation schemata for stochastic sentence realization. ...
Casamayor, Gerard (Date of defense: 2021-04-26)
Text summarization deals with the automatic creation of summaries from one or more documents, either by extracting fragments from the input text or by generating an abstract de novo. Research in recent ...
Rodríguez Fernández, Sara (Date of defense: 2018-03-19)
Suele admitirse que las colocaciones en el sentido de coocurrencias idiosincráticas de palabras son un reto en el aprendizaje de lenguas. Los estudiantes producen frecuentemente combinaciones “agramaticales”' ...
Pérez-Mayos, Laura (Date of defense: 2022-06-15)
Pretrained Transformer-based language models have quickly replaced traditional approaches to model NLP tasks, pushing the state of the art to new levels, and will certainly continue to be very influential ...
Teixeira Fortuna, Paula Cristina (Date of defense: 2023-03-06)
The detection of hate speech in online spaces is traditionally conceptualized as a classification task that uses Machine Learning (ML)-driven Natural Language Processing (NLP) techniques. In accordance ...
Domínguez Bajo, Mónica (Date of defense: 2017-11-17)
This dissertation presents an empirical study on the information structure– prosody interface based on: (i) a formal description of hierarchical thematicity within a systematic language model for ...
Soler Company, Juan (Date of defense: 2017-07-06)
Author profiling and identification are two areas of data-driven computational linguistics that have gained a lot of relevance due to their potential applications in, e.g., forensic linguistic studies, ...