Results
Resources
The Simple.Text tool transforms Spanish texts to make them more readable and accessible for people with intellectual disabilities. Following the Easy-to-Read guidelines (Norma UNE 153101:2018 EX, the Spanish standard on Easy-to-Read), the tool processes the input text and returns a simplified version that complies with these guidelines. The functionalities implemented so far include simplifying superlatives, adverbs ending in -mente, and numerical expressions, among others.
Demo link: https://simpletext.demos.gplsi.es/
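To give a feel for what rule-based simplification of superlatives and -mente adverbs can look like, here is a minimal sketch in Python. It is not the Simple.Text implementation: the mini-lexicon, the exclusion list, the "de forma" paraphrase and all function names are assumptions made for illustration only.

```python
import re

# Hypothetical mini-lexicon (illustration only): superlative -> "muy" + base adjective.
SUPERLATIVES = {
    "importantísimo": "muy importante",
    "importantísima": "muy importante",
    "rapidísimo": "muy rápido",
    "rapidísima": "muy rápida",
}

# Words that end in "mente" but are not adverbs (assumed, incomplete list).
NOT_ADVERBS = {"clemente", "vehemente"}

WORD = re.compile(r"\w+")
MENTE = re.compile(r"\b(\w{3,})mente\b", re.IGNORECASE)


def simplify_superlatives(text):
    """Replace superlatives found in the lexicon; leave other words untouched."""
    return WORD.sub(
        lambda m: SUPERLATIVES.get(m.group(0).lower(), m.group(0)), text
    )


def simplify_adverbs(text):
    """Rewrite '<adjective>mente' as 'de forma <adjective>'."""
    def repl(match):
        word = match.group(0)
        if word.lower() in NOT_ADVERBS:
            return word
        phrase = "de forma " + match.group(1).lower()
        # Keep the capital letter when the adverb opens a sentence.
        return phrase.capitalize() if word[0].isupper() else phrase
    return MENTE.sub(repl, text)


def simplify(text):
    """Apply both rules in sequence."""
    return simplify_adverbs(simplify_superlatives(text))


print(simplify("El trámite es importantísimo y se resuelve rápidamente."))
# El trámite es muy importante y se resuelve de forma rápida.
```

A lexicon is used for superlatives because recovering the base adjective from an -ísimo form is not a plain suffix swap (for example, the vowel of the base form cannot be inferred from the suffix alone), whereas -mente adverbs are built on the feminine adjective and can be split directly.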
ClearSim is a parallel corpus in Spanish consisting of public administration texts, both in their original versions and adapted to Easy-to-Read and facilitated formats. This resource includes 15,000 original texts, 10,000 texts simplified with ChatGPT and reviewed by humans, and more than 4,000 texts adapted by experts following Easy-to-Read guidelines. The texts were selected from the most important municipalities in Alicante, specifically from their sport, culture and leisure sections, and were adapted and validated in collaboration with APSA, an NGO specialised in adapting texts for people with disabilities. The corpus is aligned at the document level, and some of the texts have also been tagged and aligned at the sentence level, making it useful for machine learning tasks and linguistic studies.
A corpus of public administration documents in Easy-to-Read versions, together with the original versions when available. This corpus is still under development and currently contains 140,000 tokens from documents such as the Statute of Autonomy of the Community of Madrid or the Guide to the institutions and organizations of the European Union.
It can be used to develop methods for aligning Easy-to-Read and original versions and to train generative models for simplification, among other Natural Language Processing tasks.
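As an illustration of the alignment task mentioned above, the following Python sketch pairs each simplified sentence with the original sentence that shares the most words with it (Jaccard overlap). This is only a simple baseline under stated assumptions, not the method used in the project: the function names, the threshold value and the example sentences are all hypothetical.

```python
import re


def tokens(sentence):
    """Lowercased set of word tokens in a sentence."""
    return set(re.findall(r"\w+", sentence.lower()))


def jaccard(a, b):
    """Word-overlap similarity between two sentences, in [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def align(original, simplified, threshold=0.2):
    """Return (original_index, simplified_index, score) for each simplified
    sentence whose best-matching original sentence reaches the threshold."""
    pairs = []
    for j, simp in enumerate(simplified):
        scores = [jaccard(simp, orig) for orig in original]
        if not scores:
            continue
        best = max(range(len(scores)), key=scores.__getitem__)
        if scores[best] >= threshold:
            pairs.append((best, j, scores[best]))
    return pairs


# Invented example sentences, for illustration only.
original = [
    "El polideportivo municipal permanecerá cerrado durante el mes de agosto.",
    "Las inscripciones se realizarán en la concejalía de deportes.",
]
simplified = [
    "El polideportivo cierra en agosto.",
    "Las inscripciones se hacen en la concejalía de deportes.",
]

for orig_idx, simp_idx, score in align(original, simplified):
    print(orig_idx, simp_idx, round(score, 2))
```

Lexical overlap tends to miss pairs where the Easy-to-Read version replaces most of the vocabulary, so in practice a similarity measure based on sentence embeddings would likely be needed; the overall structure of the loop would stay the same.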
Corpus link: Corpus ClearText
Publications
A Review of Research-based Automatic Text Simplification Tools. https://acl-bg.org/proceedings/2023/RANLP%202023/RANLP%202023%20Proceedings.pdf
A Review of Parallel Corpora for Automatic Text Simplification. Key Challenges Moving Forward. https://link.springer.com/chapter/10.1007/978-3-031-35320-8_5
Automatic Text Simplification for People with Cognitive Disabilities: Resource Creation within the ClearText Project. https://tsar-workshop.github.io/program/papers/espinosa-zaragoza-etal-2023-automatic.pdf
CLEAR.TEXT Enhancing the Modernization Public Sector Organizations by Deploying Natural Language Processing to Make Their Digital Content CLEARER to Those with Cognitive Disabilities. https://ceur-ws.org/Vol-3516/paper09.pdf