Automated Readability Assessment for Spanish e-Government Information
Open Access
- 21 January 2021
- journal article
- Published by International Association for Digital Transformation and Technological Innovation in Journal of Information Systems Engineering and Management
- Vol. 6 (2), em0137
- https://doi.org/10.29333/jisem/9620
Abstract
This paper automatically evaluates the readability of Spanish e-government websites. Specifically, the websites collected explain e-government administrative procedures. The evaluation is carried out through the analysis of different linguistic characteristics that are presumably associated with a better understanding of these resources. To this end, texts from websites outside the government websites have been collected. These texts clarify the procedures published on the Spanish Government’s websites. These websites constitute the part of the corpus considered as the set of easy documents. The rest of the corpus has been completed with counterpart documents from government websites. The text of the documents has been processed, and the difficulty is evaluated through different classic readability metrics. At a later stage, automatic learning methods are used to apply algorithms to predict the difficulty of the text. The results of the study show that government web pages show high values for comprehension difficulty. This work proposes a new Spanish-language corpus of official e-government websites. In addition, a large number of combined linguistic attributes are applied, which improve the identification of the level of comprehensibility of a text with respect to classic metrics.Keywords
This publication has 14 references indexed in Scilit:
- Measuring text difficulty using parse‐tree frequencyJournal of the Association for Information Science and Technology, 2017
- Skills MatterPublished by Organisation for Economic Co-Operation and Development (OECD) ,2016
- Handbook of Research on Comparative Approaches to the Digital Age Revolution in Europe and the AmericasPublished by IGI Global ,2016
- NLP–Based Readability Assessment of Health–Related Texts: a Case Study on Italian Informed Consent FormsPublished by Association for Computational Linguistics (ACL) ,2015
- Combining NLP with evidence-based methods to find text metrics related to perceived and actual text difficultyPublished by Association for Computing Machinery (ACM) ,2012
- Reconstructing Readability: Recent Developments and Recommendations in the Analysis of Text DifficultyEducational Psychology Review, 2011
- The Percentage of Words Known in a Text and Reading ComprehensionThe Modern Language Journal, 2011
- The measurement of readabilityACM Journal of Computer Documentation, 2000
- An introduction to latent semantic analysisDiscourse Processes, 1998
- Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted PersonnelPublished by Defense Technical Information Center (DTIC) ,1975