Automated Readability Assessment for Spanish e-Government Information

Open Access

journal article
Published by International Association for Digital Transformation and Technological Innovation in Journal of Information Systems Engineering and Management

Vol. 6 (2), em0137
https://doi.org/10.29333/jisem/9620

Abstract

This paper automatically evaluates the readability of Spanish e-government websites. Specifically, the websites collected explain e-government administrative procedures. The evaluation is carried out through the analysis of different linguistic characteristics that are presumably associated with a better understanding of these resources. To this end, texts from websites outside the government websites have been collected. These texts clarify the procedures published on the Spanish Government’s websites. These websites constitute the part of the corpus considered as the set of easy documents. The rest of the corpus has been completed with counterpart documents from government websites. The text of the documents has been processed, and the difficulty is evaluated through different classic readability metrics. At a later stage, automatic learning methods are used to apply algorithms to predict the difficulty of the text. The results of the study show that government web pages show high values for comprehension difficulty. This work proposes a new Spanish-language corpus of official e-government websites. In addition, a large number of combined linguistic attributes are applied, which improve the identification of the level of comprehensibility of a text with respect to classic metrics.

Keywords

This publication has 14 references indexed in Scilit:

Measuring text difficulty using parse‐tree frequency
Journal of the Association for Information Science and Technology, 2017
Skills Matter
Published by Organisation for Economic Co-Operation and Development (OECD) ,2016
Handbook of Research on Comparative Approaches to the Digital Age Revolution in Europe and the Americas
Published by IGI Global ,2016
NLP–Based Readability Assessment of Health–Related Texts: a Case Study on Italian Informed Consent Forms
Published by Association for Computational Linguistics (ACL) ,2015
Combining NLP with evidence-based methods to find text metrics related to perceived and actual text difficulty
Published by Association for Computing Machinery (ACM) ,2012
Reconstructing Readability: Recent Developments and Recommendations in the Analysis of Text Difficulty
Educational Psychology Review, 2011
The Percentage of Words Known in a Text and Reading Comprehension
The Modern Language Journal, 2011
The measurement of readability
ACM Journal of Computer Documentation, 2000
An introduction to latent semantic analysis
Discourse Processes, 1998
Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel
Published by Defense Technical Information Center (DTIC) ,1975

Cited by 6 articles