Using a Chinese treebank to measure dependency distance

1 January 2009

journal article
research article
Published by Walter de Gruyter GmbH in Corpus Linguistics and Linguistic Theory

Vol. 5 (2), 161-174
https://doi.org/10.1515/cllt.2009.007

Abstract

This article describes a method for calculating the ‘dependency distance’ between the words in a text – i.e. the number of words that separate each word from the word on which it depends syntactically – and reports the results of applying this method to a Chinese treebank. This study shows that Chinese dependencies tend strongly to be governor-final and that the mean dependency distance of words is much higher for Chinese than for other languages that have been studied including English, German and Japanese. It is unclear whether this difference means that Chinese is syntactically more difficult to process.

Keywords

This publication has 2 references indexed in Scilit:

Consequences of the Serial Nature of Linguistic Input for Sentenial Complexity
Cognitive Science, 2005
Linguistic complexity: locality of syntactic dependencies
Cognition, 1998

Cited by 39 articles