Learning Subjective Language

Top Cited Papers

Open Access

1 September 2004

journal article
Published by MIT Press in Computational Linguistics

Vol. 30 (3), 277-308
https://doi.org/10.1162/0891201041850885

Abstract

Subjectivity in natural language refers to aspects of language used to express opinions, evaluations, and speculations. There are numerous natural language processing applications for which subjectivity analysis is relevant, including information extraction and text categorization. The goal of this work is learning subjective language from corpora. Clues of subjectivity are generated and tested, including low-frequency words, collocations, and adjectives and verbs identified using distributional similarity. The features are also examined working together in concert. The features, generated from different data sets using different procedures, exhibit consistency in performance in that they all do better and worse on the same data sets. In addition, this article shows that the density of subjectivity clues in the surrounding context strongly affects how likely it is that a word is subjective, and it provides the results of an annotation study assessing the subjectivity of sentences with high-density features. Finally, the clues are used to perform opinion piece recognition (a type of text categorization and genre detection) to demonstrate the utility of the knowledge acquired in this article.

Keywords

This publication has 5 references indexed in Scilit:

Extracting the Lowest-Frequency Words: Pitfalls and Possibilities
Computational Linguistics, 2000
Recognizing subjectivity: a case study in manual tagging
Natural Language Engineering, 1999
Quantification of rewriting by the Brothers Grimm: A comparison of successive versions of three tales
Computers and the Humanities, 1989
Representing de re and de dicto belief reports in discourse and narrative
Proceedings of the IEEE, 1986
BORIS—An experiment in in-depth understanding of narratives
Artificial Intelligence, 1983

Cited by 318 articles