SUMMARIZATION OF SPOKEN LECTURES BASED ON LINGUISTIC SURFACE AND PROSODIC INFORMATION
- 1 January 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
We aim to automatically extract summaries of spoken lectures given at conferences and in classes. To this end, we first compared the summaries extracted by human subjects and found large differences among the subjects. We then investigated the relation between linguistic surface information and the human results, and identified linguistic surface information that is useful for summarization. Next, we summarized spoken lectures from conferences and classes using this linguistic information. We also examined two prosodic features, F0 and power, and conducted the same experiments on them. Lastly, we combined the linguistic surface information with the prosodic information. As a result, the proposed automatic summarization achieved an F-measure of 0.599, a k-value of 0.420 and a ROUGE metric of 0.758, which are comparable with the human results.
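The abstract reports an F-measure but does not say how it was computed. As a minimal sketch, assuming the summaries are sets of extracted sentences and that a system summary is scored against one human subject's selection, the F-measure could be calculated as below. The function name `extraction_f_measure` and the sentence indices are hypothetical and are not taken from the paper.

```python
def extraction_f_measure(system, reference):
    """F-measure between two sets of extracted sentence indices.

    Assumption: both summaries are given as collections of sentence
    indices from the same lecture transcript.
    """
    system, reference = set(system), set(reference)
    overlap = len(system & reference)
    if overlap == 0:
        # No shared sentences (or an empty summary): score is zero.
        return 0.0
    precision = overlap / len(system)    # share of system picks that are correct
    recall = overlap / len(reference)    # share of human picks that were found
    return 2 * precision * recall / (precision + recall)


# Illustrative data only: sentences chosen by one human and by the system.
human = [0, 3, 4, 7]
auto = [0, 2, 4, 7, 9]
score = extraction_f_measure(auto, human)
print(f"F-measure: {score:.3f}")
```

In this toy case three of the five system sentences match the human selection, so precision is 0.6 and recall is 0.75. With several human subjects, one plausible option is to average the score over subjects, but the abstract does not state which procedure was used.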