A visual analysis on recognizability and discriminability of onomatopoeia words with DCNN features
- 1 June 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2015 IEEE International Conference on Multimedia and Expo (ICME)
- No. 19457871,p. 1-6
- https://doi.org/10.1109/icme.2015.7177453
Abstract
In this paper, we examine the relation between onomatopoeia and images using a large number of Web images. The objective of this paper is to examine if the images corresponding to Japanese onomatopoeia words which express the feeling of visual appearance can be recognized by the state-of-the-art visual recognition methods. In our work, first, we collect the images corresponding to onomatopoeia words using an Web image search engine, and then we filter out noise images to obtain clean dataset with automatic image re-ranking method. Next, we analyze the recognizability of various kinds of onomatopoeia images using improved Fisher vector (IFV) and deep convolutional neural network (DCNN) features. In addition, we collect images corresponding to the pairs of nouns and onomatopoeia words, and we examine if the images associated with the same nouns and the different onomatopoeia words are visually discriminable or not. By the experiments, it has been shown that the DCNN features extracted from the layer 7 of Overfeat's network pre-trained with the ILSVRC 2013 data have prominent ability to represent onomatopoeia images, and most of the onomatopoeia words have visual characteristics which can be recognized.Keywords
This publication has 3 references indexed in Scilit:
- Harvesting Image Databases from the WebIEEE Transactions on Pattern Analysis and Machine Intelligence, 2010
- Exploring features in a Bayesian framework for material recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- Probabilistic web image gatheringPublished by Association for Computing Machinery (ACM) ,2005