Food Category Representatives: Extracting Categories from Meal Names in Food Recordings and Recipe Data
- 1 April 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in 2015 IEEE International Conference on Multimedia Big Data
Abstract
Food Log is a multimedia recording tool for producing food records for many individuals. In one year of operation, Food Log has produced more than one million food records for meals eaten by users. We found nearly 70,000 unique food records among these data. In analyzing them, one of the challenges is to extract meal categories from such a large number of records. In this paper, we propose a method for compressing a meal name into a shorter representation. First, we collect similar meal names using a k-nearest neighbor search. Next, we construct a word graph to model the relationship between the meal names and items in the database. We select representative words by identifying minimal paths in the word graph. Finally, we obtain a few words that represent categorical information about the original meal name. We applied the method to data in food records for both Food Log and the Rakuten Recipe database. Our results show that the method worked effectively for both datasets.Keywords
This publication has 12 references indexed in Scilit:
- Frequency statistics of words used in Japanese food records of FoodLogPublished by Association for Computing Machinery (ACM) ,2014
- Comparative Study of the Routine Daily Usability of FoodLogJournal of Diabetes Science and Technology, 2014
- Topic Extraction Based on Knowledge Cluster in the Field of Micro-blogLecture Notes in Computer Science, 2014
- Evaluation of Sentence Compression Techniques against Human PerformanceLecture Notes in Computer Science, 2014
- CUES: A New Hierarchical Approach for Document ClusteringJournal of Pattern Recognition Research, 2013
- Generic title labeling for clustered documentsExpert Systems with Applications, 2010
- A Review of Web Document Clustering ApproachesPublished by Springer Science and Business Media LLC ,2006
- Power laws, Pareto distributions and Zipf's lawContemporary Physics, 2005
- Summarization beyond sentence extraction: A probabilistic approach to sentence compressionArtificial Intelligence, 2002
- Text categorization with Support Vector Machines: Learning with many relevant featuresLecture Notes in Computer Science, 1998