Annotating scientific papers for mathematical formula search

Abstract
In recent years, growing numbers of scientific papers have been published in XML format generating a large published base of MathML-style formulas. Although these formulas can be indexed and searched based on their XML tree structures, they generally lack sufficient information for semantic interpretation. We propose an annotation design for linking mathematical formulas to natural language descriptions in the surrounding text. We also introduce potential applications for this annotation framework.