Tools
Below is a list of tools that may be useful for the MathIR task. This list is non-exhaustive, and recommendations from participants are quite welcome.
Evaluation and Visualization
- SEPIA : Standard Evaluation Package for Information Access Systems. (https://code.google.com/p/sepia/) (used with MathML extension)
- trec_eval : A program to evaluate TREC results. (http://trec.nist.gov/trec_eval/) (Notes on Trec Eval)
- MathJax : Javascript tools for rendering math formulae (LaTeX of MathML) in web pages (http://www.mathjax.org/)
Data Conversion
- LaTeXML : A LaTeX to MathML converter. (http://dlmf.nist.gov/LaTeXML/)
- docs2harvest : Tool for parsing html / xhtml documents and generate harvest files with the Content Math data only. (https://github.com/KWARC/mws)
- mathml-converter : Tool for converting MathML into keywords (http://code.google.com/p/mathml-converter/)
- PlaneText : Tool for Bridging XML/XHTML Documents and NLP Tools (http://kmcs.nii.ac.jp/planetext/)
Participant Math Search Engines & Tools
-
Tangent : Math search engine
- http://www.cs.rit.edu/~dprl/files/tangent_0.2.zip
- http://www.cs.rit.edu/~dprl/files/tangent-solr.zip
- online demonstration (formula search v1, ca. 2013; random query – might need to click again)
- min : Math entry and search interface (online demonstration)
- Other source code and tools created by RIT dprl lab