AMI is a software tool to extract and analyze the world's scientific facts in scholarly publications, theses and reports
AMI will be used to power the Content Mine: see video (https://vimeo.com/78353557). Everyone is welcome, either to help develop/document software or extract and analyze content.
AMI consists of about 8 projects, with production and development versions (*-dev). The first (
XHTML2STM) is where users should start.
toplevel package for converting
HTMLto Science Technical Medical (STM). It uses domain-specific visitors (e.g. chemistry, phylogenetics, metabolism).
- SVG2XML. Converts
SVGlibrary (based on
XOM) including tools for building higher level graphics primitives (squares, circles, arrows, etc.)
- PDF2SVG. Parser/converter from
SVG, including normalization to Unicode where possible.
- CRAWLERREPO. Crawler for Open scientific publications. Repository using CKAN Datahub.io