OpenXML: equations not extracted

Issue #1349 new
Chase Tingley created an issue

Based on discussion here: https://groups.google.com/g/okapi-users/c/ufI1W0E3LsA/m/rTdTyjnJAQAJ?utm_medium=email&utm_source=footer

Manuel’s testcase is attached.

It seems like these should be supported for extraction as a config option.

Additional context from Manuel:

Repositioning them is always likely to be necessary in languages that don't have the same syntactical order as the source language.

With regards to modifying the content of the equation, again it might depend on whether the target language uses the same, say, decimal separators as the source language.

For example, 2.95 should be 2,95 in many Roman languages (notice the comma).

It might also depend on the type of text. A textbook is likely to follow these rules more strictly than a scientific paper.

An option in the filter configuration file that allows the user to expose it or not would be ideal.

A complex tag structure is often the case anyway for this kind of math notation (outside of equation boxes), at least with the default OmegaT OpenXML filter. Translators must rely on previewing the source file (and/or the target files) to see what the equation looks like.

Comments (0)

  1. Log in to comment