decode when parsing BibTeX

Issue #112 new
Nico Schlömer created an issue

When parsing a BibTeX file, all entries are read as-is, e.g.,

data = bibtex.Parser().parse_file(args.infile)

will produce data with LaTeX escape sequences etc., e.g.,

('Publisher', 'W{\"o}ley \\& Co.')

I would like to have all entries decoded into proper LaTeX unicode strings, i.e.,

('Publisher', 'Wöley & Co.')

Right now, I manually plow through the database with codecs.decode(value, 'ulatex'), and it'd be great if parse_file() supported this right off the bat.

Comments (0)

  1. Log in to comment