ASCII chars not un-escaped with Encoding Conversion Step

Issue #318 resolved
Former user created an issue

Original issue 318 created by @ysavourel on 2013-03-05T16:42:49.000Z:

Some ASCII characters do not seem to be un-escaped by the Encoding Conversion step when converting to UTF-8.

See http://tech.groups.yahoo.com/group/okapitools/message/3579

Comments (3)

  1. Former user Account Deleted
    • changed status to open

    Comment 1. originally posted by @ysavourel on 2013-08-17T13:29:07.000Z:

    It seems the conversion code skips the ASCII characters:

    if ( value < 128 ) {
        // Unknown pattern or ASCII values: keep it as is
        // (so <, &, ", etc. stay escaped)
        tmp.append(m.group());
    }

    I guess we were playing it safe.
    But we can be more specific: keep only '<', '&', '>', the apostrophe and '"' escaped, and convert all other ASCII characters.
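
A minimal sketch of that narrower rule, for illustration only. It is not the actual Okapi code: the class name `NcrUnescapeSketch`, the `unescape` method, and the regular expression are all hypothetical, and the sketch assumes the escapes in question are numeric character references such as `&#65;` or `&#x41;`. The five XML-special characters stay escaped; every other valid code point, ASCII or not, is converted.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch, not the Okapi implementation.
class NcrUnescapeSketch {

    // Assumed escape syntax: hexadecimal (&#x41;) or decimal (&#65;) references.
    private static final Pattern NCR =
        Pattern.compile("&#(?:[xX]([0-9a-fA-F]+)|([0-9]+));");

    // Characters that must remain escaped to keep the markup valid.
    private static final String KEEP_ESCAPED = "<>&'\"";

    static String unescape(String text) {
        Matcher m = NCR.matcher(text);
        StringBuilder tmp = new StringBuilder();
        int last = 0;
        while (m.find()) {
            // Copy the literal text between the previous match and this one.
            tmp.append(text, last, m.start());
            last = m.end();
            int value;
            try {
                value = (m.group(1) != null)
                    ? Integer.parseInt(m.group(1), 16)
                    : Integer.parseInt(m.group(2));
            } catch (NumberFormatException e) {
                // Number too large to parse: keep the reference as is.
                tmp.append(m.group());
                continue;
            }
            if (!Character.isValidCodePoint(value)
                    || KEEP_ESCAPED.indexOf(value) >= 0) {
                // Invalid code point or XML-special character: stay escaped.
                tmp.append(m.group());
            } else {
                // Everything else, including plain ASCII, is converted.
                tmp.appendCodePoint(value);
            }
        }
        // Copy whatever follows the last match.
        tmp.append(text, last, text.length());
        return tmp.toString();
    }

    public static void main(String[] args) {
        System.out.println(unescape("&#65;&#x42; &#60;b&#62; &#38;"));
    }
}
```

With this rule, `&#65;` becomes `A` while `&#60;` and `&#38;` are left untouched. Control characters are not treated specially here; a real fix would likely need to decide what to do with them.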
