Checkmate "Blacklist" function does not work for Japanese
Original issue 442 created by tsuruku... on 2015-02-04T13:39:40.000Z:
I tried the "Blacklist" function on an English-to-Japanese translation, but it does not work at all.
What is the expected output? What do you see instead?
My guess is that "Blacklist" works as if "\b" were placed before and after each listed word.
For example, if "function" is a blacklisted word, Checkmate flags "function", but not "malfunction".
This is convenient for European languages, but it is a critical problem for Japanese (and probably Chinese and Korean, too),
because in most cases these languages do not put spaces between the words of a sentence.
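To illustrate the behavior guessed at above, here is a minimal Python sketch. It is not Checkmate's code, and the helper name `matches_as_whole_word` is hypothetical. It wraps a term in `\b` anchors and shows that a Japanese term embedded in running text has no word boundary around it, because kana and kanji all count as word characters.

```python
import re


def matches_as_whole_word(term, text):
    """Return True if `term` occurs in `text` delimited by \\b word boundaries.

    Hypothetical helper for illustration only, not part of Checkmate.
    """
    pattern = r"\b" + re.escape(term) + r"\b"
    return re.search(pattern, text) is not None


# English: a space or the start/end of the string provides a boundary.
print(matches_as_whole_word("function", "This function works"))  # True
print(matches_as_whole_word("function", "A malfunction occurred"))  # False

# Japanese: the neighbors of the term are also word characters,
# so there is no boundary and the term is never reported.
print(matches_as_whole_word("機能", "この機能は便利です"))  # False
```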
What version of the product are you using? On what operating system?
Checkmate version 0.26
I would appreciate your support.
Comments (6)
- attached SampleSource.txt.ttx
- attached SampleBlacklist.txt
Comment 2. originally posted by tsuruku... on 2015-02-05T13:51:32.000Z:
Thank you for your quick reply.
I have attached two simple sample files.
One is a Trados .ttx file, the other is a blacklist in UTF-8.
Comment 3. originally posted by tsuruku... on 2015-02-12T14:34:12.000Z:
Is my request hard to follow?
If you have any questions or need more samples, I am happy to help.
- changed status to open
Comment 4. originally posted by @fliden on 2015-02-13T00:37:04.000Z:
Hi there, Yves is out of the office until next week but I'm planning to take a look at it tomorrow.
- changed status to resolved
Comment 5. originally posted by @fliden on 2015-02-14T03:02:29.000Z:
OK, this snapshot has an option to match blacklist terms even when they are substrings.
Comment 6. originally posted by @ysavourel on 2015-02-19T03:31:47.000Z:
Comment 1. originally posted by @ysavourel on 2015-02-04T14:00:47.000Z:
The blacklist checker doesn't use a regex like \b, but it does check character types. So the algorithm needs to be updated to work with Japanese, etc.
Could you provide an example in Japanese of a translated string and several blacklisted terms, so we can try to tweak the code? Thanks.
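
For readers following along, here is a rough Python sketch of a character-type boundary check of the kind described in this comment, with a substring option like the one mentioned in comment 5. It is not the Okapi implementation. The function name `find_blacklisted` and the `allow_substrings` parameter are assumptions made for illustration, and the real checker may classify characters differently.

```python
def find_blacklisted(text, terms, allow_substrings=False):
    """Return (term, index) pairs for each blacklisted term found in `text`.

    Hypothetical sketch: by default a hit counts only when the characters
    just before and just after it are not letters or digits. With
    allow_substrings=True, every occurrence counts.
    """
    hits = []
    for term in terms:
        if not term:
            continue  # ignore empty entries in the blacklist
        start = text.find(term)
        while start != -1:
            end = start + len(term)
            # Character-type check on the neighbors of the match.
            before_ok = start == 0 or not text[start - 1].isalnum()
            after_ok = end == len(text) or not text[end].isalnum()
            if allow_substrings or (before_ok and after_ok):
                hits.append((term, start))
            start = text.find(term, start + 1)
    return hits


# Kana and kanji are classified as letters, so the default check
# rejects a term embedded in a Japanese sentence.
print(find_blacklisted("この機能は便利です", ["機能"]))  # []
print(find_blacklisted("この機能は便利です", ["機能"], allow_substrings=True))  # [('機能', 2)]
```

The trade-off of the substring option is that it also reports "function" inside "malfunction", so it is probably best kept as an opt-in setting for languages written without spaces.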