QS2 can sometimes fail to remove statements when the values are strings that contain diacritics

Issue #88 new
Jheald created an issue

I have noticed this now a couple of times, trying to remove statements with strings that contain umlauts.

I suspect what is happening is that there may be more than one sequence of bytes that can represent a character with umlauts, and for some reason the sequence I'm presenting isn't exactly matching what QS2 is seeing from Wikidata, even though the screen display appears identical. (I found I also had some problems with this in Perl).
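A minimal Python sketch of the suspected cause, assuming the mismatch is between Unicode normalization forms (precomposed NFC versus decomposed NFD). The sample strings and the helper name `same_text` are illustrative only; nothing here is taken from QS2's code.

```python
import unicodedata

# Two strings that render identically on screen.
composed = "H\u00fcbener"     # "ü" as a single precomposed code point (NFC)
decomposed = "Hu\u0308bener"  # "u" followed by a combining diaeresis (NFD)

# A plain comparison sees different code points, hence different bytes.
print(composed == decomposed)      # False
print(composed.encode("utf-8"))    # contains the bytes c3 bc
print(decomposed.encode("utf-8"))  # contains the bytes 75 cc 88


def same_text(a: str, b: str) -> bool:
    """Compare two strings after normalizing both to NFC (hypothetical helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(same_text(composed, decomposed))  # True
```

If this is what is going on, normalizing both the submitted value and the stored value to the same form before comparing should be enough for them to match.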

For example
-Q51507396 P2093 "Johann Wilhelm Peter Hübener" failed to remove this author string; so I had to go back and do it by hand - diff

If possible, it would be nice if QS2 could try a match without diacritics as a fall-back, if the initial match to remove a string produces an error.
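One way the requested fall-back could look, sketched in Python under stated assumptions: the function names `strip_diacritics` and `find_statement_value` are hypothetical, the list of existing values stands in for whatever QS2 reads from the item, and none of this reflects how QS2 is actually implemented. The sketch tries an exact match first, then a match after Unicode normalization, and only then a match with diacritics removed.

```python
import unicodedata
from typing import Optional, Sequence


def strip_diacritics(text: str) -> str:
    """Decompose to NFD, then drop combining marks (hypothetical helper)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def find_statement_value(target: str, existing: Sequence[str]) -> Optional[str]:
    """Return the stored value that should be removed, or None if unsure."""
    # 1. Exact match on the raw strings.
    for value in existing:
        if value == target:
            return value

    # 2. Match after normalizing both sides to the same Unicode form.
    nfc_target = unicodedata.normalize("NFC", target)
    for value in existing:
        if unicodedata.normalize("NFC", value) == nfc_target:
            return value

    # 3. Fall back to a diacritic-insensitive match, but only when it is
    #    unambiguous, so that the wrong statement is never removed.
    bare_target = strip_diacritics(target)
    candidates = [v for v in existing if strip_diacritics(v) == bare_target]
    return candidates[0] if len(candidates) == 1 else None


stored = ["Johann Wilhelm Peter Hu\u0308bener", "Someone Else"]
print(find_statement_value("Johann Wilhelm Peter H\u00fcbener", stored) == stored[0])
```

Requiring a single candidate in the last step is a deliberate choice: a diacritic-insensitive match is looser than the user's input, so an ambiguous result is better reported as an error than acted on.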