Invisible character breaks categories
Issue #48
new
There are a number of categories with "" in them (that is the invisible unicode character %E2%80%8E). It is very easy to accidentally add a category with this character when copying and pasting, which will then break the tool because it is not a valid category name. I could almost 50 such BaGLAMa pages, and all have simply always shown 0 page views.
Can this character (or any trailing white space characters) be stripped when a user is adding a new category? Or, ideally, validate the user-supplied category names against live Commons and return an error if it isn't a match.
And can all the existing broken BaGLAMa pages be repaired and data regenerated?