Allow tokenization across -

We have a field which contains text entries in the form XX-YYY-####. We currently use the exact text searcher to aggregate a list of how many tickets contain duplicates. We would like to create a filter that would pick out specific YYY values.

The current behavior with the contains operator (~) seems to only work when the YYY value is listed as regular text (as a separate word) in the field, but not when it is in the standard form. That suggests to me that XX-YYY-#### is being tokenized as a single word, rather than as components. Alternately, if there were a way for contains to search any substring regardless of tokenization would meet our needs.

Concrete Example: "Affected Item" ~ "\"SIS\"" -> Entries with " SIS " in the field, but not entries with "XX-SIS-####"

Comments (8)