Remove and in authorfield in lucene

Create issue
Issue #1730 invalid
Stephan Doerfel created an issue

The word "and" in the author field (used as delim) is currently indexed in lucene, causing a search for "and" to yield almost all posts. Please remove the delimiter from the author field

Comments (7)

  1. Former user Account Deleted

    Commented by telekoma: I changed the DiacriticsLowerCaseFilteringAnalyzer, which is responsible for indexing the data for the full text search, to use a multilanguage stop word list

  2. Stephan Doerfel reporter

    Not knowing about the specifics of DiacriticsLowerCaseFilteringAnalyzer: Please make sure, that no parts of the Names are being removed. Instead, only the delimiter and should be removed. Thus an author named Zu should not be removed because "zu" is a german stopword.

  3. Former user Account Deleted

    Commented by telekoma: That would be the case, BUT only when you do a full text search. We want to use the stop words in the full text search only since it often returns a great amount of results. The specific search for an author via e.g. http://www.bibsonomy.org/author/zu won't be affected. Indeed it would be some kind of compromise

  4. Log in to comment