1. Matt Chaput
  2. whoosh

Commits

Show all
Author Commit Message Date Builds
Matt Chaput
Added tag 2.7.1 for changeset 1bd4b9792eed
Matt Chaput
Bumped version number for bugfix release.
Tags
2.7.1
Matt Chaput
Removed accidentally committed debug prints. Fixes issue #434.
Matt Chaput
Fix test accidentally left with assert False at the end.
Branches
nextgen
Matt Chaput
Remove docstring chickenscratch.
Branches
nextgen
Matt Chaput
Add IDEA's .cache directory to hgignore.
Branches
nextgen
Matt Chaput
Add IDEA's .cache directory to hgignore.
Matt Chaput
Replaced porter stemming algorithm implementation with one based on the one in NLTK. Fixes issue #390.
Branches
nextgen
Matt Chaput
Write offsets in VarBytesColumn when there are more than a certain number of rows. Deriving the offsets from the lengths can become very slow when a column has many rows. Many thanks to Christian Jacobsen! Fixes issue #393.
Matt Chaput
Fix forward-compatibility issue for Python 3.x.
Matt Chaput
Initial unfinished, massive checkin of next-gen architecture.
Branches
nextgen
Matt Chaput
Added a test for pickling a schema with a stemming analyzer.
Matt Chaput
Fix reporting of total count in FilterCollector, based on PR #63 by Jannon Frank. Set the 'collector' attribute of the results objects to be the FilterCollector. Removed some code in FilterCollector.get_count() that subtracts the filtered_count from the child's results, because for collectors that compute the count, the FilterCollector has already only told them to collect on valid filtered items. Also a minor PEP8 fix in collectors.py.
Matt Chaput
Merged in dan_black/whoosh/rmdir-gone-no-problem (pull request #49) Don't raise an error is we try to remove a directory that doesn't exist.
Daniel Black
IOError from rmdir is ok if the error was ENOENT
Daniel Black
Reorder the self._tempstorage.destroy() in SegmentWriter._finish to before the lock is released
Matt Chaput
Merged in dalbani/whoosh/dalbani/fix-sample-highlight-class-1447100867681 (pull request #66) Pass replace argument to get_text. Thanks Damiano!
Damiano Albani
Fix sample highlight class
Matt Chaput
Merged in nijel/whoosh/spanish-tokenizer (pull request #67) Spanish stemmer fixes. Thanks Michal!
Michal Čihař
Add sanity check for Spanish stemmer The word can be just one character long here.
Michal Čihař
Created new branch spanish-tokenizer
Matt Chaput
Merged in nijel/whoosh/romanian-stemmer (pull request #68) Romanian stemmer and unicode. Thanks Michal!
Michal Čihař
Skip ISO-8859-1 suffixes on Unicode strings When processing unicode strings, this fails with: File "/usr/lib/python2.7/dist-packages/whoosh/lang/snowball/romanian.py" line 227 in stem if word.endswith(suffix): UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal not in range(128)
Branches
romanian-stemmer
Michal Čihař
Created new branch romanian-stemmer
Branches
romanian-stemmer
Matt Chaput
Merged in assem.ch/whoosh-2/assem.ch/a-typo-in-whooshfieldskeyword-documentat-1452698455601 (pull request #69) A typo in `whoosh.fields.KEYWORD` documentation. Thanks assem!
Assem Chelli
A typo in `whoosh.fields.KEYWORD` documentation in initialization, he parameter name is `commas` instead of `comma`.
Matt Chaput
Merged in wilberforce/whoosh (pull request #62) Set analyzer on IDLIST field.
wilberforce
Backed out changeset 96255fc8ff17 Didn't mean for this to be included in my pull request to the base repo.
wilberforce
Change the version of my fork to be distinct from the base repo. This will mean that when we change the requirement back to the original repo dev VMs will update the library automatically.
wilberforce
Reinstate fields.IDLIST's analyzer. This was removed in 95da3b30a0aa, but if the field has no analyzer adding documents fails with "TypeError: 'NoneType' is not callable".
  1. Prev
  2. Next