quake0day / whoosh

Commits

Author Commit Message Date Builds
Matt Chaput
Added test/tmp, .coverage, and nosetests.xml to ignore list.
Matt Chaput
Change ArrayValues to take the array type as an argument.
Matt Chaput
Commented out utf16 and utf32 encoders; they're causing problems on Shining Panda.
Matt Chaput
More fixes to unicode handling in FSA/FST. Fixed lots of bytes/unicode and other Python 3 compatibility issues. Unit tests pass on Python 3.
Matt Chaput
Fixed FST code to work with unicode strings and Py3 bytes objects.
Matt Chaput
Fixed max-line-length
Matt Chaput
Added pylint configuration file to source control.
Matt Chaput
Added tag 2.3.2 for changeset 303cef16ed5e
Branches
cleanup2.4
Matt Chaput
Bumped version to 2.3.2.
Tags
2.3.2
Branches
cleanup2.4
Matt Chaput
Fixed problem with range query parsing regex (was too greedy across multiple ranges). Fixes issue #212.
Branches
cleanup2.4
Matt Chaput
Fixed typo in PyStemmerFilter unit test.
Branches
cleanup2.4
Matt Chaput
Fixed mistyped formula in BM25 function. Many, many thanks to BB user leifos for this. Fixes issue #209.
Branches
cleanup2.4
Matt Chaput
Fixed bugs in Regex private methods. Thanks Thomas! Fixes issue #206.
Branches
cleanup2.4
Matt Chaput
Fixed incorrect signature/behavior of Regex._get_pattern(). Thanks Thomas! (This seems like it was leftover from design explorations.) Fixes issue #205.
Branches
cleanup2.4
Matt Chaput
Fixed obsolete references to MultiPool.postingqueue. Thanks Thomas! Fixes issue #203.
Branches
cleanup2.4
Matt Chaput
Fixed obsolete reference to fileindex._create_index().
Matt Chaput
Incorporated fixes for PyStemmerFilter from Jan Schrewe (BB pull request #7). Fixes issue #208.
Branches
cleanup2.4
Matt Chaput
Naming version 2.4 cleanup branch.
Branches
cleanup2.4
Matt Chaput
Doc fixes from Ollie Rutherfurd (BB pull request #6).
Matt Chaput
Experimented with intersection meta-cursor. Added GraphReader.find_path() and edited within() to use it. Converted Values methods to staticmethods.
Matt Chaput
Removed dumb method that was accidentally committed.
Matt Chaput
Added skip_to() method to cursor, renamed some methods.
Matt Chaput
Added dawg.Cursor object for navigating graphs quickly, moved high-level methods from GraphReader onto it.
Matt Chaput
Bugfixes and tests for FST features. Fixed a typo in whoosh.compat. Changed node writing from a node method to a writer method.
Matt Chaput
Reimplemented word graph code to be faster and use less memory. Added (untested) FST code. New GraphWriter/GraphReader keeps track of multiple roots instead of using the field name as the first key. Removed ability to keep a word graph in memory only. Moved low-level DAWG tests to new unit test module. Fixed bugs in iter_items/iter_postings.
Matt Chaput
Big refactoring to make filedb use a pluggable codec for writing and reading to disk. This is still unstable. Multiprocessing isn't done and it might not work with old indices.
Matt Chaput
- More work on new multiprocessing writer
- Removed max-WOL statistic.
Matt Chaput
Forgot to add externalsort module to last commit.
Matt Chaput
- In process of replacing multiprocessing writers -- non-functional in this commit.
- Min/max/total field lengths now stored in lengths file instead of segment.
- Greatly simplified file indexing code.
- Simplified testing objects.
- Replaced filedb.pools with support.externalsort.
- Decoupled segments from generations, in anticipation of future work.
- Rewrote "separate spelling" code path.
- Replaced LRU decorator with simpler "double barrel" implementation.
- Added create_temp…
Matt Chaput
Added object identity comparison to BuildNode.__eq__().
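
The range query fix listed above (issue #212) describes a regex that was "too greedy across multiple ranges". As a rough illustration of that class of bug, and not the actual Whoosh parser code, the sketch below uses a made-up bracketed range syntax and two hypothetical patterns to show how a greedy quantifier can swallow two ranges as one, while a non-greedy quantifier keeps them separate.

```python
import re

# Illustrative only: this range syntax and both patterns are assumptions,
# not the regex used by Whoosh's query parser.
text = "[a TO b] [c TO d]"

# Greedy: the first (.*) runs as far as it can, so it spans both ranges.
greedy = re.compile(r"\[(.*) TO (.*)\]")

# Non-greedy: (.*?) stops at the first possible " TO " and "]".
lazy = re.compile(r"\[(.*?) TO (.*?)\]")

greedy_matches = greedy.findall(text)
lazy_matches = lazy.findall(text)

print(greedy_matches)  # one match covering both ranges
print(lazy_matches)    # one match per range
```

With the greedy pattern the two ranges collapse into a single match whose start value contains the whole first range; the non-greedy pattern yields one (start, end) pair per range.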