Commits

Author Commit Message Labels Comments Date
Matt Chaput
Fixed bugs in Regex private methods. Thanks Thomas! Fixes issue #206.
Branches
cleanup2.4
Matt Chaput
Fixed incorrect signature/behavior of Regex._get_pattern(). Thanks Thomas! (This seems like it was leftover from design explorations.) Fixes issue #205.
Branches
cleanup2.4
Matt Chaput
Fixed obsolete references to MultiPool.postingqueue. Thanks Thomas! Fixes issue #203.
Branches
cleanup2.4
Matt Chaput
Fixed obsolete reference to fileindex._create_index().
Matt Chaput
Incorporated fixes for PyStemmerFilter from Jan Schrewe (BB pull request #7). Fixes issue #208.
Branches
cleanup2.4
Matt Chaput
Naming version 2.4 cleanup branch.
Branches
cleanup2.4
Matt Chaput
Doc fixes from Ollie Rutherfurd (BB pull request #6).
Matt Chaput
Experimented with intersection meta-cursor. Added GraphReader.find_path() and edited within() to use it. Converted Values methods to staticmethods.
Matt Chaput
Removed dumb method that was accidentally committed.
Matt Chaput
Added skip_to() method to cursor, renamed some methods.
Matt Chaput
Added dawg.Cursor object for navigating graphs quickly, moved high-level methods from GraphReader onto it.
Matt Chaput
Bugfixes and tests for FST features. Fixed a typo in whoosh.compat. Changed node writing from a node method to a writer method.
Matt Chaput
Reimplemented word graph code to be faster, use less memory. Added (untested) FST code. New GraphWriter/GraphReader keeps track of multiple roots instead of using the field name as the first key. Removed ability to keep a word graph in memory only. Moved low-level DAWG tests to new unit test module. Fixed bugs in iter_items/iter_postings.
Matt Chaput
Big refactoring to make filedb use a pluggable codec for writing and reading to disk. This is still unstable. Multiprocessing isn't done and it might not work with old indices.
Matt Chaput
- More work on new multiprocessing writer - Removed max-WOL statistic.
Matt Chaput
Forgot to add externalsort module to last commit.
Matt Chaput
- In process of replacing multiprocessing writers -- non-functional in this commit. - Min/max/total field lengths now stored in lengths file instead of segment. - Greatly simplified file indexing code. - Simplified testing objects. - Replaced filedb.pools with support.externalsort. - Decoupled segments from generations, in anticipation of future work. - Rewrote "separate spelling" code path. - Replaced LRU decorator with simpler "double barrel" implementation. - Added create_temp…
Matt Chaput
Added object identity comparison to BuildNode.__eq__().
Matt Chaput
Renamed NestedDocument -> Nested, cleaned up code, added tests.
Branches
nested
Matt Chaput
Added IndexWriter.group/start_group/end_group, fixed up NestedDocument query. Scoring system and Query.matcher() now allow more scoring flexibility.
Branches
nested
Matt Chaput
Merge with mainline.
Branches
nested
Matt Chaput
Added tag 2.3.1 for changeset 19c2df0a94ef
Matt Chaput
Bumped version to 2.3.1.
Tags
2.3.1
Matt Chaput
Fixed logic in set_caching_policy() method.
Matt Chaput
Fixed handling of Q codes in fieldcaches. Thanks Thomas! Fixes issue #200.
Matt Chaput
Added tag 2.3 for changeset 5d1064ce4c85
Matt Chaput
Updated release notes, bumped version number to 2.3.0.
Tags
2.3
Matt Chaput
Added hyphen to the list of word chars in the wildcard regex to make the result more predictable. A true fix will have to wait for a more advanced parser.
Matt Chaput
Merging branches.
Matt Chaput
Added functions for merging Schema objects.
  1. Prev
  2. Next