Commits

Author Commit Message Labels Comments Date
Matt Chaput
Minor formatting, changed default block size.
Matt Chaput
Fixed directory deletion.
Matt Chaput
Fixed docstring. Changed Index.searcher() to pass keyword arguments to the Searcher constructor. Added Index.unlock() before cleaning old files when create = True.
Matt Chaput
Clarified docstring, removed obsolete attribute.
Matt Chaput
First version of a setup.py script for Whoosh.
Matt Chaput
Docstring cleanup.
Matt Chaput
highlight.py: cleaned up some dumb decisions. qparser.py: clarified comment.
Matt Chaput
"Protected" methods need to be locked with an RLock, not a Lock.
Matt Chaput
Returned to a Lucene-like highlighting system in highlight.py. Removed code from passages.py.
Matt Chaput
Small changes for simplification and consistency.
Matt Chaput
Changed from_ back to iter_from.
Matt Chaput
Changed "checkclosed" decorator to "protected" and added thread synchronization to it.
Matt Chaput
Changed the way Multi*Reader objects are instantiated, to make it easier to use hierarchical Readers.
Matt Chaput
analysis.py: - Consolidated CommaSeparatedAnalyzer and SpaceSeparatedAnalyzer into KeywordAnalyzer. - Default is now for StopFilter to remove stopped words from the token stream. classify.py: - Updated code to work with Results object implementation of important_terms(). fields.py: - unstopped() wrapper removes stopped words from token stream before indexing. index.py: - Minor fixes. passages.py: - Rewrite to revert back to my original conception of highlighting, backtracking from using the Minion code. Not finished yet. tables.py: - Fixed copy_data -- should have had separate arguments for the key on the incoming table and the key on the outgoing table. writing.py: - Fixed logic on when to call _merge_segments. - Minor style cleanup. - Changed implementation of sorting in scoring.py and searching.py. Scoring/sorting now happens in the Searcher instead of the Results. - @checkclosed decorator on methods checks whether the parent object has been closed before proceeding. - Minor docstring formatting cleanup.
Matt Chaput
Minor formatting.
Matt Chaput
Removed unused import.
Matt Chaput
Changed vector_as to use Format.interpreter(). Minor docstring formatting.
Matt Chaput
Removed unused import.
Matt Chaput
Added experimental CharacterBoosts format, cleaned up PositionBoosts. Added Format.interpreter() method to return data_to_X method. Minor docstring formatting.
Matt Chaput
Minor formatting change.
Matt Chaput
Minor docstring fixes. Moved some __x__ methods. Removed obsolete code.
Matt Chaput
OK, so it turns out inheriting from (object, Mixin) is not right...
Matt Chaput
Fixed up docstrings. Changed uses of isinstance(x, type) to callable(x) for flexibility. util.ClosableMixin makes classes with a close() method into context managers. store.py: made Storage.lock() return True on success. postpool.py: commented out MemoryPool. qparser.py: improved handling of analyzers. query.py: Variations now computes variants on-the-fly. searching.py: Fixed up __getitem__.
Matt Chaput
Broke deletion methods out into SupportsDeletion mixin. IndexWriter now has commit() and cancel() methods instead of close().
Matt Chaput
analysis.py: Fixed stop filter handling. fields.py: Removed duplicate methods. index.py: Fixed loading of indexes. test_tables.py: Removed obsolete test. Changed implementation of copying raw posting data to allow buffered copying. Moved creating and opening tables into the storage implementation (part of an experiment to store indexes in a SQLite database). Renamed field_words() to lexicon(). Renamed iter_from() to from_(). Commented out "record table" implementations.
Matt Chaput
Removed unused file.
Matt Chaput
classify.py: minor cleanup. spelling.py: SpellChecker.add_field() now uses the term count as the word score.
Matt Chaput
Changed Weighting implementation so instead of keeping a reference to the searcher in the Weighting object, the searcher is passed as an argument to Weighting.score(). This solves the chicken-and-egg problem, where the searcher wanting a reference to the weighting and vice-versa, better than the previous inelegant and inflexible Weighting.set_searcher().
Matt Chaput
query.py: cleaned up code writing.py: added guard for non-enumerated mergetype values. tests/*.py: fixed tests to match previous changes.
Matt Chaput
Removed file accidentally added to SVN.
  1. Prev
  2. Next