1. Matt Chaput
  2. whoosh

Commits

Author Commit Message Date Builds
Matt Chaput
Clarified docstring, removed obsolete attribute.
Matt Chaput
First version of a setup.py script for Whoosh.
Matt Chaput
Docstring cleanup.
Matt Chaput
highlight.py: cleaned up some dumb decisions. qparser.py: clarified comment.
Matt Chaput
"Protected" methods need to be locked with an RLock, not a Lock.
Matt Chaput
Returned to a Lucene-like highlighting system in highlight.py. Removed code from passages.py.
Matt Chaput
Small changes for simplification and consistency.
Matt Chaput
Changed from_ back to iter_from.
Matt Chaput
Changed "checkclosed" decorator to "protected" and added thread synchronization to it.
Matt Chaput
Changed the way Multi*Reader objects are instantiated, to make it easier to use hierarchical Readers.
Matt Chaput
analysis.py: - Consolidated CommaSeparatedAnalyzer and SpaceSeparatedAnalyzer into KeywordAnalyzer. - Default is now for StopFilter to remove stopped words from the token stream. classify.py: - Updated code to work with Results object implementation of important_terms(). fields.py: - unstopped() wrapper removes stopped words from token stream before indexing. index.py: - Minor fixes. passages.py: - Rewrite to revert back to my original conception of highlighting, backtracking from using the Minion code. Not finished yet. tables.py: - Fixed copy_data -- should have had separate arguments for the key on the incoming table and the key on the outgoing table. writing.py: - Fixed logic on when to call _merge_segments. - Minor style cleanup. - Changed implementation of sorting in scoring.py and searching.py. Scoring/sorting now happens in the Searcher instead of the Results. - @checkclosed decorator on methods checks whether the parent object has been closed before proceeding. - Minor docstring formatting cleanup.
Matt Chaput
Minor formatting.
Matt Chaput
Removed unused import.
Matt Chaput
Changed vector_as to use Format.interpreter(). Minor docstring formatting.
Matt Chaput
Removed unused import.
Matt Chaput
Added experimental CharacterBoosts format, cleaned up PositionBoosts. Added Format.interpreter() method to return data_to_X method. Minor docstring formatting.
Matt Chaput
Minor formatting change.
Matt Chaput
Minor docstring fixes. Moved some __x__ methods. Removed obsolete code.
Matt Chaput
OK, so it turns out inheriting from (object, Mixin) is not right...
Matt Chaput
Fixed up docstrings. Changed uses of isinstance(x, type) to callable(x) for flexibility. util.ClosableMixin makes classes with a close() method into context managers. store.py: made Storage.lock() return True on success. postpool.py: commented out MemoryPool. qparser.py: improved handling of analyzers. query.py: Variations now computes variants on-the-fly. searching.py: Fixed up __getitem__.
Matt Chaput
Broke deletion methods out into SupportsDeletion mixin. IndexWriter now has commit() and cancel() methods instead of close().
Matt Chaput
analysis.py: Fixed stop filter handling. fields.py: Removed duplicate methods. index.py: Fixed loading of indexes. test_tables.py: Removed obsolete test. Changed implementation of copying raw posting data to allow buffered copying. Moved creating and opening tables into the storage implementation (part of an experiment to store indexes in a SQLite database). Renamed field_words() to lexicon(). Renamed iter_from() to from_(). Commented out "record table" implementations.
Matt Chaput
Removed unused file.
Matt Chaput
classify.py: minor cleanup. spelling.py: SpellChecker.add_field() now uses the term count as the word score.
Matt Chaput
Changed Weighting implementation so instead of keeping a reference to the searcher in the Weighting object, the searcher is passed as an argument to Weighting.score(). This solves the chicken-and-egg problem, where the searcher wanting a reference to the weighting and vice-versa, better than the previous inelegant and inflexible Weighting.set_searcher().
Matt Chaput
query.py: cleaned up code writing.py: added guard for non-enumerated mergetype values. tests/*.py: fixed tests to match previous changes.
Matt Chaput
Removed file accidentally added to SVN.
Matt Chaput
Added TODO comment.
Matt Chaput
Changed integer constants to classes.
Matt Chaput
analysis.py: cleaned up a bit of style. query.py: updated references to searcher.doc_count to doc_count_all(). reading.py: updated reference to field to field.format. scoring.py: updated reference to self.fieldnum in BM25F. searching.py: Removed unused import. Fixed up Searcher interface to match its sub-objects. Added Searcher.stored_fields() method. writing.py: Used constants for merge policy argument to IndexWriter.close() …
  1. Prev
  2. Next