Commits

Author Commit Message Labels Comments Date
Matt Chaput
Added logic to protect SpanWrappingMatcher against trying to get the spans when the underlying matcher is inactive.
Matt Chaput
Added more reasonable error messages for methods on non-indexed fields.
Matt Chaput
Fixed error in variations() function affecting words eding in "e". Made simplify() return normalized results.
Matt Chaput
Bumped version and changed index format to account for changes due to index compression work.
Matt Chaput
Term index and vector files now use 16-bit codes instead of full field names.
Matt Chaput
Result[] now returns a Hit object. Switched tf back to float in case it messed with quality optimizations.
Matt Chaput
Changes to reduce index size, see issue #47. Miscellaneous fixes and improvements.
Matt Chaput
Added compression of filedb posting data.
Matt Chaput
Added caching to FileWriter.update_document(). Increased version number.
Matt Chaput
Adding Eclipse .settings dir to .hgeclipse.
Matt Chaput
Added unit test for Facets object.
Matt Chaput
Fix for counts() and categorize() after changed array to a dict.
Matt Chaput
Changed interface of Facets object to take the Searcher at instantiation.
Matt Chaput
Check that the matcher is still active before calling _get_spans() in SpanWrappingMatcher. Fixes issue #44.
Matt Chaput
Added spans() method to WrappingMatcher base class. Fixes issue #43.
Matt Chaput
Fixed typo in Weighting compatibility class. Fixes issue #42.
Matt Chaput
Bumped version number.
Matt Chaput
Merging branches.
Matt Chaput
Added clustering functions to classify module for future use.
Matt Chaput
Made Searcher a context manager (to close itself).
Matt Chaput
Fixed missing spans() method from MultiMatcher.
Matt Chaput
Fixed term range parsing. Bumped version number.
Matt Chaput
Bumped version number.
Matt Chaput
Instead of simply sorting the collected heap in reverse, sort by reversed score and then by forward document number.
Matt Chaput
Implemented date range parsing.
Matt Chaput
Fixed up dateparse for simple dates (no ranges yet).
Matt Chaput
Shouldn't have used random.sample() since it only picks items from the list once.
Matt Chaput
Less obtuse implementation of unique_name() function.
Matt Chaput
Additonal work on date query parsing.
Matt Chaput
Checking in initial infrastructure to support parsing date queries.
  1. Prev
  2. Next