- marked as enhancement
Adjust the near-duplicate detection algorithm
Issue #2
new
Take the size of the texts into account when calculating the similarity.
If the texts are below a certain token count, use a different measure, for instance the Gestalt approach (Ratcliff/Obershelp pattern matching).
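A minimal sketch of the proposed behaviour, not the project's actual implementation. The issue does not say which measure is currently used, so token-set Jaccard stands in for it here. The names `similarity`, `jaccard`, and `SHORT_TEXT_TOKENS`, the threshold value of 20, and the choice to switch when either text is short are all assumptions. Python's `difflib.SequenceMatcher` provides a gestalt-style ratio for the short-text case.

```python
from difflib import SequenceMatcher

# Hypothetical cut-off; the issue only says "a certain token size".
SHORT_TEXT_TOKENS = 20


def jaccard(tokens_a, tokens_b):
    """Token-set Jaccard similarity (assumed stand-in for the existing measure)."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def similarity(text_a, text_b, short_threshold=SHORT_TEXT_TOKENS):
    """Pick the similarity measure based on the length of the texts."""
    tokens_a, tokens_b = text_a.split(), text_b.split()
    # Assumption: switch measures as soon as either text is short.
    if min(len(tokens_a), len(tokens_b)) < short_threshold:
        # Character-level gestalt-style ratio: 2 * matches / total length.
        return SequenceMatcher(None, text_a, text_b).ratio()
    return jaccard(tokens_a, tokens_b)
```

With this split, two short strings such as `"abcd"` and `"bcde"` are compared character by character, while longer documents still go through the set-based measure. The threshold would need tuning against real near-duplicate pairs.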
Comments (2)