TExtractor

Usage:

>>> from textractor import TExtractor
>>> extractor = TExtractor()
>>> extractor.index('test.docx')
[u'workflow_history', u'portal_workflow', u'review_history',
 u'implementation', u'organizations', u'Illustrations', ...]
>>> extractor.index('test.pdf')
[u'workflow_history', u'portal_workflow', u'review_history',
 u'implementation', u'organizations', u'Illustrations', ...]
>>>
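
The session above shows index() returning a list of terms for each file. As a minimal sketch of one way to combine those lists across several files, the helper below maps each term to the files it was found in. It relies only on the index(path) call shown above. build_term_map and FakeExtractor are hypothetical names introduced for illustration and are not part of textractor. FakeExtractor exists only so the sketch runs without the library installed.

```python
from collections import defaultdict


def build_term_map(extractor, paths):
    """Map each term to the sorted list of files it was extracted from.

    `extractor` is assumed to be any object with an `index(path)` method
    that returns an iterable of terms, as in the usage session above.
    """
    term_to_files = defaultdict(set)
    for path in paths:
        for term in extractor.index(path):
            term_to_files[term].add(path)
    # Convert the sets to sorted lists so the result is deterministic.
    return {term: sorted(files) for term, files in term_to_files.items()}


class FakeExtractor(object):
    """Hypothetical stand-in for TExtractor, used only to run this sketch."""

    def __init__(self, terms_by_path):
        self._terms_by_path = terms_by_path

    def index(self, path):
        # Return the canned term list for the given path.
        return self._terms_by_path[path]


if __name__ == '__main__':
    fake = FakeExtractor({
        'a.docx': [u'workflow_history', u'organizations'],
        'b.pdf': [u'workflow_history', u'Illustrations'],
    })
    term_map = build_term_map(fake, ['a.docx', 'b.pdf'])
    print(term_map[u'workflow_history'])
```

With the real library, passing a TExtractor() instance in place of FakeExtractor should work the same way, provided index() behaves as in the session above.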