Overview

Description

A python application, that helps to classify text document between scientific and non-scientific. SVM is used as classifier.

Necessary components

Python, WxPython, scikit-learn, Gedit.

Ubuntu / Mint

$ sudo apt-get install python python-wxtools python-sklearn gedit

How to run

$ python main.py

Presentation (MIPT conference)

Site, presentation

Tests

Tests help me to select best features and improve classification quality.

Screenshots

MainFrame

PLotDialog