This is the wiki page for the Law and Public Policy Lab text analysis platform. The goal is an integrated text analysis platform for social science research. This is an integrated system for storage, management, and analysis of text data and social-science data.
Feature Extraction (/features/)
Unsupervised Dimension Reduction (/unsupervised/)
Supervised Dimension Reduction and Prediction (/supervised/)
Word clouds, topic clusters, binned scatter plots, etc.
Databases on sqlite, postgreSQL and table functionalities on postgres
We need to work on a test suite. This should include measures of fit, cross validation, perplexity, etc.
Eventually, we would like to do a GUI. This should be able to pull Google N-grams, for example.