The Allen AI Science Challenge
Pham Quang Nhat Minh
Competition homepage: https://www.kaggle.com/c/the-allen-ai-science-challenge.
We run the model on Mac OS enviroment. Memory: 16GB, 2.6 GHz Core i7.
The program use the following softwares for text analysis and information retrieval.
How to run the model on the test set
Indexing the text collection
Run the script indexing.sh in the root directory of the project
Download Stanford CoreNLP version 3.6.0 for text analysis
Go to the directory ./tools and run the shell script download.sh
Run the model on the test data set
Given the test data with the same format as validation data set (without information about correct answers), under the root directory of the project, run the shell script run.sh with the filename of the test data as its argument.
sh run.sh <test_data_file>