Leandro Nunes / ospa-crawler

OSPA-Crawler

This is a Scrapy-based crawler for the OSPA[1] (Orquestra Sinfônica de Porto Alegre) website[2].

[1] http://en.wikipedia.org/wiki/Orquestra_Sinfônica_de_Porto_Alegre
[2] http://www.ospa.org.br/

----------------------------------------------------

Boot the infrastructure (using Vagrant):

$ vagrant up
$ vagrant ssh

Dependencies:

$ cd /vagrant
$ sh install_dependencies.sh

Usage:

$ cd /vagrant/ospa
$ scrapy crawl ospa -o items.json -t json

The output is written to the items.json file.
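
Reading the output:

The following is a minimal sketch of how the exported file could be loaded for further processing. It assumes items.json holds a single JSON array of objects, which is what Scrapy's JSON export produces for one run. The helper name load_items is hypothetical and is not part of this repository. The field names depend on the spider's item definition, so the sketch only lists them instead of assuming any.

```python
import json
from pathlib import Path


def load_items(path="items.json"):
    """Return the scraped items as a list (hypothetical helper)."""
    # Assumption: the file contains one JSON array, as written by a single
    # `scrapy crawl ... -o items.json` run.
    with Path(path).open(encoding="utf-8") as fh:
        items = json.load(fh)
    if not isinstance(items, list):
        raise ValueError(f"expected a JSON array in {path}")
    return items


if __name__ == "__main__":
    items = load_items()
    print(f"{len(items)} items scraped")
    # Field names are not documented here, so just show what is present.
    if items:
        print("fields:", sorted(items[0].keys()))
```

Saved under a file name of your choice in /vagrant/ospa, it can be run with python after the crawl has finished.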