Overview

How often are certain political figures mentioned in the news?

Keywords: screen scraping, Flask, Python
URL: Website

(The notes below were originally written in Danish.)

A learning project in Flask.

A relatively simple Flask app that, at its most basic, does the following:

  1. Fetches the raw HTML from each of the four online newspapers (Information, Berlingske, Politiken and JP).
  2. Finds occurrences of politicians' names and aliases.
  3. Shows the number of occurrences on a graph.
  4. Saves each occurrence, including its context, in a MongoDB database.
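
Steps 2 to 4 can be sketched in plain Python. This is a minimal illustration, not the project's actual code: the `ALIASES` table, the `find_mentions` and `count_by_name` functions, the context width and the record fields are all assumptions made for the example. Tags are stripped crudely with a regular expression; a real scraper would more likely use an HTML parser.

```python
import re
from collections import Counter

# Hypothetical alias table: canonical name -> spellings to look for.
ALIASES = {
    "Jane Doe": ["Jane Doe", "Doe"],
    "John Smith": ["John Smith", "Smith"],
}

TAG_RE = re.compile(r"<[^>]+>")


def find_mentions(html, source, aliases=ALIASES, context=30):
    """Return one record per alias occurrence found in the page text."""
    text = TAG_RE.sub(" ", html)              # crude tag stripping (assumption)
    text = re.sub(r"\s+", " ", text).strip()  # collapse whitespace
    records = []
    for name, spellings in aliases.items():
        # Longest spelling first, so a spelling that is a prefix of another
        # does not shadow it. Matches never overlap, so "Jane Doe" is counted
        # once and not a second time as "Doe".
        ordered = sorted(spellings, key=len, reverse=True)
        pattern = re.compile(
            r"\b(?:" + "|".join(re.escape(s) for s in ordered) + r")\b"
        )
        for match in pattern.finditer(text):
            start = max(match.start() - context, 0)
            end = match.end() + context
            records.append({
                "name": name,
                "source": source,
                "context": text[start:end],  # surrounding text for step 4
            })
    return records


def count_by_name(records):
    """Number of occurrences per canonical name, e.g. as input for a graph."""
    return Counter(record["name"] for record in records)
```

Each record is a plain dict, which is the shape a MongoDB driver expects. With pymongo, a call along the lines of `collection.insert_many(records)` would store a non-empty list of them; the collection itself is an assumption here.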

At present, the above happens once an hour. This is controlled by a simple cron job that calls get_em.py. Some people are trying to convince me that Celery is not overkill for this.
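
As an illustration of the scheduling, an hourly crontab entry and a matching fetch loop might look like the sketch below. The crontab line, the interpreter and script paths, the placeholder URLs in `SOURCES`, and the `fetch` and `run_once` helpers are all assumptions; the real get_em.py may be organized quite differently. Each source is fetched independently, so that one unreachable newspaper does not abort the whole hourly run.

```python
# Hypothetical crontab entry (minute 0 of every hour):
#   0 * * * * /usr/bin/python3 /path/to/get_em.py
import urllib.request

# Placeholder URLs; the real front-page addresses are not given here.
SOURCES = {
    "information": "https://example.com/information",
    "berlingske": "https://example.com/berlingske",
    "politiken": "https://example.com/politiken",
    "jp": "https://example.com/jp",
}


def fetch(url, timeout=10):
    """Download a page and return it as text."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def run_once(sources, fetcher=fetch):
    """Fetch every source once; return (pages, errors) keyed by source name."""
    pages, errors = {}, {}
    for name, url in sources.items():
        try:
            pages[name] = fetcher(url)
        except OSError as exc:
            # URLError and socket timeouts are OSError subclasses.
            errors[name] = str(exc)
    return pages, errors
```

A script run by cron could then call `run_once(SOURCES)`, pass each fetched page on to the matching step, and log whatever ends up in `errors`.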

If you have ideas, you are welcome to add issues, or even fork the code and make a pull request.

License: Open Source - completely free - MIT license. Take it, use it :-)
