csxj-crawler /

Filename Size Date modified Message
119 B
1.7 KB
659 B
39 B
95 B
997 B

Uses python 2.6

3rd party Dependencies

  • scrapy's HtmlXPathSelector : because any BeautifulSoup-based app is an half-assed implementation of XPath anyway.
  • BeautifulSoup : Quickly navigate data from html pages (legacy, will probably be replaced by scrapy at some point)
  • chardet : useful to fight encoding issues


Still thinking about it. Until then, this is not public domain and I retain full copyright.

Tip: Filter by directory path e.g. /media app.js to search for public/media/app.js.
Tip: Use camelCasing e.g. ProjME to search for
Tip: Filter by extension type e.g. /repo .js to search for all .js files in the /repo directory.
Tip: Separate your search with spaces e.g. /ssh pom.xml to search for src/ssh/pom.xml.
Tip: Use ↑ and ↓ arrow keys to navigate and return to view the file.
Tip: You can also navigate files with Ctrl+j (next) and Ctrl+k (previous) and view the file with Ctrl+o.
Tip: You can also navigate files with Alt+j (next) and Alt+k (previous) and view the file with Alt+o.