1. Kenneth Love
  2. DjangoCon 2011 Notes

File docs/scraping.rst

 * ``etree``: powerful and fast for SOAP or other XML-formatted content
 * ``html``: best for web sites & irregular content
 
-    ``lxml.html``: hidden gems
-    __________________________
+``lxml.html``: hidden gems
+__________________________
 
     ``cssselect``
         uses CSS selector syntax to find and select HTML elements.