1. Olivier Lauzanne
  2. pyquery
  3. Issues
Issue #10 wontfix

pyquery fails without errors when

Jorge Vargas
created an issue

<html xmlns="http://www.w3.org/1999/xhtml">

and the content doesn't validates.

Comments (3)

  1. Julio Biason

    I had a similar problem, although it was not a complete fail.

    With the same problem (it's quite common with Blizzard pages, which I'm trying to scrap information since their API is gone), all tags have a "{http://www.w3.org/1999/xhtml}" prepended to them. So an anchor actually becomes "{http://www.w3.org/1999/xhtml}a", table data is "{http://www.w3.org/1999/xhtml}td" and so on.

    If you try to pass that to the class (e.g. doc("{http://www.w3.org/1999/xhtml}a.class")), it fails saying that "{" is an invalid character.

  2. Log in to comment