Commits

Uche Ogbuji committed 9c44bb5

Minor tweaks to NYT demo

Comments (0)

Files changed (1)

demos/akara/amara/nyt2skos.py

 
 The program basically will GET urls like:
 
-  http://topics.nytimes.com/topics/reference/timestopics/all/[a-z]
+  http://topics.nytimes.com/topics/reference/timestopics/subjects/[a-z]/index.html
 
 and scrape the topics out of them, and then persist the SKOS data as RDF/XML to stdout
+
+Sources are non-well-formed despite XHTML 1.0 DTD:
+
+curl http://topics.nytimes.com/topics/reference/timestopics/subjects/a/index.html | xmllint -
 """
 
 import sys
             (E((SKOS_NAMESPACE, u'skos:broader'), {(RDF_NAMESPACE, u'rdf:resource'): ORGANIZATIONS})) if 'timestopics/organizations' in uri else (),
           )
         )
-    break
 
 cursor.close()