Commits

Charlie Clark committed 3d2eeb5

Handle database updates after importing multiple datasets better.

Comments (0)

Files changed (1)

httparchive/httparchive/scripts/update.py

 INSERT INTO sites (url, added)
 SELECT url, labelDate FROM pages
 WHERE url NOT in
-(SELECT url FROM sites)"""
+(SELECT url FROM sites)
+GROUP BY url
+ORDER BY labelDate"""
 
 update_stats = """
 UPDATE stats