Charlie Clark committed 3d2eeb5

Handle database updates after importing multiple datasets better.
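
A minimal sketch of the problem this commit appears to address, assuming SQLite and a simplified two-column schema (the real httparchive tables and database engine may differ). When two imported datasets contain the same URL, `pages` holds one row per dataset, so the original statement inserts that URL into `sites` once per row; adding `GROUP BY url` collapses them to one row per URL.

```python
import sqlite3

# Hypothetical schema for illustration only; the real tables likely have more columns.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE pages (url TEXT, labelDate TEXT);
CREATE TABLE sites (url TEXT, added TEXT);
""")

# Simulate two imported datasets that both contain http://example.com/.
conn.executemany(
    "INSERT INTO pages (url, labelDate) VALUES (?, ?)",
    [
        ("http://example.com/", "2012-01-01"),
        ("http://example.com/", "2012-02-01"),
        ("http://example.org/", "2012-02-01"),
    ],
)

# Statement as it was before the commit.
old_insert = """
INSERT INTO sites (url, added)
SELECT url, labelDate FROM pages
WHERE url NOT in
(SELECT url FROM sites)"""

# Statement as it is after the commit.
new_insert = old_insert + """
GROUP BY url
ORDER BY labelDate"""


def count_sites_after(statement):
    """Empty `sites`, run the statement once, and return the resulting row count."""
    conn.execute("DELETE FROM sites")
    conn.execute(statement)
    return conn.execute("SELECT COUNT(*) FROM sites").fetchone()[0]


old_count = count_sites_after(old_insert)  # one row per pages row
new_count = count_sites_after(new_insert)  # one row per distinct URL
print(old_count, new_count)
```

Under these assumptions the old statement leaves three rows in `sites` and the new one leaves two. Note that `ORDER BY` is applied after grouping, so it does not by itself decide which `labelDate` is kept for a URL; in SQLite a bare column in a grouped query takes its value from an arbitrary row of the group, and `MIN(labelDate)` would be needed to guarantee the earliest date.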

  • Parent commits ffae84a

Files changed (1)

File httparchive/httparchive/scripts/update.py

 INSERT INTO sites (url, added)
 SELECT url, labelDate FROM pages
 WHERE url NOT in
-(SELECT url FROM sites)"""
+(SELECT url FROM sites)
+GROUP BY url
+ORDER BY labelDate"""
 
 update_stats = """
 UPDATE stats