Commits

Gregory Petukhov committed 3b8c86c

Fix bug in HTML cleaning algorithm - make temporary hack

Comments (0)

Files changed (2)

feedzilla/util/clean.py

                 if 'img/' == elem[0] and 'src' == attr:
                     continue
                 del elem[1][attr]
-    return htmldata.tagjoin(tree)
+    data = htmldata.tagjoin(tree)
+    # Temporary hack
+    # htmldata doing something shitty with html:
+    # tagjoin return invalid DIV
+    # Data for testing: http://py-algorithm.blogspot.com/2011/04/blog-post_3267.html
+    data = normalize_html(data)
+    return data
             data_files.append(os.path.join(prefix, f))
 
 setup(
-    version = '0.1.16',
+    version = '0.1.17',
     description = 'Django application for atom/rss feeds aggregation i.e. planet engine',
     author = 'Grigoriy Petukhov',
     author_email = 'lorien@lorien.name',