Toby Inkster committed abcf187

similar modules on CPAN

Comments (0)

Files changed (1)


+=head2 Similar Modules on CPAN
+=item * L<HTML::FormatText>
+About 15 years old, and still maintained, this falls into the "mature"
+category. This module is based on L<HTML::Tree>, so its HTML parser may
+not behave as closely to modern browsers as HTML::HTML5::Parser's parsing,
+but its conversion to text seems somewhat better than HTML::HTML5::ToText's
+default output (i.e. with no traits applied).
+At the time of writing, its bug queue on lists eight bugs, some
+quite serious. However, since being taken over by its latest maintainer,
+there seems to be progress being made on them.
+Fairly extensible, but not in the mix-and-match traits way allowed by 
+=item * L<HTML::FormatText::WithLinks>
+An extension of HTML::FormatText.
+=item * L<HTML::FormatText::WithLinks::AndTables>
+An extension of HTML::FormatText::WithLinks.
+The code that deals with tables is pretty crude compared with
+HTML::HTML5::ToText::Trait::RenderTables. It doesn't support C<colspan>,
+C<rowspan>, or the C<< <th> >> element. But on the other hand, it does
+support C<align>...
+L<swings and roundabouts|>.
+=item * L<LEOCHARRE::HTML::Text>
+Very basic conversion; basically just tag stripping using regular expressions.
+=item * L<HTML::FormatExternal>
+Passes HTML through external command-line tools such as `lynx`. Obviously
+this has limited portability.
 =head1 AUTHOR
 Toby Inkster E<lt>tobyink@cpan.orgE<gt>.