lnkckr is a link checking library with command-line script
linkcheck. It's intended to be used to check broken links and it can be interrupted at any time, then resumes from saved JSON file.
It supports Python 2.7 and Python 3.
pip install lnkckr
linkcheck [-c CHECKER] [-u|--update] [-f FILE] [-j JSON] [-s STATUS] [-x EXCLUDE_STATUS]
CHECKERis what checker used to process
--updateinstructs lnkckr to update the JSON with input file. Normally, when
-j JSONpresents, lnkckr ignore the input file.
FILEis the input filename or URL.
JSONis the filename of saved progress file. If the
FILEis a filename, then
FILEcan be omitted, an filename is assigned automatically unless using different filename is desired.
STATUSindicates re-check url with specific status.
EXCLUDE_STATUSis a CSV which lists what links with specified status will not be shown in report section. Default is
Here is a sample:
list: input file is a list of links, one URL per line.
html: input file is a HTML file.
blogger: input file is a Blogger XML Export file.
[---]: the url hasn't been checked.
[???]: where the
???is the HTTP status code.
[###]: means the fragment in the URL can't be found in the response body in format of
[SCH]: unsupported HTTP scheme.
[SKP]: the url is skipped.
[RRR]: the url results reaching maximal redirection limit.
[XXX]: unknown error.
: timeout when trying to check the url.
The output of lnkckr is similar to my two-year-old shell script
linkckr.sh, I have also included it in the repository since lnkckr is kind of continuation of
linkckr.sh, here is a blog post about it.
- bea: Blogger Export Analyzer, not directly related, but also uses the Blogger XML Export file.
- announcement: the blog post of lnkckr.
- b.py: Command-line posting script, it uses lnkckr to check links.
lnkckr is licensed under the MIT License, see COPYING. Copyright (C) 2013 by Yu-Jie Lin