- From: churchh@uts.cc.utexas.edu (Henry Churchyard)
- Subject: htmlchek - HTML Error Checker and Utilities, v4.1
- Newsgroups: comp.sources.misc
- Organization: The University of Texas at Austin; Austin, Texas
- Keywords: WWW HTML 2.0 HTML 3.0 Arena Netscape SGML awk perl error checker search-and-replace v.4.1
-
- Submitted-by: churchh@uts.cc.utexas.edu (Henry Churchyard)
- Archive-name: htmlchek/patch02
- Environment: AWK || Perl
- Patch-To: htmlchek: Volume 47, Issue 48-54
-
- This is a bugfix and update to my htmlchek HTML error checker program,
- adding several minor features for greater convenience of use, and
- bringing the version number to 4.1. The program checks for quite a
- number of possible defects in the HTML (Hyper-Text Mark-up Language)
- version 2.0 SGML files used on the World-Wide Web; it makes no claim
- to understand all of SGML, but is easy and relatively simple to use,
- gives lots of information (including about many stylistically bad
- practices), can do local cross-reference checking and generate
- rudimentary reference-dependency maps, and can be run on any platform
- for which an awk or perl language interpreter is available.
-
- The htmlchek distribution also includes a number of supplemental
- utilities, including the htmlsrpl.pl HTML-aware search-and-replace
- program, which uses either literal strings or regular expressions;
- acts either only outside HTML/SGML tags, or only within tags; can be
- restricted to operate only within and/or only outside specified
- elements; and can also upper-case tag names. Other utilities are:
-
- makemenu    -- Makes simple menu for HTML files, based on each file's <TITLE>;
-                can also make a simple table of contents based on <H1>-<H6>
-                headings.
- xtraclnk.pl -- Extracts links/anchors from HTML files; isolates text
-                contained in <A> and <TITLE> elements.
- dehtml      -- Removes all HTML markup, preliminary to spell check.
- entify      -- Replaces high Latin-1 alphabetic characters with ampersand
-                entities for safe 7-bit transport.
- metachar    -- Trivial program to protect HTML/SGML metacharacters "&<>" in
-                plain text that is to be included in an HTML file.
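
As a rough illustration of the kind of protection metachar provides, the metacharacters "&<>" can be escaped with a short sed filter. This is only a sketch and not the metachar program from the distribution; the function name escape_html is made up for this example.

```shell
# Hypothetical stand-in for metachar (not the distributed program):
# escape the three HTML/SGML metacharacters in text read from stdin.
# The ampersand is handled first, so the "&" that begins "&lt;" and
# "&gt;" is not escaped a second time.
escape_html() {
    sed -e 's/&/\&amp;/g' -e 's/</\&lt;/g' -e 's/>/\&gt;/g'
}

# Example: prints  if a &lt; b &amp;&amp; c &gt; d
printf '%s\n' 'if a < b && c > d' | escape_html
```

The order of the three substitutions matters: escaping "<" or ">" before "&" would turn the newly written "&lt;" into "&amp;lt;".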
-
- The patched 4.1 version of htmlchek is (or soon will be) available
- for anonymous FTP, in convenient .tar.Z and .zip formats, at the
- following URL:
-
- ftp://ftp.cs.buffalo.edu/pub/htmlchek/
-
- The documentation for htmlchek can be browsed over the Web, at this
- location:
-
- http://uts.cc.utexas.edu/~churchh/htmlchek.html
-
- Changes in this release include:
-
- Don't warn about null <TEXTAREA></TEXTAREA> element; only check for
- inappropriate whitespace within elements commonly rendered as
- underlined (<A> and <U>); check ordering of head tags before body tags
- even in absence of explicit <head>...</head>; allow comments between
- list items; only output non-numeric unquoted option values in each
- file; corrected processing of HTML3 <LH>; updated HTML 3 language
- definition to January 19 1995 draft; tinkered with Netscape extensions
- language-definition yet again; added inline=1 command-line parameter;
- added listfile=/lf= command-line parameter (especially for greater
- MS-DOS convenience); allow cf= as abbreviation of configfile=;
- ampersands followed by non-alphabetics generate warnings rather than
- errors (so the corresponding error message was removed from entify); added
- "changed"/"unchanged" STDERR messages to htmlsrpl.pl output; added
- .gif's to documentation; added awk-perl.html to documentation; added
- index.html menu to documentation.
-
- New files in this release are:
-
- README.41      Update notes
- index.html     HTML version of README.40, README.41, and menu
- awk-perl.html  Where to obtain Awk and Perl
- geterr.sh      Trivial script to extract only ERROR! messages
-                from htmlchek output
- geterwrn.sh    Trivial script to extract only ERROR!/Warning!
-                messages from htmlchek output
-              ___
- awk.gif         |  .gif files used
- camel.gif       |  in htmlchek HTML
- ftp.gif         |  documentation
- htmlchek.gif    |  (uuencoded as .uue
- htmlchks.gif    |  files in the
- valdhtml.gif    |  comp.sources.misc
- warning.gif  ___|  Usenet distribution)
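
The filtering that geterr.sh and geterwrn.sh are described as doing can be approximated with grep. The sketch below is not the contents of those scripts; it assumes that each diagnostic occupies a line containing the literal marker "ERROR!" or "Warning!", and the function names are invented for the example.

```shell
# Hypothetical equivalents of geterr.sh and geterwrn.sh (the real
# scripts may differ). Both read saved htmlchek output on stdin and
# assume the markers "ERROR!" and "Warning!" appear literally on each
# diagnostic line.

# Keep only lines that contain ERROR!
only_errors() {
    grep -e 'ERROR!'
}

# Keep lines that contain either ERROR! or Warning!
errors_and_warnings() {
    grep -e 'ERROR!' -e 'Warning!'
}
```

With htmlchek output saved in a file (say report.chk, a made-up name), `only_errors < report.chk` would list just the errors. Note that grep exits with status 1 when nothing matches, which a calling script could use to detect a clean run.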
-
-
- To apply the patch, cd into the htmlchek source directory, unshar
- this file and type:
-
- patch < htmlc41a.ptc
- patch < htmlc41b.ptc
- patch < htmlc41c.ptc
-
- Use a uudecode program to convert the .uue files to .gif's.
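
If several .uue files need converting, a small loop saves typing. This is a sketch only: it assumes a standard uudecode command is on the PATH, and the function name decode_uue_files is made up here. No .uue file names are assumed; the loop takes whatever matches *.uue.

```shell
# Decode every .uue file in a directory (default: current directory).
# uudecode writes each output file under the name recorded in the
# archive's "begin" line, so no output name is given here.
decode_uue_files() {
    dir=${1:-.}
    for f in "$dir"/*.uue; do
        # With no matches the glob stays literal; skip it.
        [ -f "$f" ] || continue
        # Run in a subshell so the caller's working directory is kept.
        ( cd "$dir" && uudecode "$(basename "$f")" ) || return 1
    done
}

# Usage, from the htmlchek source directory:
#   decode_uue_files .
```

The function stops at the first file that fails to decode, so a damaged .uue file is noticed rather than silently skipped.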
-