Mailing List Archive

Support open source code!


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

tlug: HTML beautifier



Jonathan Byrne - 3Web writes:

 > 1) Convert all alphabetic characters following a < to uppercase, until a
 > space or = sign is encountered.  It should not be spooked by double-byte
 > characters, and of course, ignore all characters other than alphabetic
 > ones, the = sign, or a space.

James Clark's SP package includes a program called sgmlnorm that will
do this very easily, e.g.

$ sgmlnorm gnarly.html > nice.html

It won't lowercase the attribute values, but as Jim says, that could
have unwanted side-effects.

Matter of fact ... do you have nsgmls? Then you may very well have
sgmlnorm, too. If not ... let's see, it should be in the Jade RPM, or
you can get it (or at least good info about getting it) from:

http://www.jclark.com/

Matt Gushee
Oshamanbe, Hokkaido
--------------------------------------------------------------
Next Meeting: 8 August, Tokyo Station Yaesu central gate 12:30
featuring Linux on multiple platforms:
i386, Sparc, PA-Risc, Amiga, SGI, Alpha, PalmPilot, ...
Next Nomikai: September, 19:30 Tengu TokyoEkiMae 03-3275-3691
--------------------------------------------------------------
Sponsor: PHT, makers of TurboLinux http://www.pht.co.jp


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links