Michael Hicks <hick0088 at tc.umn.edu> writes:

> Dave Sherohman wrote:
> >
> > The Identity of the god R\355g
> > from the Eddic Poem R\355gs\376\372la
> >
> > How would I go about cleaning this up and making it web-readable?
> > Ideally, I would like to do it such that the \nnn codes will display
> > the correct Icelandic characters in a browser, but I suspect that
> > transliterating to standard ASCII would be the most widely-compatible
> > way to go.
>
> If you want to convert to 8-bit characters (ISO-8859-1), this perl
> script will work:
>
> #!/usr/bin/perl
>
> # Replace each \nnn octal escape with the single byte it names.
> # Done in Perl itself rather than by shelling out to `echo -en',
> # which breaks on lines containing a single quote.
> while ($line = <STDIN>)
> {
>   $line =~ s/\\([0-3][0-7][0-7])/chr(oct($1))/ge;
>   print $line;
> }
>
> However, if you want an HTML version, the best thing to do would be to
> convert the \nnn strings into the appropriate HTML entities.  \355 is
> the character `í', which is `&iacute;' in HTML, for example.  It would
> be easy to write an appropriate perl or sed script to fix it up.

Doh!  Right, this is the correct advice.  Glad I managed to come up
with *no* advice, instead of actually making a *stupid* suggestion!
-- 
David Dyer-Bennet / Welcome to the future! / dd-b at dd-b.net
SF: http://www.dd-b.net/dd-b/  Minicon: http://www.mnstf.org/minicon/
Photos: http://dd-b.lighthunters.net/
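
For the archive, here is a minimal sketch of the entity conversion suggested above. It is not from the thread: the function name escapes_to_entities is made up for illustration, and it assumes every \nnn code is an octal ISO-8859-1 byte value. To keep it short it emits numeric character references (&#237;) rather than named entities (&iacute;); browsers render both the same way, and this avoids a lookup table of names.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): rewrite each \nnn octal
# escape as a decimal numeric character reference, e.g. \355 -> &#237;.
# Assumption: the escapes are ISO-8859-1 byte values, which coincide
# with Unicode code points, so the reference names the same character.
sub escapes_to_entities {
    my ($text) = @_;
    $text =~ s/\\([0-3][0-7]{2})/sprintf('&#%d;', oct($1))/ge;
    return $text;
}

# The title from the original question, with its escapes left intact.
my $sample = 'The Identity of the god R\355g from the Eddic Poem R\355gs\376\372la';
print escapes_to_entities($sample), "\n";
# prints: The Identity of the god R&#237;g from the Eddic Poem R&#237;gs&#254;&#250;la
```

To use it as a filter over a whole file, the last two statements could be replaced with a loop that reads STDIN line by line and prints escapes_to_entities($_) for each line.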