Mailing List Archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [tlug] Translating old to new kanji forms using tr



On 6/28/05, Danny Wilde <fuzakenbo@example.com> wrote:

> I've done something like this once, some time ago. The solution I used was
> to write a script in Perl. You could just write loads of s/kanji1/kanji2/,
> for example, (s/A/B/ means "substitute A for B" in Perl) or you could stuff
> all the kanji into an associative array and match them using a regular
> expression. I think (not sure) that the newer versions of perl have a \cJ
> operator which matches one Japanese character, so if the above-mentioned
> character overlap is the cause of the problem, then it would be solved that
> way.

Perl 5.8 and newer handle multi-byte encodings properly in regular expressions.

-Josh


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links