Re: Malformed UTF-8 character ...
On May 6, 2004 at 10:40, Tadamasa Teranishi wrote:
> > Figuring it was a LANG envariable setting, I explicitly set LANG
> > to en_US (it was defaulted to en_US.UTF-8), but it did not fix it.
> > Maybe I should try en_US.ISO-8859-1?
> xxxx.UTF-8 is not supported.
I'm aware of this.
> Instead of "en_US.UTF-8", you have to set "C".
> Probably "LC_ALL" or "LC_CTYPE" etc. is set to en_US.UTF-8.
> Please set LC_ALL=C and use mknmz.
Is there any drawback to including the "use bytes" pragma to
avoid this problem? Is there a need to support older versions
of Perl that do not support the pragma?
As a sanity check, namazu could do a locale check (checking the
relevant environment variables) and, if a UTF-8 locale is set,
either generate a warning and fall back to the C locale, or error
out stating that the locale is unsupported.
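
A minimal sketch of such a check, written as shell functions that a wrapper script could run before mknmz. This is not namazu code: the function names effective_ctype, is_utf8_locale, and check_locale are hypothetical, and the sketch assumes the usual POSIX precedence in which a non-empty LC_ALL overrides LC_CTYPE, which in turn overrides LANG. That precedence would also explain why changing LANG alone had no effect.

```shell
#!/bin/sh
# Hypothetical helper (not part of namazu): print the locale that
# governs character handling, using the POSIX precedence
# LC_ALL > LC_CTYPE > LANG. Empty or unset values are skipped.
effective_ctype() {
    if [ -n "${LC_ALL:-}" ]; then
        printf '%s\n' "$LC_ALL"
    elif [ -n "${LC_CTYPE:-}" ]; then
        printf '%s\n' "$LC_CTYPE"
    else
        printf '%s\n' "${LANG:-C}"
    fi
}

# Hypothetical helper: succeed if the locale name carries a UTF-8
# codeset suffix, e.g. en_US.UTF-8 or ja_JP.utf8.
is_utf8_locale() {
    case "$1" in
        *.[Uu][Tt][Ff]-8*|*.[Uu][Tt][Ff]8*) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: warn on stderr and fall back to the C locale
# when the effective locale is UTF-8 based. To error out instead,
# replace the two LC_ALL lines with "exit 1".
check_locale() {
    if is_utf8_locale "$(effective_ctype)"; then
        echo "warning: locale '$(effective_ctype)' is not supported; using C" >&2
        LC_ALL=C
        export LC_ALL
    fi
}
```

A wrapper would call check_locale and then invoke mknmz with its usual arguments. Setting LC_ALL rather than LANG in the fallback is deliberate, since LC_ALL takes priority over any LC_CTYPE or LANG value inherited from the environment.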
With later Linux distributions now defaulting to UTF-8-based locales,
such checks may eliminate user mail to the list about this.