Namazu-users-en(old)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Malformed UTF-8 character ...

From: Earl Hood <earl@xxxxxxxxxxxx>
Date: Fri, 07 May 2004 14:49:42 -0500
X-ml-name: namazu-users-en
X-mail-count: 00501
References: <200405052211.i45MBXx19118@gator.earlhood.com> <4099978C.AF28FE4F@asahi-net.or.jp>

On May 6, 2004 at 10:40, Tadamasa Teranishi wrote:
(B
(B> > Figuring it was a LANG envariable setting, I explicitly sent LANG
(B> > to en_US (it was defaulted to en_US.UTF-8), but it did not fix it.
(B> > Maybe I should try en_US.ISO-8859-1?
(B> 
(B> xxxx.UTF-8 is not supported.
(B
(BI'm aware of this.
(B
(B> You Instead of "en_US.UTF-8" You have to set "C".
(B> 
(B> probably "LC_ALL" or "LC_CTYPE" etc. It is en_US.UTF-8.
(B> Please set up LC_ALL=C and use mknmz.
(B
(BIs there any drawback of including the "use bytes" pragma to
(Bavoid this problem?  Is there a need to support older versions
(Bof perl that do not support the pragma?
(B
(BAs a sanity check, namazu could do a locale check (checking various
(Benvariables), and if set to a UTF-8 locale, could either generate
(Ba warning, and fallback to the C locale, or could error out
(Bstating unsupported locale.
(B
(BWith later linux distributions now defaulting to UTF-8-based locales,
(Bsuch checks may eliminate user mail to the list about this.
(B
(B--ewh

Follow-Ups:
- Re: Malformed UTF-8 character ...
  - From: Tadamasa Teranishi

References:
- Malformed UTF-8 character ...
  - From: Earl Hood
- Re: Malformed UTF-8 character ...
  - From: Tadamasa Teranishi

Prev by Date: Re: Malformed UTF-8 character ...
Next by Date: Re: How to integrate Mhonarc with Namazu
Previous by thread: Re: Malformed UTF-8 character ...
Next by thread: Re: Malformed UTF-8 character ...
Index(es):
- Date
- Thread