[Namazu-users-en] Re: Problems with mknmz and Perl 5.8.6
earl at earlhood.com
Sun Jun 12 04:39:29 JST 2005
On June 11, 2005 at 13:11, Tadamasa Teranishi wrote:
> Perhaps, NMZ.field.subject.i is broken.
> What is the version of Namazu used?
> > Also, the "Malformed UTF-8 ..." warnings are popping up, regardless
> > of what LANG or LC_ALL are set to. I had to add a 'use bytes' pragma
> > to mailnews.pl at line 212 to get rid of the warnings.
> Please try.
> $ env LC_ALL=C mknmz ...
I have, in a myriad of ways. I just recreated things on one of my
local systems to make analysis easier.
I've made the command used and the output of a stock namazu 2.0.14
installation available for your examination at
<http://www.mhonarc.org/tmp/mknmz-out.txt.gz>. That is, no modifications
were made to the namazu code, so the many "malformed utf-8 ..." messages
are present. Perl also complains about wide characters in print.
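As a rough sketch, the two kinds of warnings in a captured mknmz log
could be tallied with a small shell function like the one below. The
function name count_warnings and the log file name in the usage comment
are made up for illustration, and the grep patterns assume the usual
Perl wording ("Malformed UTF-8 character ..." and "Wide character in
print"); adjust them if the log reads differently.

```shell
# count_warnings LOGFILE
# Hypothetical helper: counts lines in a captured mknmz log that contain
# each of the two Perl warnings discussed in this thread.
count_warnings() {
    log=$1
    # grep -c prints the number of matching lines; -i ignores case.
    # "|| true" keeps a zero-match result from aborting under "set -e".
    malformed=$(grep -ci 'malformed utf-8' "$log" || true)
    wide=$(grep -ci 'wide character in print' "$log" || true)
    printf 'malformed=%s wide=%s\n' "$malformed" "$wide"
}

# Usage (assumes the gzipped log has been downloaded and unpacked):
#   gzip -dc mknmz-out.txt.gz > mknmz-out.txt
#   count_warnings mknmz-out.txt
```

Comparing the counts between a run with LC_ALL=C and a run without it
is one way to check whether the locale setting has any effect at all.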
I've also made available the input files and NMZ.* files at
the following locations:
The following is version information from mknmz:
Coding System: euc
Supported media types: (23)
Unsupported media types: (10) marked with minus (-) probably missing application
in your $path.
- application/excel: excel.pl
- application/ichitaro7: taro7_10.pl
- application/pdf: pdf.pl
- application/powerpoint: powerpoint.pl
- application/rtf: rtf.pl
- application/x-deb: deb.pl
- application/x-dvi: dvi.pl
- application/x-js-taro: taro7_10.pl
- application/x-tex: tex.pl
- audio/mpeg: mp3.pl
text/html; x-type=mhonarc: mhonarc.pl
text/plain; x-type=rfc: rfc.pl
The following is the output of doing a search via `namazu' from the
namazu -s -n 3 -f cgi-bin/.namazurc '+from:earl' \
References: [ +from:earl: 49 ]
Total 49 documents matching your query.
1. er things I want hidden (score: 1)
/~listsarc/archive/html/namazu-users-en/2004-09/msg00005.html (8,178 bytes)
2. g indexing (score: 1)
/~listsarc/archive/html/namazu-users-en/2004-05/msg00011.html (7,732 bytes)
3. med UTF-8 character ... (score: 1)
/~listsarc/archive/html/namazu-users-en/2004-05/msg00004.html (8,738 bytes)
Current List: 1 - 3
Notice how the first part of each subject string is clipped. A search
for "PHP" returns no hits, although it should.
If you require any other information, I will provide it.
Thanks for your help,
Earl Hood, <earl at earlhood.com>
PGP Public Key: <http://www.earlhood.com/gpgpubkey.txt>