Namazu-win32-users-ja(旧)


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Excelファイルの検索



山田@京都です。
 はじめまして。Namazu for Win32をインストールして、Excelファイルの全文
検索をテストしてみたのですが、
mknmzコマンドでインデックスは、出来ているようなのですが、namazuコマンド
では、「検索式にマッチ
する文章は、ありませんでした。」と表示されます。

Ms-wordの文章は、ちゃんと検索できているようです。
Wordファイルが検索できでも、Excelファイルは、検索できないということがあ
るのでしょうか?
ご教授の程、よろしくお願いします。

(動作環境)
OS:Windows98SE
Excel 2002(10.2614.2625)
Word 2002(10.2627.2625)

「perl  nmzchk.pl > nmzchk.txt」の実行結果を載せておきます。

Content-type: text/plain

=== printout opendir(CURDIR,".") ===
name:nmzchk.pl
 dev=2 ino=0 mode=33206 nlink=1
 uid=0 gid=0 rdev=2 size=4879
 atime=1029942000 mtime=973710388 ctime=1029814243
 blksize= blocks=
name:NMZSETUP.BAT
 dev=2 ino=0 mode=33279 nlink=1
 uid=0 gid=0 rdev=2 size=14575
 atime=1029942000 mtime=1006870680 ctime=1029814243
 blksize= blocks=
name:nmzchk.txt
 dev=2 ino=0 mode=33206 nlink=1
 uid=0 gid=0 rdev=2 size=0
 atime=1029942000 mtime=1029982268 ctime=1029910150
 blksize= blocks=

=== printout $ENV ===
HOME = >>>C:\namazu<<<
ITAIJIDICTPATH = >>>c:\kakasi\share\kakasi\itaijidict<<<
KANWADICTPATH = >>>c:\kakasi\share\kakasi\kanwadict<<<
LANG = >>>ja_JP.SJIS<<<
MKNMZRC = >>>C:\namazu\etc\namazu\mknmzrc<<<
--- printout C:\namazu\etc\namazu\mknmzrc ---
package conf;  # Don't remove this line!

$HTML_SUFFIX = "html?|[ps]html|html\\.[a-z]{2}";

$ALLOW_FILE = ".*\\.(?:$HTML_SUFFIX)|.*\\.txt" . # HTML, plain text

$DENY_FILE =
".*\\.(gif|png|jpg|jpeg)|.*\\.tar\\.gz|core|.*\\.bak|.*~|\\..*|\x23.*";

$DIRECTORY_INDEX = "";

$REMAIN_HEADER = "From|Date|Message-ID";

$SEARCH_FIELD =
"message-id|subject|from|date|uri|newsgroups|to|summary|size";

$META_TAGS = "keywords|description";

$FIELD_ALIASES = ('title' => 'subject', 'author' => 'from');

$NON_SEPARATION_ELEMENTS =
'A|TT|CODE|SAMP|KBD|VAR|B|STRONG|I|EM|CITE|FONT|U|'.

$ON_MEMORY_MAX   = 5000000;

$FILE_SIZE_MAX   = 2000000;

$TEXT_SIZE_MAX   =  600000;

$WORD_LENG_MAX   = 128;

$LIBDIR = 'C:/namazu/share/namazu/pl';

$FILTERDIR = 'C:/namazu/share/namazu/filter';

$TEMPLATEDIR = 'C:/namazu/share/namazu/template';

1;

-------------------------
NAMAZULOCALEDIR = >>>C:\namazu\share\locale<<<
NAMAZURC = >>>C:\namazu\etc\namazu\namazurc<<<
--- printout C:\namazu\etc\namazu\namazurc ---
Index         C:\namazu\var\namazu\index

Lang          ja_JP.SJIS

-------------------------
PATH =
>>>C:\NAMAZU\BIN;C:\PERL\BIN\;C:\WINDOWS;C:\WINDOWS;C:\WINDOWS\COMMAND;C:\JDK1.3.1_04\BIN;C:\KAKASI\BIN;<<<

=== where ===
C:\NAMAZU\BIN/namazu.exe
C:\PERL\BIN\/perl.exe
C:\KAKASI\BIN/kakasi.exe
C:\NAMAZU\BIN/mknmz.bat
C:\NAMAZU\BIN/gcnmz.bat
C:\PERL\BIN\/ppm.bat
C:\PERL\BIN\/pl2bat.bat

=== versions ===
--- namazu -v ---
namazu of Namazu 2.0.10
Copyright (C) 1997-1999 Satoru Takabayashi All rights reserved.
Copyright (C) 2000,2001 Namazu Project All rights reserved.
This is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2, or (at your option)
any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty
of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.
------------
--- namazu -C ---
読み込んだ設定ファイル: C:\namazu\etc\namazu\namazurc
--
インデックス (Index):    C:\namazu\var\namazu\index
ログの記録 (Logging):    on
使用する言語 (Lang):     ja_JP.SJIS
スコア計算 (Scoring):    tfidf
テンプレート (Template):
ヒット件数の上限 (MaxHit):      10000
マッチする語の上限 (MaxMatch):  1000
強調タグ (EmphasisTags): <strong class="keyword"> </strong>
------------
--- perl -v ---

This is perl, v5.6.1 built for MSWin32-x86-multi-thread
(with 1 registered patch, see perl -V for more detail)

Copyright 1987-2001, Larry Wall

Binary build 630 provided by ActiveState Tool Corp.
http://www.ActiveState.com
Built 09:32:05 Nov 11 2001


Perl may be copied only under the terms of either the Artistic License
or the
GNU General Public License, which may be found in the Perl 5 source kit.

Complete documentation for Perl, including FAQ lists, should be found on

this system using `man perl' or `perldoc perl'.  If you have access to
the
Internet, point your browser at http://www.perl.com/, the Perl Home
Page.

------------
--- nkf -v ---
コマンドまたはファイル名が違います.
------------
--- kakasi -v ---
KAKASI - Kanji Kana Simple Inverter  Version 2.3.4
Copyright (C) 1992-1999 Hironobu Takahashi. All rights reserved.

Usage: kakasi -a[jE] -j[aE] -g[ajE] -k[ajKH] -E[aj] -K[ajkH] -H[ajkK]
-J[ajkKH]
              -i{oldjis,newjis,dec,euc,sjis}
-o{oldjis,newjis,dec,euc,sjis}
              -r{hepburn,kunrei} -p -s -f -c"chars"  [jisyo1, jisyo2,,,]

      Character Sets:
       a: ascii  j: jisroman  g: graphic  k: kana (j,k     defined in
jisx0201)
       E: kigou  K: katakana  H: hiragana J: kanji(E,K,H,J defined in
jisx0208)

      Options:
      -i: input coding system    -o: output coding system
      -r: romaji conversion system
      -p: list all readings (with -J option)
      -s: insert separate characters (with -J option)
      -f: furigana mode (with -J option)
      -c: skip chars within jukugo (with -J option: default TAB CR LF
BLANK)
      -C: romaji Capitalize (with -Ja or -Jj option)
      -U: romaji Upcase     (with -Ja or -Jj option)
      -u: call fflush() after 1 character output
      -w: wakatigaki mode

Report bugs to <bug-kakasi@xxxxxxxxxx>.
------------
--- mknmz -v ---
mknmz of Namazu 2.0.10
Copyright (C) 1997-1999 Satoru Takabayashi All rights reserved.
Copyright (C) 2000,2001 Namazu Project All rights reserved.

This is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2, or (at your option)
any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty
of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.
------------
--- mknmz -C ---
読み込んだ設定ファイル: C:/namazu/etc/namazu/mknmzrc
システム: MSWin32
Namazu: 2.0.10
Perl: 5.006001
NKF: module_nkf
KAKASI: module_kakasi -ieuc -oeuc -w
茶筌: chasen -j -F '%m '
わかち書き: module_kakasi -ieuc -oeuc -w
メッセージの言語: ja_JP.SJIS
言語: ja_JP.SJIS
文字コード: sjis
CONFDIR: C:/namazu/etc/namazu
LIBDIR: C:/namazu/share/namazu/pl
FILTERDIR: C:/namazu/share/namazu/filter
TEMPLATEDIR: C:/namazu/share/namazu/template
対応メディアタイプ:
  application/excel
  application/ichitaro4
  application/ichitaro5
  application/ichitaro6
  application/ichitaro7
  application/msword
  application/rtf
  application/x-gzip
  application/x-js-taro
  message/news
  message/rfc822
  text/hnf
  text/html
  text/html; x-type=mhonarc
  text/plain
  text/plain; x-type=rfc
  text/x-hdml
------------
--- zcat --version ---
コマンドまたはファイル名が違います.
------------
--- gzip --version ---
コマンドまたはファイル名が違います.
------------
--- groff --version ---
コマンドまたはファイル名が違います.
------------
--- jgroff --version ---
コマンドまたはファイル名が違います.
------------
--- pdftotext -v ---
コマンドまたはファイル名が違います.
------------
--- xlhtml ---
コマンドまたはファイル名が違います.
------------
--- wvhtml ---
コマンドまたはファイル名が違います.
------------
--- wvversion ---
コマンドまたはファイル名が違います.
------------
--- gcc --version ---
コマンドまたはファイル名が違います.
------------
--- make --version ---
コマンドまたはファイル名が違います.
------------
--- gettext --version ---
コマンドまたはファイル名が違います.
------------
--- autoconf --version ---
コマンドまたはファイル名が違います.
------------
--- automake --version ---
コマンドまたはファイル名が違います.
------------
--- libtool --version ---
コマンドまたはファイル名が違います.
------------

--- File::MMagic ---
1.13
--- NKF ---
1.92
--- Text::Kakasi ---
1.05
--- Text::Chasen ---
Not found Text::Chasen !!!

--- web server ---
HTTP/1.0 200 OK

Server: Microsoft-PWS-95/2.0

Date: Thu, 22 Aug 2002 02:11:18 GMT

Content-Type: text/html

Accept-Ranges: bytes

Last-Modified: Fri, 18 Oct 1996 02:00:00 GMT

Content-Length: 1181