[Namazu-users-en] Re: html data not indexed in text/html mails

swati swati_longia at sifycorp.com
Thu Aug 4 14:38:23 JST 2005


Hello all,
This is a sample mail that i was trying to index and search on.

Return-Path: <robert at data.com>
Delivered-To: mmm.abcd.net-xvzcf at mmm.abcd.net
X-MMS-INDEX:541434691.1122069334
X-QHPSI: clean
X-Filter: xFilter/xyz Revision 2.26 (http://mail.xyz.com)
Received: (abc 6132 invoked by uid 511); Sat Jul 23 03:25:34 2005
X-QHPSI: clean
Received: (abc 6124 invoked by uid 511); 23 Jul 2005 03:25:32 +0530
Received: from 0.1.0.1 (HELO xyz.com) (0.1.0.1)
  by 127.0.0.1 with SMTP; 23 Jul 2005 03:25:32 +0530
Received: (xyz 23168 invoked by uid 510); 23 Jul 2005 03:30:10 +0530
XDelivered-To: xyz.com-xvzf at xyz.com
X-Filter: xFilter/xyz Revision 2.32 (http://mail.xyz.com)
Received: (abc 23166 invoked by uid 510); Sat Jul 23 03:30:10 2005
Received: (abc 23163 invoked by uid 510); 23 Jul 2005 03:30:10 +0530
Received: from 1.0.0.1 (HELO cokrelay3) (1.0.0.1)
  by 0.1.0.1 with SMTP; 23 Jul 2005 03:30:10 +0530
Received: (xyz 13895 invoked by uid 508); 23 Jul 2005 03:30:12 +0530
Received: from 1.1.1.1 (HELO cokrelay3) (1.1.1.1)
  by 1.1.1.1 with SMTP; 23 Jul 2005 03:30:12 +0530
Received: (xyz 13877 invoked by uid 508); 23 Jul 2005 03:30:12 +0530
DomainKey-Status: no signature
Received: from 0.0.0.0 (HELO com.125) (0.0.0.0)
  by 1.1.1.1 with SMTP; 23 Jul 2005 03:30:12 +0530
Received: by com.125 (PowerMTA(TM) v1.5); Fri, 22 Jul 2005 16:23:03 -0400 (envelope-from <robert at data.com>)
To: xvzf at xyz.com
From: Robert at xyz.com, G at xyz.com, Allen <robert at data.com
X-Mailer: Mcamp:int6-1.1-1110-Sq
X-INFO_AZ: WldScGRHOXlRSE5wWm5rdVkyOXQK
X-INFO_BZ: TWpNek5DNUZVMGxmVWtkQk1TNW9ZalV5TlE9PQo=
X-INFO_CZ: UVdOMGFYWmxUV0ZwYkdsdVp3PT0K
MIME-Version: 1.0
Date: Fri Jul 22 16:23:03 2005
Subject: Robert Allen Multiple Income streams
Message-ID: <WldScGRHOXlRSE5wWm5rdVkyOXQK at mfm49data.com>
Content-Type: text/html; charset=us-ascii
Content-Transfer-Encoding: 7Bit
X-Bogosity:0.371433

<center><font size=1 face=verdana,sans-serif>If you cannot see this page, please <a href=http://WldScGRHOXlR.mfm49data.com/host/RGA1/?e=WldScGRHOXlRSE5wWm5rdVkyOXQK&c=TWpNek5DNUZVMGxmVWtkQk1TNW9ZalV5TlE9PQo=&l=UVdOMGFYWmxUV0ZwYkdsdVp3PT0K>Refresh this</a>.<br>*For Outlook users, if images are blocked, please right click on the image and select, 'Add to safe sender list.' </font></center>

<center><font style="font-size: 1pt" face="Times New Roman">simplifier.  maxim angers aster prophesy dirge penalizing memorials.  intellectual primers pasted majesty confederation appointment smeared.  acceptors improper optic equally extracts Uruguay scaffold.  preassigned cackles assassinate perpetual lanes quell bothers.  radars faiths tracker cripple costly Texan reproducible.  playful protestor decelerates Zoe affixing confided replay.  chat racketeer Argentinian Ludmilla Rutherford mallet suffixes.  </font></center>

<html>
<head>
        <style type="text/css" media="screen">
                <!--
                        body  { color: black; font-family: Arial, Verdana, Helvetica, sans-serif }
                        .headline   { font-size: 1.7em; text-align: right; padding-right: 20px }
                        .message    { font-size: 2em; text-align: right; padding-top: 0px; padding-right: 6px }
                        .offer     { font-size: 15px; font-weight: 700; padding-right: 15px }
                -->
        </style>
</head>
<body bgcolor="#FFFFFF" leftmargin="0" topmargin="0" marginwidth="0" marginheight="0">
                <div align="center">
                        <br>
                        <table width="600" height="369" border="0" cellpadding="0" cellspacing="0">
                                <tr height="299">
                                        <td valign="top" width="600" height="299" background="http://WldScGRHOXlR.mfm49data.com/host//RGA1/index_01.jpg">
                                                <p class="headline"><br>
                                                        FEAR Is Not a<br>
                                                        FACTOR...</p>
                                                <p class="message">It's Your Turn<br>
                                                        to Take On<br>
                                                        My Public<br>

                                                        Challenge!</p>
                                        </td>
                                </tr>
                                <tr>
                                        <td width="600"><a href="http://WldScGRHOXlR.data.com/host/RGA1/?e=WldScGRHOXlRSE5wWm5rdVkyOXQK&c=TWpNek5DNUZVMGxmVWtkQk1TNW9ZalV5TlE9PQo=&l=UVdOMGFYWmxUV0ZwYkdsdVp3PT0K"><img src="http://WldScGRHOXlR.9data.com/host/RGA1/index_02.jpg" width="600" height="42" border="0" alt=""></a></td>
                                </tr>
                                <tr height="28">
                                        <td width="600" height="28" background="http://WldScGRHOXlR.data.com/host/RGA1/index_03.jpg">
                                                <div align="right" class="offer"></div>
                                        </td>
                                </tr>
                        </table>

<div align="center"><img src="http://www.deliver.com/creatives/footer_text_2.gif" /></div></body>
</html>
<div align=center>
<table border=0 cellpadding=1 cellpadding=0 bgcolor=silver>
<tr>
<td align=right>
<center><font style="font-size: 1pt" face="Times New Roman">simplifier.  maxim angers aster prophesy dirge penalizing memorials.  intellectual primers pasted majesty confederation appointment smeared.  acceptors improper optic equally extracts Uruguay scaffold.  preassigned cackles assassinate perpetual lanes quell bothers.  radars faiths tracker cripple costly Texan reproducible.  playful protestor decelerates Zoe affixing confided replay.  chat racketeer Argentinian Ludmilla Rutherford mallet suffixes.  </font></center>
<font face=verdana,sans-serif size=1 color=#444444></td>
</tr>
<tr>
<td>
<table border=0 cellpadding=1 cellspacing=0 bgcolor=#444444
 width=1>
<tr>
<td>
<table border=0 cellpadding=3 cellspacing=0 bgcolor=silver
 width=550>
<tr>
<td><font face=verdana,sans-serif size=1 color=#444444>
<strong>Opt-out <a href=http://WldScGRHOXlR.data.com/block/u.php?addy=xvzf@xyz.com&ID=TWpNek5DNUZVMGxmVWtkQk1TNW9ZalV5TlE9PQo= style=color:#333333><strong>http://WldScGRHOXlR.9data.com/block/u.php?addy=xvzf@xyz.com&ID=TWpNek5DNUZVMGxmVWtkQk1TNW9ZalV5TlE9PQo=</strong></a><br>
or Send mail to <br><strong>Members from Members List Management, 21346 Saint Andrews blvd. #214, BocaRaton, FL, 33433</strong><br>
</font>
</td>
</tr>
</table>
</td>
</tr>
</table>

</td>
</tr>
</table>
</div>
<img src=http://image.data.com/image.php?e=WldScGRHOXlRSE5wWm5rdVkyOXQK&c=TWpNek5DNUZVMGxmVWtkQk1TNW9ZalV5TlE9PQo=&l=UVdOMGFYWmxUV0ZwYkdsdVp3PT0K width=1 height=1>

<center><font style="font-size: 1pt" face="Times New Roman">simplifier.  maxim angers aster prophesy dirge penalizing memorials.  intellectual primers pasted majesty confederation appointment smeared.  acceptors improper optic equally extracts Uruguay scaffold.  preassigned cackles assassinate perpetual lanes quell bothers.  radars faiths tracker cripple costly Texan reproducible.  playful protestor decelerates Zoe affixing confided replay.  chat racketeer Argentinian Ludmilla Rutherford mallet suffixes.  </font></center>




In this mail I am able to search on the words like rober and streams, which exists in the header part. But the words like fear or member or primers, which exists inside the html part of the mail are not indexed or searched. I tried the new verison of namazu (namazu-2.0.15pre1 ) with that also i am not able to index/search this type of mail.

Can anyone give some suggestions as to how I can make these mails also indexd and searched.

Thanks in advance

Sincerely,
Swati








Yukio USUDA wrote:

>namazu-2.0.14 bypasses attached files and multipart entities when indexing 
>mails.
>namazu-2.0.15 will support to index the multipart entities. It is next 
>release feature.
>You can try namazu 2.0.15 prerelease version.
>http://www.namazu.org/test/namazu-2.0.15pre1.tar.gz
>Keep in mind that a prerelease version has not been through exhaustive testing.
>
>To handle Attached base64 bodies, MIME::Base64 and MIME::QuotedPrint are 
>required.(perl5.8 contains them.)
>http://search.cpan.org/dist/MIME-Base64/
>And mknmz require --decode-base64 option when handling multipart entities.
>
>Yukio USUDA
>
>  
>
********** DISCLAIMER **********
Information contained and transmitted by this E-MAIL is proprietary to 
Sify Limited and is intended for use only by the individual or entity to 
which it is addressed, and may contain information that is privileged, 
confidential or exempt from disclosure under applicable law. If this is a 
forwarded message, the content of this E-MAIL may not have been sent with 
the authority of the Company. If you are not the intended recipient, an 
agent of the intended recipient or a  person responsible for delivering the 
information to the named recipient,  you are notified that any use, 
distribution, transmission, printing, copying or dissemination of this 
information in any way or in any manner is strictly prohibited. If you have 
received this communication in error, please delete this mail & notify us 
immediately at admin at sifycorp.com


More information about the Namazu-users-en mailing list