>>        // just remove invalid characters: in php
>>        $pattern ="/[\x-\x8\xb-\xc\xe-\x1f]/";
>>        $string = preg_replace($pattern,'',$string);

----- Original Message ----- 
From: "Jagdip Singh" <[EMAIL PROTECTED]>
To: "'Lucene Users List'" <[EMAIL PROTECTED]>
Sent: Monday, July 07, 2003 7:53 AM
Subject: making XML from articles


> Hi,
> I am trying to use Lucene for searching articles (text files) and web
> pages. I am thinking of converting those articles to XML files and then
> feed to Lucene for indexing.
> I have not done anything much with XML before and trying to know if this
> is going to be a better idea in term of searching. 
> How can I convert text into XML?
>  
> Please suggest me if someone has faced similar situation before.
>  
> Regards, 
> Jagdip
> 

Reply via email to