>> // just remove invalid characters: in php >> $pattern ="/[\x-\x8\xb-\xc\xe-\x1f]/"; >> $string = preg_replace($pattern,'',$string);
----- Original Message ----- From: "Jagdip Singh" <[EMAIL PROTECTED]> To: "'Lucene Users List'" <[EMAIL PROTECTED]> Sent: Monday, July 07, 2003 7:53 AM Subject: making XML from articles > Hi, > I am trying to use Lucene for searching articles (text files) and web > pages. I am thinking of converting those articles to XML files and then > feed to Lucene for indexing. > I have not done anything much with XML before and trying to know if this > is going to be a better idea in term of searching. > How can I convert text into XML? > > Please suggest me if someone has faced similar situation before. > > Regards, > Jagdip >
