http://sourceforge.net/projects/weblucene/

WebLucene: Lucene Web interface, use XML as a lightweight protocol. 

Developer convert data source (text, DB, MS Word, PDF... etc) into standard xml format 
indexing with lucene engine, and get full text search result via HTTP, with XML format 
output, user can easily intergrated with JSP ASP PHP front end or use XSLT at server 
side transform output.

Developer can intergrate lucene full text search engine with old MSSQL + ASP MySQL + 
PHP Oracle + JSP based web applications.

    MySQL  \                                                  / JSP    
    Oracle - DB      -  ==>   XML ==> (Lucene Index) ==> XML  -  ASP    
    MSSQL  /                                                  -  PHP            
             MS Word /                                        \         / XHTML
                 PDF /                                         =XSLT=> -  text
                                                                        \ XML
                                                                     
                         \_________Web Lucene____________/ 
                           
i18n issue: for Java is Unicode based, user can indexing data source(XML) in different 
charset into one lucene index(in unicode) and output result according to client 
browser support languages.
      GBK          \                                       / BIG5
      BIG5          -  UNICODE        ====>       Unicode -  GB2312
      SJIS          -   (XML)                     (XML)   -  SJIS
      ISO-8859-1   /                                       \ ISO-8859-1


Che, Dong
http://www.chedong.com/tech/




Reply via email to