You need to use JAMA together with Lucene: take the term vectors that
Lucene builds and use JAMA to compute the SVD.

http://math.nist.gov/javanumerics/jama/
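
To make the SVD step concrete, here is a minimal, dependency-free sketch.
It is not JAMA: it is a hand-rolled power iteration that finds only the
largest singular value and its right singular vector of a small
term-by-document matrix. The class name PowerSvd, the method names, and
the toy matrix are all made up for illustration. With JAMA on the
classpath, new Jama.Matrix(a).svd() should give you the full
decomposition instead.

```java
public class PowerSvd {
    // y = A x, where A is m x n (rows = terms, columns = documents).
    static double[] times(double[][] a, double[] x) {
        double[] y = new double[a.length];
        for (int i = 0; i < a.length; i++)
            for (int j = 0; j < x.length; j++)
                y[i] += a[i][j] * x[j];
        return y;
    }

    // y = A^T x
    static double[] transposeTimes(double[][] a, double[] x) {
        double[] y = new double[a[0].length];
        for (int i = 0; i < a.length; i++)
            for (int j = 0; j < y.length; j++)
                y[j] += a[i][j] * x[i];
        return y;
    }

    // Euclidean length of a vector.
    static double norm(double[] x) {
        double s = 0;
        for (double v : x) s += v * v;
        return Math.sqrt(s);
    }

    // Power iteration on A^T A. Starting from the all-ones direction is an
    // assumption that suits non-negative count matrices.
    static double[] topRightSingularVector(double[][] a, int iterations) {
        int n = a[0].length;
        double[] v = new double[n];
        java.util.Arrays.fill(v, 1.0 / Math.sqrt(n));
        for (int k = 0; k < iterations; k++) {
            double[] z = transposeTimes(a, times(a, v));
            double len = norm(z);
            if (len == 0) break; // all-zero matrix: nothing to converge to
            for (int j = 0; j < n; j++) v[j] = z[j] / len;
        }
        return v;
    }

    // The singular value that belongs to a unit right singular vector v.
    static double singularValue(double[][] a, double[] v) {
        return norm(times(a, v));
    }

    public static void main(String[] args) {
        // Toy matrix: documents 0 and 1 share two terms, document 2 shares none.
        double[][] a = { {1, 1, 0}, {1, 1, 0}, {0, 0, 1} };
        double[] v = topRightSingularVector(a, 100);
        System.out.println("sigma = " + singularValue(a, v));
        System.out.println("v = " + java.util.Arrays.toString(v));
    }
}
```

In the toy matrix the first two documents get equal weight in the
dominant direction and the third gets almost none, which is the kind of
grouping the clustering step would then pick up.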

Best

jose

On 3/28/07, Mark Stiner <[EMAIL PROTECTED]> wrote:


Hi,

I have a research project in which I want to implement the LSI technique.
The scenario is as follows.

Search news sites for local, event-based news and cluster similar news
items together, for example items about a hurricane in New York City.

We want to apply basic LSI as follows:

   - Keyword extraction
   - Filter using a stop list
   - Stemming
   - Optional: synonym detection
   - Frequency matrix
   - SVD
   - Cluster related news items
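
The first steps of the list above, up to the frequency matrix, can be
sketched in plain Java as follows. This is only an illustration under
stated assumptions: the class name TermDocMatrix, the tiny stop list, and
the sample headlines are invented, and stemming and synonym detection are
left out. In a real setup Lucene's analyzers would normally do the
tokenizing, stop-word filtering, and stemming.

```java
import java.util.*;

public class TermDocMatrix {
    // Hypothetical stop list; a real one would be much longer.
    static final Set<String> STOP =
        new HashSet<>(Arrays.asList("the", "a", "in", "of", "and", "is", "on"));

    // Lowercase the text, split on non-letters, drop stop words and empty tokens.
    static List<String> tokenize(String text) {
        List<String> out = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^a-z]+")) {
            if (!t.isEmpty() && !STOP.contains(t)) out.add(t);
        }
        return out;
    }

    // Alphabetically sorted list of all distinct terms in the documents.
    static List<String> vocabulary(List<String> docs) {
        TreeSet<String> terms = new TreeSet<>();
        for (String d : docs) terms.addAll(tokenize(d));
        return new ArrayList<>(terms);
    }

    // m[i][j] = number of times term i occurs in document j.
    static double[][] frequencyMatrix(List<String> docs, List<String> vocab) {
        Map<String, Integer> index = new HashMap<>();
        for (int i = 0; i < vocab.size(); i++) index.put(vocab.get(i), i);
        double[][] m = new double[vocab.size()][docs.size()];
        for (int j = 0; j < docs.size(); j++)
            for (String t : tokenize(docs.get(j)))
                m[index.get(t)][j] += 1.0;
        return m;
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList(
            "Hurricane hits New York city",
            "The hurricane in New York",
            "Election results in Madrid");
        List<String> vocab = vocabulary(docs);
        double[][] m = frequencyMatrix(docs, vocab);
        for (int i = 0; i < vocab.size(); i++)
            System.out.println(vocab.get(i) + " " + Arrays.toString(m[i]));
    }
}
```

The resulting term-by-document array is the input the SVD step expects,
with one row per term and one column per news item.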

The input data will come from web-based news sites such as Yahoo, Google,
etc.

How can we use Lucene to achieve this? Please provide the steps.

Thanks.

Faikeeyes

--
José Ramón Pérez Agüera

Dept. de Ingeniería del Software e Inteligencia Artificial
Despacho 411 tlf. 913947599
Facultad de Informática
Universidad Complutense de Madrid
