Hi,

I have a research project where I want to implement LSI technique. The scenario 
is something as follows.

Search
the news sites for the locally event based news. Cluster the similar
news items together. For example hurricane in New York city.

We want to apply basic LSI as follows

   -Key word extraction
   -Filter using stop list
   -Stemming
   -Option: Synonym detection
   - Frequency Matrix
   - SVD Decomposition

  -Cluster related News items

The input data will be from the web based news sites such as Yahoo, google etc.

How can we use Lucene to achieve this. Please provide me the steps.

Thanks.

Faikeeyes
                    



 
____________________________________________________________________________________
Now that's room service!  Choose from over 150,000 hotels
in 45,000 destinations on Yahoo! Travel to find your fit.
http://farechase.yahoo.com/promo-generic-14795097

Reply via email to