Hello group, coming back to the discussion about probabilistic and vector space model (which occured here some time ago), I would like to ask something related.
I only know the index structure Lucene offers. Does a IR system, based on the probabilistic model (e.g. Okapi) look different from a VS model? If yes, why? I hope this questions is not too stupid. I am mainly interested because of some theoretical background... Karl > Uh, there are lots of ways to construct an inverted index. > Citeseer will give you more than you can read on this topic. > > As for Lucene, see File Formats section on the site. > > Otis > > --- Karl Koch <[EMAIL PROTECTED]> wrote: > > If I create an standard index, what does Lucene store in this index? > > > > What should be stored in an index at least? Just a link to the file > > and > > keywords? Or also wordnumbers? What else? > > > > Does somebody know a paper which discusses this problem of "what to > > put in > > an good universal IR index" ? > > > > Cheers, > > Karl > > > > -- > > +++ NEU bei GMX und erstmalig in Deutschland: T�V-gepr�fter > > Virenschutz +++ > > 100% Virenerkennung nach Wildlist. Infos: > > http://www.gmx.net/virenschutz > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: [EMAIL PROTECTED] > > For additional commands, e-mail: [EMAIL PROTECTED] > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > -- +++ NEU bei GMX und erstmalig in Deutschland: T�V-gepr�fter Virenschutz +++ 100% Virenerkennung nach Wildlist. Infos: http://www.gmx.net/virenschutz --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
