So, to put it simply, I need to tokenize all my data and build an index that I load into memory? Is that how it is usually done? For example, does my browser (Firefox) keep an index of all the words in URLs and page titles in memory at any given time?
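
To make the question concrete, here is a rough sketch of what I mean by "tokenize and build an index": a tiny in-memory inverted index that maps each word to the set of documents containing it. It is written in Python only for brevity. The names tokenize, build_index and search, and the sample documents, are made up for illustration. I am not claiming this is how Firefox or any real search system works.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (no stemming)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build an inverted index: token -> set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every token of the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    # Copy the first posting set so the index itself is never mutated.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical sample data: document id -> page title.
docs = {
    1: "Racket Users list archive",
    2: "Managing Gigabytes: compressing and indexing documents",
    3: "Racket documentation: lists and pairs",
}

index = build_index(docs)
print(search(index, "racket"))       # ids of titles containing "racket"
print(search(index, "racket list"))  # ids of titles containing both words
```

In this sketch the whole index lives in memory. Whether that is acceptable presumably depends on how much data there is, which is part of what I am asking.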

On Wed, Apr 18, 2012 at 7:15 PM, Danny Yoo <d...@cs.wpi.edu> wrote:
>>
>> I think the subfield you're looking for is called "information retrieval",
>> and there are textbooks on it.
>
> Managing Gigabytes, for example:
>
> http://ww2.cs.mu.oz.au/mg/
____________________
Racket Users list:
http://lists.racket-lang.org/users