Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by LarsAronsson:
http://wiki.apache.org/nutch/Features

The comment on the change is:
Features of Nutch search could be documented here, if I knew the answers.

New page:
Missing from the current Nutch documentation (Tutorial, FAQ) is a list of 
features. This wiki page could help, if someone who knows the answers can edit 
it.

 *What kind of searches does Nutch support? (quoted, nested, truncation, 
wildcarding [and where], Boolean),
 *Is stemming an option?
 *What kind of stemming does Nutch use? (and can you add exceptions/changes?)
 *Does Nutch support Boolean operators? (can you use Google-like plus or minus 
or are you stuck with 1990s terms?)
 *Does Nutch support weighted field searching, synonym support?
 *What kinds of indexes does Nutch build? (multi-format indexing, incremental 
indexing, spell-check support, thesauri support, fielded searching,  
rank-by-reputation?)

 *How does the search engine handle punctuation and special characters? (and 
what's configurable?)
 *Which document formats are supported?
 *What post-coordination options are available? (hey Karen, what does this 
mean?)

 *How easy is Nutch to configure?
 *How transparent is its configuration to a working organization: does it 
require geeky command line stuff, or can a knowledgable manager enter a web or 
software interface to view or modify settings?

 * How are results sorted?
 * Does Nutch support deduping?
 * Can one tinker with relevance algoritms?
 * Are there ranking overrides?

Reply via email to