Hi there,
browsing through the message thread I tried to find a trail addressing file
system crawls. I want to implement an enterprise search over a networked
filesystem, crawling all sorts of documents, such as html, doc, ppt and pdf.
Nutch provides plugins enabling it to read proprietary
Hi All,
I'm trying to use the SnowballAnalyzer and for some strange reason i cannot.
I got the following error message in the logs file:
org.apache.solr.core.SolrException: Error instantiating class class
org.apache.lucene.analysis.snowball.SnowballAnalyzer
at
On Aug 30, 2006, at 2:42 AM, Bruno wrote:
browsing through the message thread I tried to find a trail
addressing file
system crawls. I want to implement an enterprise search over a
networked
filesystem, crawling all sorts of documents, such as html, doc, ppt
and pdf.
Nutch provides
On 8/29/06, sangraal aiken [EMAIL PROTECTED] wrote:
The problem only occurs when adding docs that contain ![CDATA[]] tags in
the body of the field tag. The problem also only seems to cause an add
limit on an individual post. I limited the size of my HTTP posts to 5000
documents per post, and the
: constructor requires the language parameter. I see SnowballAnalyzer
: mentioned in a comment in the example schema.xml, but there is no
: specification for language. My guess is you'll need to construct
Whooops ... i just changed that example so as not to misslead people.
FYI: the
You might want to look at acts_as_searchable for Ruby:
http://rubyforge.org/projects/ar-searchable
That's a similar plugin for the Hyperestraier search engine using its
REST interface.
On 8/28/06, Erik Hatcher [EMAIL PROTECTED] wrote:
I've spent a few hours tinkering with an Ruby ActiveRecord
On Aug 28, 2006, at 10:25 PM, Erik Hatcher wrote:
I'd like to commit this to the Solr repository. Any objections?
Once committed, folks will be able to use script/plugin
install ... to install the Ruby side of things, and using a binary
distribution of Solr's example application and a