Microkernal architecture (was Re: XML Query)

2006-04-07 Thread Erik Hatcher
for myself, ways to wire in flexible configuration into Solr and what his thoughts on it might be. Erik On 4/7/06, Erik Hatcher [EMAIL PROTECTED] wrote: Here's my opinion on this matter... a StandardRequestHandler shouldn't even need to be subclassed to plug in different query

Fwd: Microkernal architecture (was Re: XML Query)

2006-04-08 Thread Erik Hatcher
Passing on Howard's reply... Begin forwarded message: From: Howard M. Lewis Ship [EMAIL PROTECTED] Date: April 7, 2006 5:00:31 PM EDT To: Erik Hatcher [EMAIL PROTECTED] Subject: Re: Microkernal architecture (was Re: XML Query) Cool. Erik Hatcher wrote: On Apr 7, 2006, at 2:33 PM, Yonik

Re: highlighting

2006-04-18 Thread Erik Hatcher
Hoss, I've seen you mention plugin several times... I presume you mean a custom request handler. If not, could you elaborate on what you mean? Thanks, Erik On Apr 18, 2006, at 1:17 PM, Chris Hostetter wrote: : Focus on the interface: : - how clients will specify what extra

Re: Solr is indexing XML only?

2006-04-26 Thread Erik Hatcher
David, Solr doesn't index XML files, but rather XML is used as the wrapper of the text that does get indexed. The document structure is defined in schema.xml, and the field text to be indexed is sent wrapped in an XML request. Regarding your scenario, you would need to write code that

Re: One big XML file vs. many HTTP requests

2006-05-12 Thread Erik Hatcher
On May 12, 2006, at 1:02 PM, Michael Levy wrote: One nice feature of INQUERY is that you can create one large SGML file, containing lots of records, each bracketed with DOC and / DOC tags. Submitting that big SGML document for indexing goes very fast. I believe that Solr indexes one

Re: SolPHP

2006-06-01 Thread Erik Hatcher
Brian, I'd love to give any RoR bindings a try if you're a point to share. I can see all sorts of interesting fun that can be had with such bindings, such as pulling schema.xml from the server and using its field definitions to build mapping objects (like ActiveRecord), support for all

Re: List of indexed terms for a field

2006-06-07 Thread Erik Hatcher
On Jun 7, 2006, at 3:45 AM, Paul Terray wrote: I am trying Solr for some projects and I am very impressed by its simplicity and clarity of use. I am trying to make an index: Is there any way to get a list of all indexed terms for a field (especially a string or text one)? Out of the

Re: Finding documents with undefined field

2006-06-07 Thread Erik Hatcher
On Jun 7, 2006, at 4:18 PM, Yonik Seeley wrote: I do still need to do these kinds of inverted sets somehow. One problem is that not() needs to know how large the sets are. I could add a DocSet.flip(int maxDoc) or a DocSet.flip(int startIndex, int endIndex) or something like that... but a

Re: custom query response writer

2006-06-15 Thread Erik Hatcher
On Jun 15, 2006, at 3:37 AM, Chris Hostetter wrote: : are linked to the one being written, think relationships here). I'm : having difficulty customizing the output because XMLWriter's : constructor is package protected. Is there a performance or API : reason we need this class to be

Re: custom query response writer

2006-06-16 Thread Erik Hatcher
On Jun 15, 2006, at 10:18 AM, Yonik Seeley wrote: On 6/15/06, Erik Hatcher [EMAIL PROTECTED] wrote: Having a way to hook into the response writing by leveraging the ever improving Solr codebase and its utilities rather than copy/pasting would be a nice way to aim, I think. It's a double

Re: Wildcard Query

2006-06-20 Thread Erik Hatcher
On Jun 20, 2006, at 6:07 AM, Pace Davis wrote: Ok, before I go start writing a new request handlerlet me ask a dumb question and see if I am approaching this wrong in Solr. If I am trying to search a field where I have one doc with a field that has a value of Hello World...if the search

Re: Faceted Browsing questions

2006-06-24 Thread Erik Hatcher
On Jun 24, 2006, at 12:38 PM, Vish D. wrote: Erik, when do you plan on having your implementation refactored with good use of code? This weekend :) I have imported more data than my hacked implementation can handle without bumping up Jetty's JVM heap size, so I'm now at the point

Re: Faceted Browsing questions

2006-06-26 Thread Erik Hatcher
On Jun 24, 2006, at 4:29 PM, Yonik Seeley wrote: On 6/24/06, Erik Hatcher [EMAIL PROTECTED] wrote: This weekend :) I have imported more data than my hacked implementation can handle without bumping up Jetty's JVM heap size, so I'm now at the point where it is necessary for me to start using

autowarmCount usefulness

2006-06-26 Thread Erik Hatcher
I'm trying to fully understand the LRUCache and the autowarmCount parameter. Why does it make sense to auto-warm filters and query results? In my case, if a new document is added it may invalidate many filters, and it would require knowing the details of the documents added/removed to

Re: Mac OSX - error reading /usr/local/lib/libsvnjavahl-1.0.0.0.dylib

2006-08-15 Thread Erik Hatcher
Just as a data point, my team (3 of us) develop on OS X using Solr with no problems. Two of us are on MacBook Pro's and one poor soul is on a PowerBook. I know that doesn't help, and I do recall stumbling into this particular issue or one very much like it a long while ago (not Solr

Re: Viewing Lucene indexes generated by Solr

2006-08-17 Thread Erik Hatcher
Is it possibly a Lucene version mis-match with your Luke using an older version of Lucene? I frequently use Luke to peruse Lucene indices (though all on the same machine). Erik On Aug 17, 2006, at 2:41 PM, Ken Krugler wrote: Hi all, I have a Lucene index generated by Solr.

Re: about analyzer and index

2006-08-27 Thread Erik Hatcher
On Aug 27, 2006, at 3:27 AM, James liu wrote: lucene have ChineseAnalyzer and CJKAnalyzer,,,so i can search chinese keyword with it. solr have it? if not, how can i add it? Those analyzers are not part of the core Solr distribution, but you can add them easily by getting the JAR file from

Re: document support for file system crawling

2006-08-30 Thread Erik Hatcher
On Aug 30, 2006, at 2:42 AM, Bruno wrote: browsing through the message thread I tried to find a trail addressing file system crawls. I want to implement an enterprise search over a networked filesystem, crawling all sorts of documents, such as html, doc, ppt and pdf. Nutch provides

Re: acts_as_solr

2006-08-30 Thread Erik Hatcher
On Aug 28, 2006, at 10:25 PM, Erik Hatcher wrote: I'd like to commit this to the Solr repository. Any objections? Once committed, folks will be able to use script/plugin install ... to install the Ruby side of things, and using a binary distribution of Solr's example application

Re: Simple Faceted Searching out of the box

2006-09-09 Thread Erik Hatcher
On Sep 9, 2006, at 8:15 AM, Tim Archambault wrote: What is faceted browsing? Maybe an example of a site interface that is using it would be good. Dumb question, I know. Faceted browsing is like this: http://shopper.cnet.com/ and http:// www.nines.org/collex In Collex, the constrain

Re: MoreLikeThis class in Lucene within Solr?

2006-09-12 Thread Erik Hatcher
On Sep 11, 2006, at 4:54 PM, Michael Imbeault wrote: Is it possible in any way to use the MoreLikeThis class with solr (http://lucene.apache.org/java/docs/api/org/apache/lucene/search/ similar/MoreLikeThis.html)? Right now I'm determining similar docs by just querying for the whole body

Re: MoreLikeThis class in Lucene within Solr?

2006-09-12 Thread Erik Hatcher
On Sep 12, 2006, at 3:41 PM, Michael Imbeault wrote: I haven't looked at the specifics of how MoreLikeThis determine which items are similar; I'm mainly wondering about performance here. Yesterday I tried to code myself a poor man's similarity class (which was nothing more than doing a

Re: Got it working! And some questions

2006-09-13 Thread Erik Hatcher
On Sep 12, 2006, at 4:47 PM, Chris Hostetter wrote: : I've implemented the ability to override the default operator with : q.op=AND|OR. The patch is pasted below for your review. if i'm reading that right, one subtlety is that new SolrQueryParser(schema,field) no longer pas attention to

Re: Faceted Searching problems

2006-09-13 Thread Erik Hatcher
You need to use an untokenized field for facets. I can see we're going to get this question frequently now - it was mentioned earlier today in fact. You can use a copyField that is untokenized such that you can use one field for searching, and one for facets. You are obviously using a

Re: Faceted Searching problems

2006-09-13 Thread Erik Hatcher
On Sep 13, 2006, at 9:37 PM, Chris Hostetter wrote: http://www.nabble.com/Error-in-faceted-browsing-tf2267819.html ...i'll try to update the docs for facet.field to make this more obvious. Would it ever make sense to generate facets on a tokenized field? Maybe the facet implementation

Re: Simple Faceted Searching out of the box

2006-09-22 Thread Erik Hatcher
On Sep 22, 2006, at 2:45 PM, Tim Archambault wrote: I believe there's a way to access MSSQL, MySQL etc. directly with Lucene, but not sure how to do this with SOLR. Nope. Lucene is a pure search engine, with no hooks to databases, or document parsers, etc. Lots of folks have built these

Re: Extending Solr's Admin functionality

2006-09-24 Thread Erik Hatcher
On Sep 23, 2006, at 3:57 PM, Otis Gospodnetic wrote: How about another approach - expose all Solr admin data via HTTP/ XML, just like it's done with search requests? i think that would be fantastic. thinking of solr as a hard core service above and beyond lucene exposing all of its

Re: Extending Solr's Admin functionality

2006-09-24 Thread Erik Hatcher
On Sep 24, 2006, at 3:09 PM, Bertrand Delacretaz wrote: On 9/24/06, Erik Hatcher [EMAIL PROTECTED] wrote: ...perhaps some authentication/ authorization as well as HTTPS should eventually make it into the core, but getting more fine grained is unnecessary... If meaningful URLs are used

Re: Extending Solr's Admin functionality

2006-09-24 Thread Erik Hatcher
I certainly agree that a JMX interface into Solr would be nice to have. [a different topic: I've also been thinking Solr under NetKernel would be a nice fit also] On Sep 24, 2006, at 3:12 PM, Simon Willnauer wrote: Also security / firewall doubts would not concern the core and its security

Re: Can't get q.op working

2006-09-27 Thread Erik Hatcher
Kevin, I've just tried this locally using the tutorial example data, using both a default (in schema.xml) of AND and OR. (I use the Ruby response writer because it's easier to read than XML ;) Use the default operator from schema.xml:

Re: Extending Solr's Admin functionality

2006-09-27 Thread Erik Hatcher
, Erik Hatcher [EMAIL PROTECTED] wrote: Ah, so I'm beginning to get it. If we build Solr with JMX support, the admin HTTP/XML(err, Ruby) interface could be written into the JMX HTTP adapter as a separate web application, and allowing users to plug it in or not. If I'm understanding that correctly

Re: Extending Solr's Admin functionality

2006-09-27 Thread Erik Hatcher
On Sep 27, 2006, at 1:49 PM, Erik Hatcher wrote: Well, for long-hanging fruit (autumn Virginia apples): uh, it's been a long day: http://www.pacificviewpress.com/children/ longisadragon.html but i meant low-hanging fruit. * In a read-only fashion, everything from stats.jsp

Re: Couple of problems

2006-10-11 Thread Erik Hatcher
Are you ensuring that the %s replacements are properly encoded for XML? Erik On Oct 11, 2006, at 7:54 AM, mark wrote: that just returns the null pointer exception. I have checked my schema and doc: Schema: field name=id type=string indexed=false stored=true/ field

Re: Solr use case

2006-10-11 Thread Erik Hatcher
On Oct 11, 2006, at 10:24 AM, climbingrose wrote: Is it true that Solr is mainly used for applications that rarely change the underlying data? No, not at all. Solr is very dynamic, and in fact shines even more than plain Lucene when the data changes frequently. As I understand, if

Re: Invalid XML in response

2006-10-12 Thread Erik Hatcher
On Oct 12, 2006, at 11:26 AM, Chris Hostetter wrote: I agree ... comments various people have made over the last few weeks or so have gotten me thinking that standardizing the way errors are returned to the HTTP client (for both updates and selects) is one of the biggest API problems in

Re: Index-time Boosting

2006-10-23 Thread Erik Hatcher
On Oct 20, 2006, at 2:34 PM, Walter Underwood wrote: We have a well-developed browsing design, so I'd rather not mix facets in with that. Two other things work against using facets: most of our queries are known-item searches, and I think that facets work best when there is very broad agreement

Re: Index search questions; special cases

2006-11-13 Thread Erik Hatcher
On Nov 13, 2006, at 1:51 PM, Chris Hostetter wrote: That reminds me ... i seem to remember someone saying once that Nutch lso builds word based n-grams out of it's stop words, so searches on the or on won't match anything because those words are never indexed as a single tokens, but if a

Re: Index search questions; special cases

2006-11-14 Thread Erik Hatcher
On Nov 14, 2006, at 2:00 PM, Chris Hostetter wrote: CommonGrams itself seems to have some other dependencies on nutch because of other utilities in the same class, but based on a quick skim, what you really want is the nested private static class Filter extends TokenFilter which doesn't

Re: Newbie Question acts_as_solr

2006-12-18 Thread Erik Hatcher
Mike, Yes, Solr needs to be running within a Java web application server to operate. Like a database server, for example. Try Movie.find_by_solr, not lowercase movie. find_by_solr is a class-level method, not an instance method. Erik On Dec 18, 2006, at 4:01 PM, Mike Beccaria

Re: Newbie Question acts_as_solr

2006-12-19 Thread Erik Hatcher
On Dec 18, 2006, at 8:36 PM, Mike Beccaria wrote: Another question... The controller: @query= Movie.find_by_solr(params[:squery]) Then in the view: label%= @query %/label Debugging tip: use %= debug @query % to see exactly what you got. Produces 2 hits (as per count_by_solr) that look like

Re: Multiple indexes...

2006-12-21 Thread Erik Hatcher
What is the advantage to running multiple indexes from a single Solr instance over multiple Solr instances each serving a single index? Erik On Dec 21, 2006, at 3:26 PM, escher2k wrote: I looked at the forums and found that it is not possible to have multiple indexes associated

Re: Faceted search/browse

2006-12-29 Thread Erik Hatcher
On Dec 28, 2006, at 10:18 PM, Giri wrote: Do you have any simple tutorial that explains how to enable the faceted search in solr? I wouold appreciate if you point me to some resource. The wiki is the best source of information on Solr. For facets, look here:

Re: Is there a BasicSummarizer for solr?

2007-01-02 Thread Erik Hatcher
Thorsten - there is support for the Lucene Highlighter built into Solr. You can see details of how to use it here: http://wiki.apache.org/solr/HighlightingParameters Erik On Jan 2, 2007, at 7:26 AM, Thorsten Scherler wrote: Hi all, I need to implement a summary function with

Re: Solr Flare

2007-01-02 Thread Erik Hatcher
application generator) needs to have the ASL attached to it, or how this sort of thing should be handled. Thanks! Erik -Yonik On 1/2/07, Erik Hatcher [EMAIL PROTECTED] wrote: I'm finally kicking off that long dreamed of Ruby/Solr DSL, including an accompanying general purpose web

Re: Solr Flare

2007-01-02 Thread Erik Hatcher
On Jan 2, 2007, at 5:23 PM, Zaheed Haque wrote: Great! Wow! Well, as you've noted, it's still in its infancy. Being impressed will come a few weeks later, at least :) What are the difference between flare and act_as_solr as of today? As of today, tons... since acts_as_solr actually

Re: How to tell the highlighter not to escape?

2007-01-03 Thread Erik Hatcher
On Jan 3, 2007, at 7:39 AM, Thorsten Scherler wrote: However I still think the highlighter should return unescaped tags for highlighting. There is IMO no benefit for the current behavior. That really isn't practical. Suppose the prefix were and the suffix were ? It would return invalid

Re: Solr Flare

2007-01-03 Thread Erik Hatcher
should evolve fairly quickly over the next few weeks. Erik On 1/2/07, Erik Hatcher [EMAIL PROTECTED] wrote: I'm finally kicking off that long dreamed of Ruby/Solr DSL, including an accompanying general purpose web interface to showcase the greatness of Solr's capabilities

Re: Solr Flare

2007-01-03 Thread Erik Hatcher
Zaheed, All great questions. My answers will be vague, since I'm still in the early stages of how this will work, but I'll provide what I can below On Jan 3, 2007, at 2:28 PM, Zaheed Haque wrote: 1. Are you thinking of any user/password/my page/settings what not... for

Re: Searching multiple indices (solr newbie)

2007-01-07 Thread Erik Hatcher
On Jan 5, 2007, at 1:59 AM, Chris Hostetter wrote: : The issue is best described with an example: : search for canon - matches multiple categories, which will have very : different schemas : http://cnet.search.com/search? chkpt=astg.cnet.fd.search.cnetq=canontag=srch I just so happen to

Re: Handling disparate data sources in Solr

2007-01-07 Thread Erik Hatcher
The idea of having Solr handle various document types is a good one, for sure. I'm not sure what specifics would need to be implemented, but I at least wanted to reply and say its a good idea! Care has to be taken when passing a URL to Solr for it to go fetch, though. There are a lot of

Re: Searching multiple indices (solr newbie)

2007-01-12 Thread Erik Hatcher
On Jan 8, 2007, at 3:13 AM, Chris Hostetter wrote: : with a single schema -- but dynamicFields are used to store category : specific fields, so that if you are doing a category specific search, : category specific filters can be offered to you... : :

Re: Faceted search problem

2007-01-16 Thread Erik Hatcher
On Jan 16, 2007, at 10:05 PM, Peter McPeterson wrote: Hi all, I'm trying this solr ruby DSL called Flare/solrb and I don't really know how the faceted search works because I cant add whatever fields I want to to the index. This is currently not working: conn =

Re: Internationalization

2007-01-17 Thread Erik Hatcher
Way to go Bess! This is great stuff you're sharing. I have a question though... On Jan 16, 2007, at 11:48 AM, Bess Sadler wrote: Currently, we are assigning all fields, no matter what language to type string, defined as fieldtype name=string class=solr.StrField sortMissingLast=true/

Re: Using HTTP-Post for Queries

2007-01-22 Thread Erik Hatcher
On Jan 22, 2007, at 4:09 AM, Maximilian Hütter wrote: Is there a XMLQueryParser yet? I didn't find it in the source. Yes - it's part Lucene's contrib area: http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/xml-query- parser/ You'll have to build the JAR and put it into Solr's

Re: Using HTTP-Post for Queries

2007-01-22 Thread Erik Hatcher
On Jan 21, 2007, at 11:12 PM, Yonik Seeley wrote: On 1/21/07, Erik Hatcher [EMAIL PROTECTED] wrote: Yes, I think different syntaxes in different places would be useful. For example, a user enters a full-text search query that is suitable to use with Solr's QueryParser, and then the user facets

Re: Using HTTP-Post for Queries

2007-01-24 Thread Erik Hatcher
On Jan 24, 2007, at 9:38 AM, Maximilian Hütter wrote: Erik Hatcher schrieb: On Jan 22, 2007, at 4:09 AM, Maximilian Hütter wrote: Is there a XMLQueryParser yet? I didn't find it in the source. Yes - it's part Lucene's contrib area: http://svn.apache.org/repos/asf/lucene/java/trunk

Re: INTERNET ARCHIVE goes SOLR!

2007-01-28 Thread Erik Hatcher
Tracey, Thanks for sharing! This is a fantastic case study. Your experiences and numbers will certainly be a major sales pitch for bringing Solr into more organizations. On Jan 27, 2007, at 4:12 PM, Tracey Jaquith wrote: We've done essentially no optimizing outside of focusing on a

Re: add CJKTokenizer to solr

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 1:08 AM, zha jimmy wrote: hi, all I am try to config solr to support chinese tokenize。 I saw the tips in schema.xml: !-- One can also specify an existing Analyzer class that has a default constructor via the class attribute on the analyzer element

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 6:15 PM, Chris Hostetter wrote: : We override defaultOperator of OR to AND. : : We really ought to make AND the default anyway. No, no, no, no, No.. :) Your argument is a good one, and I buy it. However, I've never had a case where a user typing

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 11:01 PM, Chris Hostetter wrote: if there are cases where DisMax isn't the right choice for raw user input ... i'm not aware of them, but i'd love to hear about them :) Ok, ok, ok... I'm a self-admitted dismax avoider thus far. I'll remedy that by building in dismax

Re: SOLR-116

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 8:49 PM, Antonio Eggberg wrote: After doing quite a bit of searching what I understand is that the medicine to my problem of word count is in docTermFreq and TermEnum ... as Chris Hostetter points out clearly for statistical purpose in the post below. (Please note I am

Re: OR filtering...

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 7:26 PM, Yonik Seeley wrote: On 1/29/07, escher2k [EMAIL PROTECTED] wrote: I have a question about the syntax for doing an OR filter in my URL. How do I specify where ((fq=colA[10 TO 20]) AND (fq=state:USA OR fq=country:USA) ? Basically, I am doing a search for a

Re: How fast is Solr insert or am i doing something wrong?

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 7:08 PM, Yonik Seeley wrote: On 1/29/07, Antonio Eggberg [EMAIL PROTECTED] wrote: Is it a good practice to do commit after every insert .. is this what is taking the time.. are there any general rule of thumb. Definitely don't do a commit after every insert. Do a

Re: MoreLikeThis similarity-type queries in Solr

2007-01-31 Thread Erik Hatcher
I'm happy to apply this soon, as I could use it myself and deprecate the custom stuff I've done in Collex. At the very latest, I'll tackle this when unit tests arrive, or maybe before. Thanks for the reminder, Bertrand. Erik On Jan 31, 2007, at 3:37 PM, Bertrand Delacretaz

Re: Posting from Ant

2007-01-31 Thread Erik Hatcher
On Jan 31, 2007, at 1:56 PM, Binkley, Peter wrote: Is there an Ant task out there somewhere that can POST bunches of files to Solr, doing what the post.sh script does but with filesets? Great question. And you'd think I'd be able to help, but alas I've not done this myself. I've found

Re: lucene version in release 1.1

2007-01-31 Thread Erik Hatcher
Speaking of which, I know we are waiting on some settling of Lucene's trunk before upgrading, but I'd love to have the *:* QueryParser syntax. I had a weird issue where I indexed 50k objects recently with the basic text field schema, and a [* TO *] reported 4 documents, so I tried -[*

Re: SV: Re: SOLR-116

2007-01-31 Thread Erik Hatcher
On Jan 30, 2007, at 5:33 AM, Antonio Eggberg wrote: Erik Hatcher [EMAIL PROTECTED] skrev: So, no, the current information provided by this handler does not contain frequency information. I'd be happy to consider patches that allow it to provide more information, though I'd like to keep

Re: Posting from Ant

2007-01-31 Thread Erik Hatcher
Good eye, Hoss! I meant to mention the foreach task, but my Ant committership would have been revoked :) (just kidding, but it is controversial and this is why it doesn't exist in Ant itself). And you can, of course, always exec to curl or to Solr's example post.sh. Erik On

Fwd: Posting from Ant

2007-02-01 Thread Erik Hatcher
Ok, we have it on good authority that foreach is the way to go for Ant - POST - Solr. Erik Begin forwarded message: From: Steve Loughran [EMAIL PROTECTED] Date: February 1, 2007 8:34:33 AM EST To: Erik Hatcher [EMAIL PROTECTED] Subject: Re: Posting from Ant On 01/02/07, Erik

Fwd: Posting from Ant

2007-02-01 Thread Erik Hatcher
PROTECTED] Date: February 1, 2007 2:52:24 AM EST To: Erik Hatcher [EMAIL PROTECTED] Subject: Re: Posting from Ant yes, there is an antlib (not released, you need to build it yourself) that does posts, including http forms posting. http://svn.apache.org/viewvc/ant/sandbox/antlibs/http/trunk/ On 01/02/07

Re: Posting from Ant

2007-02-01 Thread Erik Hatcher
with it. Erik On Feb 1, 2007, at 11:43 AM, Binkley, Peter wrote: Thanks, I'll try that out. I hope there aren't any encoding issues... Nah, how likely is that? I'll report back. Peter -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Thursday, February 01, 2007 6

Re: Java client library - SOLR-20

2007-02-01 Thread Erik Hatcher
Actually, Peter, this is another great starting point for some slick Ant task wrappers. We could have a whole set of control tasks that do the optimize, commit, add, delete, and even search. Erik On Feb 1, 2007, at 2:44 PM, Ryan McKinley wrote: #6 solr-client.zip on SOLR-20 is

solrb documentation (was Re: SV: Re: SOLR-116)

2007-02-01 Thread Erik Hatcher
On Feb 1, 2007, at 6:47 AM, Zaheed Haque wrote: I think trying to tackle two problem (Learning Solr as well as solrb/flare) at the same time is giving the problem. While I was writing the i18n unit test I found out the 100% test cov. thats really cool. Now in terms of docs.. I think it would

Re: Indexing RSS

2007-02-01 Thread Erik Hatcher
Speaking of which, I know Yonik paid particular attention the Lucene GData Server when it was under active development. How do you compare Solr to it? Are we getting warmer towards Solr supporting Atom natively? ;) Erik On Feb 1, 2007, at 12:18 PM, rubdabadub wrote: Hello

Re: facet.query question

2007-02-01 Thread Erik Hatcher
of this sort of UI goodness. Erik Thanks. Peter From: Erik Hatcher [EMAIL PROTECTED] Reply-To: solr-user@lucene.apache.org To: solr-user@lucene.apache.org Subject: Re: facet.query question Date: Thu, 1 Feb 2007 21:19:08 -0500 On Feb 1, 2007, at 7:55 PM, Peter McPeterson wrote: Well

Re: facet.query question

2007-02-01 Thread Erik Hatcher
I meant facet.query, not fq, in the example below. fq is a filter query, whereas filter.query is a, umm, filter.query. Erik On Feb 1, 2007, at 9:19 PM, Erik Hatcher wrote: On Feb 1, 2007, at 7:55 PM, Peter McPeterson wrote: Well, I think I figured it out. It can be used

convert custom facets to Solr facets...

2007-02-02 Thread Erik Hatcher
Before Solr had facets, I built my own implementation in a much cruder and less performant way into Collex as custom request handlers. Now the performance issue of warming up the cache needs to be addressed. I'm going to upgrade Solr and adjust the application to work with the built-in

Re: Date ranges

2007-02-02 Thread Erik Hatcher
On Feb 2, 2007, at 5:29 PM, Yonik Seeley wrote: On 2/2/07, Michael Kimsal [EMAIL PROTECTED] wrote: I'm having a devil of a time getting date seaching to work properly. I've created a 'date' field in my schema, and I put values like 2005-08-01T23:59:59Z in it. However, when I run the

Re: Custom Tokenizer

2007-02-03 Thread Erik Hatcher
On Feb 3, 2007, at 11:18 AM, Yonik Seeley wrote: Hmmm, classloader hell... Yeah, I had a bad feeling about that external lib thing. It's a holy grail to allow dynamic pluggability in Java, but its much more difficult than it perhaps should be. I assume you are putting your analyzer in

Re: convert custom facets to Solr facets...

2007-02-03 Thread Erik Hatcher
On Feb 3, 2007, at 11:55 AM, Yonik Seeley wrote: On 2/3/07, Erik Hatcher [EMAIL PROTECTED] wrote: On Feb 2, 2007, at 4:29 PM, Yonik Seeley wrote: One downside of doing joins is that it makes it pretty hard to distribute/federate in the future because a document doesn't stand alone

Re: JOIN in Solr (was: convert custom facets to Solr facets...)

2007-02-03 Thread Erik Hatcher
I'm quite open to NOT having a JOIN in Solr if flattening the model still provides the querying capability desired. I've not fully followed the specifics that Yonik has mentioned on this thread, but it certainly is the case that denormalizing/flattening our domain does not exactly lend

performance testing practices

2007-02-05 Thread Erik Hatcher
This week I'm going to be incrementally loading up to 3.7M records into Solr, in 50k chunks. I'd like to capture some performance numbers after each chunk to see how she holds up. What numbers are folks capturing? What techniques are you using to capture numbers? I'm not looking for

Re: performance testing practices

2007-02-05 Thread Erik Hatcher
On Feb 5, 2007, at 11:15 AM, Yonik Seeley wrote: On 2/5/07, Erik Hatcher [EMAIL PROTECTED] wrote: This week I'm going to be incrementally loading up to 3.7M records into Solr, in 50k chunks. I'd like to capture some performance numbers after each chunk to see how she holds up. What numbers

Re: Spelling

2007-02-05 Thread Erik Hatcher
On Feb 5, 2007, at 3:21 PM, Michael Kimsal wrote: I've looked through the archives but don't see any specific issue relating to my question. Is there a way to have SOLR return partial matches - words that are one (or two or X) letters off the matching word? A search for 'field' would

Re: convert custom facets to Solr facets...

2007-02-06 Thread Erik Hatcher
Yonik - this is great! Thanks for codifying the use cases and providing a possible implementation. I'll tinker with this more when I can. Erik On Feb 4, 2007, at 2:13 PM, Yonik Seeley wrote: I was confusing myself too much without nailing down more concrete examples, so I

Re: Spelling

2007-02-06 Thread Erik Hatcher
Doesn't the built-in Solr Highlighting feature do the rewrite? If not, it should. I looked into this once and I believe it does have this particular bug, but I also vaguely recall it not being straightforward to rewrite the query at that point in the code. Erik On Feb 6, 2007,

Re: facet optimizing

2007-02-07 Thread Erik Hatcher
Yonik - I like the way you think Yeah! It's turtles (err, trees) all the way down. Erik /me Pulling the Algorithms book off my shelf so I can vaguely follow along. On Feb 7, 2007, at 8:22 PM, Yonik Seeley wrote: On 2/7/07, Binkley, Peter [EMAIL PROTECTED] wrote: In the

Re: facet optimizing

2007-02-08 Thread Erik Hatcher
And to add some fuel to this fire, I'm seeing in the (first 100k of UVa MARC records) data I'm processing that the facets are sparse with documents. There are a lot of documents that simply don't have a subject genre on them, for example... like almost 50%. Maybe the data will get

Re: SMILE/Rails/Babel and Dynamic Facets?

2007-02-08 Thread Erik Hatcher
On Feb 8, 2007, at 7:23 PM, Antonio Eggberg wrote: You are doing some pretty cool stuff with flare! I am amazed! Now I have some questions :-) Thanks! - Smile and Babel does everything and its so easy so I wonder why you need ruby/rails for flare? What I mean is that one could get XML

Re: dismax without q=

2007-02-09 Thread Erik Hatcher
On Feb 8, 2007, at 11:55 PM, Yonik Seeley wrote: On 2/8/07, Ryan McKinley [EMAIL PROTECTED] wrote: Assuming there is an fq=xxx in the query, could dismax support a queryless query? It does seem reasonable for both dismax and the standard request hander, esp since we have faceting in the mix

Re: Faceted search glitch???

2007-02-09 Thread Erik Hatcher
On Feb 9, 2007, at 4:45 PM, Yonik Seeley wrote: On 2/9/07, Gunther, Andrew [EMAIL PROTECTED] wrote: Try surrounding you mutli-word query in quotes: education_facet:High School. Also do you want to use the filter query param instead fq=education_facet:High School Hopefully having the client

Re: Faceted search glitch???

2007-02-09 Thread Erik Hatcher
ft and fq work for me! Erik On Feb 9, 2007, at 5:35 PM, Yonik Seeley wrote: On 2/9/07, Chris Hostetter [EMAIL PROTECTED] wrote: : The simplest form would be a term query with no escaping whatsoever: : : Something like: : fq=!term f='education_facet'High School I know yonik isn't

Re: Faceted search glitch???

2007-02-09 Thread Erik Hatcher
On Feb 9, 2007, at 9:16 PM, Mike Klaas wrote: Might I suggest: filter.query filter.term filter.future expansion I like it. While Hoss has a point, though descriptive names do make a lot of sense too. It has been a bit confusing to explain facet.query and fq as two different things.

Re: convert custom facets to Solr facets...

2007-02-12 Thread Erik Hatcher
On Feb 12, 2007, at 9:10 PM, Gmail Account wrote: This would be great! I can't help with the solution but I am very interested in using it if one of you guys can figure it out. I can't wait to see if this works out. And just for the record, Solr drives Collex @ NINES: http://

Re: Tagging

2007-02-13 Thread Erik Hatcher
There is also the possibility of keeping tags with the original documents and having them individually updated without having to resend the original full text as well: https://issues.apache.org/ jira/browse/SOLR-139 And yeah, Peter is a solr4lib kinda guy, doing some way cool stuff with

Re: Tagging

2007-02-13 Thread Erik Hatcher
On Feb 13, 2007, at 9:01 PM, Yonik Seeley wrote: And yeah, Peter is a solr4lib kinda guy, doing some way cool stuff with Lucene and Solr already: http://peel.library.ualberta.ca/ search/? search=rawpageNumber=1index=peelbibfield=bodyrawQuery=dogdigstat us= on FYI, your mailer is always

Re: Tagging

2007-02-13 Thread Erik Hatcher
On Feb 13, 2007, at 9:23 PM, Yonik Seeley wrote: On 2/13/07, Erik Hatcher [EMAIL PROTECTED] wrote: On Feb 13, 2007, at 9:01 PM, Yonik Seeley wrote: And yeah, Peter is a solr4lib kinda guy, doing some way cool stuff with Lucene and Solr already: http://peel.library.ualberta.ca/ search

Re: Using solr on windows

2007-02-14 Thread Erik Hatcher
On Feb 14, 2007, at 6:01 AM, Kainth, Sachin wrote: I'm new to solr. I've just installed it and am trying to run the example. The problem is that the tutorial seems to suggest that you need Unix to run solr. For example to add a document to the solr?/lucene? Index you need to do this: [EMAIL

Re: Using solr on windows

2007-02-14 Thread Erik Hatcher
On Feb 14, 2007, at 6:34 AM, Kainth, Sachin wrote: Thanks for the replies, I have now tried using cygwin and the result was this: Posting file C:\apache-solr-1.1.0-incubating\example\exampledocs\solr.xml to htt p://localhost:8983/solr/update

Re: convert custom facets to Solr facets...

2007-02-14 Thread Erik Hatcher
On Feb 13, 2007, at 6:07 PM, Yonik Seeley wrote: On 2/12/07, Erik Hatcher [EMAIL PROTECTED] wrote: And just for the record, Solr drives Collex @ NINES: http:// www.nines.org/collex which implements tagging along with faceted and full-text search. I've recently hacked our system

  1   2   3   4   5   6   7   8   9   10   >