Re: Tagging using SOLR

2007-09-07 Thread Erik Hatcher
On Sep 7, 2007, at 3:09 AM, Doss wrote: Thanks for the guidelines, but basically our idea is to build a system like http://del.icio.us/tag/, is it possible to take counts of similar words from a solr indexed field? How do you define similar words? Here's a tag cloud for a single user in

Re: Replication broken.. no helpful errors?

2007-09-07 Thread Bill Au
As I had pointed out in my first reply to this thread, you had a directory named temp-snapshot.20070816120113 in your data directory on the slave. Snapinstaller was mistakenly treating that as the lastest snapshot and was installing that every time it was called. Snapinstaller didn't trigger a

Re: Distribution Information?

2007-09-07 Thread Bill Au
I that case, definitely take a look at SOLR-333: http://issues.apache.org/jira/browse/SOLR-333 On the master there should be a logs/clients directory. Do you have any files in there? Bill On 9/6/07, Matthew Runo [EMAIL PROTECTED] wrote: Well, I do get... Distribution Info Master Server

Re: Indexing very large files.

2007-09-07 Thread Brian Carmalt
Lance Norskog schrieb: Now I'm curious: what is the use case for documents this large? Thanks, Lance Norskog It is a rand use case, but could become relevant for us. I was told to explore the possibilities, and that's what I'm doing. :) Since I haven't heard any suggestions as to how

Re: Indexing very large files.

2007-09-07 Thread Walter Underwood
Legal discovery can have requirements like this. --wunder On 9/7/07 4:47 AM, Brian Carmalt [EMAIL PROTECTED] wrote: Lance Norskog schrieb: Now I'm curious: what is the use case for documents this large? Thanks, Lance Norskog It is a rand use case, but could become relevant for

Dilbert (off-topic)

2007-09-07 Thread Jeff Rodenburg
It may be off-topic, but it's friday and thought all the java coders would appreciate today's dilbert. (I'm not primary a java dev, but I know the feeling) http://www.dilbert.com/comics/dilbert/archive/dilbert-20070907.html cheers, jeff r.

Re: RSS syndication Plugin

2007-09-07 Thread Thorsten Scherler
On Thu, 2007-09-06 at 09:07 -0400, Ryan McKinley wrote: perhaps: https://issues.apache.org/jira/browse/SOLR-208 in http://svn.apache.org/repos/asf/lucene/solr/trunk/example/solr/conf/xslt/ check: example_atom.xsl example_rss.xsl Awesome. Thanks very much Ryan to point me into the right

Re: Tagging using SOLR

2007-09-07 Thread Doss
Dear Thorsten, Erik, Thanks for the guidelines, but basically our idea is to build a system like http://del.icio.us/tag/, is it possible to take counts of similar words from a solr indexed field? Thanks, Mohandoss On 9/6/07, Erik Hatcher [EMAIL PROTECTED] wrote: On Sep 6, 2007, at 3:29 AM,

Lucene/Solr OnTheRoad

2007-09-07 Thread Erik Hatcher
I just added brief mentions of some upcoming Lucene/Solr-related events to this page: http://wiki.apache.org/lucene-java/OnTheRoad Below is some self-promotion of an upcoming class I have agreed to teach. It's uncomfortable to send this sort of thing out, but if I don't then you might

Re: Distribution Information?

2007-09-07 Thread Matthew Runo
Actually I don't have the clients directory... [EMAIL PROTECTED]: .../logs]$ pwd /opt/solr/logs [EMAIL PROTECTED]: .../logs]$ ls rsyncd-enabled rsyncd.log rsyncd.pid snapcleaner.log snapshooter.log snapshot.current.search2 snapshot.status.search2 [EMAIL PROTECTED]: .../logs]$ It does

Return 2 fields per facet.. name and id, for example?

2007-09-07 Thread Matthew Runo
Hello! I've found something which is either already in SOLR, or should be (as I can see it being very helpful). I couldn't figure out how to do it though.. Lets say I'm trying to print out a page of products, and I want to provide a list of brands to filter by. It would be great if in my

Re: Return 2 fields per facet.. name and id, for example?

2007-09-07 Thread Matthew Runo
Ahh... sneaky. I'll probably do the combined-name#id method. ++ | Matthew Runo | Zappos Development | [EMAIL PROTECTED] | 702-943-7833 ++ On Sep 7, 2007, at 12:38 PM, Yonik

Re: Solr and KStem

2007-09-07 Thread Walter Underwood
Even if KStem isn't ASL, we could include the plug-in code with notes about how to get the stemmer. Or, the Solr plug-in could be contributed to the group that manages the KStem distribution: http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi wunder On 9/7/07 12:59 PM, Yonik Seeley

Re: Indexing very large files.

2007-09-07 Thread Mike Klaas
On 7-Sep-07, at 4:47 AM, Brian Carmalt wrote: Lance Norskog schrieb: Now I'm curious: what is the use case for documents this large? It is a rand use case, but could become relevant for us. I was told to explore the possibilities, and that's what I'm doing. :) Since I haven't heard any

FW: Space costs of dynamic fields?

2007-09-07 Thread Lance Norskog
Are there any extra costs for dynamic v.s. static fields? That is, if I have the same dynamic field in 95% of my documents, should I just make it static and empty in the other 5%? Will query speed or change? Which choice will use more pace? Otherwise, the only downside of dynamic fields is that

org.apache.lucene.util.English missing

2007-09-07 Thread Lance Norskog
Hi folks- The Lucene Spellchecker unit test expects a Java class org.apache.lucene.util.English. I can't find it in the source trees on svn.apache.org. Can someone please mail it to me? Thanks, Lance Norskog

FW: Minor mistake on the Wiki

2007-09-07 Thread Lance Norskog
In the page http://wiki.apache.org/solr/UpdateXmlMessages We find: Optional attributes on doc * boost = float - default is 1.0 (See Lucene docs for definition of boost.) * NOTE: make sure norms are enabled (omitNorms=false in the schema.xml) for any

adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Lance Norskog
Hi- It appears that DirectUpdateHandler2.java does not actually implement the parameters that control whether to override existing documents. Should I use DirectUpdateHandler instead? Apparently DUH is slower than DUH2, but DUH implements these parameters. (We do so many overwrites that

Re: FW: Minor mistake on the Wiki

2007-09-07 Thread Yonik Seeley
On 9/7/07, Lance Norskog [EMAIL PROTECTED] wrote: In the page http://wiki.apache.org/solr/UpdateXmlMessages We find: Optional attributes on doc * boost = float - default is 1.0 (See Lucene docs for definition of boost.) * NOTE: make sure norms

Re: org.apache.lucene.util.English missing

2007-09-07 Thread Otis Gospodnetic
Really? Weird. It's here: /home/otis/dev/repos/lucene/java/trunk [EMAIL PROTECTED] trunk]$ ff English.java ./src/test/org/apache/lucene/util/English.java Note that this is Lucene and that it's src/test. Otis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simpy --

Re: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Yonik Seeley
On 9/7/07, Lance Norskog [EMAIL PROTECTED] wrote: It appears that DirectUpdateHandler2.java does not actually implement the parameters that control whether to override existing documents. It's been proposed that most of these be deprecated anyway and replaced with a simple overwrite=true/false.

Re: Solr and KStem

2007-09-07 Thread Otis Gospodnetic
Look for KStem in Lucene JIRA. Mny years ago something KStem related was contributed, and there was a discussion about licenses then. Otis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simpy -- http://www.simpy.com/ - Tag - Search - Share - Original Message

RE: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Lance Norskog
No, I'm just doing standard overwriting. It just took a little digging to be able to do it :) To gild the lily, it would be efficient in our case to add a boolean flag to each record saying whether to overwrite this record. This would make each record read-only or read-write. But I think this is

Re: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Yonik Seeley
On 9/7/07, Lance Norskog [EMAIL PROTECTED] wrote: No, I'm just doing standard overwriting. It just took a little digging to be able to do it :) Overwriting is the default... you shouldn't have to do specify anything extra when indexing the document. -Yonik

Re: Distribution Information?

2007-09-07 Thread Bill Au
I just double checked distribution.jsp. The directory where it looks for status files is hard coded to logs/clients. So for now master_status_dir in your solr/conf/scripts.conf has to be set to that so the scripts will put the status files there. It looks like they are currently in you logs

Re: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Mike Klaas
On 7-Sep-07, at 1:35 PM, Lance Norskog wrote: Hi- It appears that DirectUpdateHandler2.java does not actually implement the parameters that control whether to override existing documents. Should I use No? allowDups=true ovewritePending=false overwriteCommited=false should result in

New user question: How to show all stored fields in a result

2007-09-07 Thread melkink
Hello Solr Folks, I'm a new solr user and I'm running into a frustrating problem. I'm sure it's a simple solution I just don't have the experience with solr to know the correct way around it. I currently have approximately 600 documents stored and indexed in solr. Each document has some level

Re: Distribution Information?

2007-09-07 Thread Matthew Runo
OK. I made the change, but it seemed not to pick up the files. When I changed distrobutiondump.jsp to say... File masterdir = new File(/opt/solr/logs/clients); it worked. Thank you for your help! ++ | Matthew Runo | Zappos Development