Putting search-lucene.com back on l.a.o/solr

2011-07-06 Thread Otis Gospodnetic
Hi, I just noticed that over on http://lucene.apache.org/solr/ we are back to Lucid Find being the only search provider.  5 months ago we added search-lucene.com there, but now it's gone.  Google Analytics shows that search-lucene.com was removed from there on June 4.  This is when Lucene 3.2

[jira] [Commented] (SOLR-2568) Solr NRT (real time search) does not work

2011-06-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044184#comment-13044184 ] Otis Gospodnetic commented on SOLR-2568: Nagendra, will you be providing a patch

[jira] [Resolved] (SOLR-2568) Solr NRT (real time search) does not work

2011-06-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-2568. Resolution: Duplicate Actually, I'm closing this because we already have SOLR-2566

Re: CLOSE_WAIT after connecting to multiple shards from a primary shard

2011-05-31 Thread Otis Gospodnetic
Hi, I think I already replied to this one on general@ or some other place. Did you try the suggestion? Please send any future replies to solr-u...@lucene.apache.org instead of this dev@ list, which is for development of Lucene/Solr itself. Otis Sematext :: http://sematext.com/ :: Solr

Re: [VOTE] Release Lucene/Solr 3.2.0

2011-05-30 Thread Otis Gospodnetic
I suggest you stick this on the Wiki and make it more official if people +1 this approach. Otherwise it will get forgotten and will be hard to reference. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original

Re: Bug in boilerpipe 1.1.0 referenced from solr-cell

2011-05-05 Thread Otis Gospodnetic
Andrew, you can find the boilerpipe author's email address at http://code.google.com/p/boilerpipe/ Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ From: Andrew Bisson andrew.bis...@gossinteractive.com To:

[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked

2011-05-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13028417#comment-13028417 ] Otis Gospodnetic commented on SOLR-2399: Stefan - I only looked at the screenshot

[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked

2011-05-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13028429#comment-13028429 ] Otis Gospodnetic commented on SOLR-2399: Right, so if you look at the names

[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked

2011-05-02 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027850#comment-13027850 ] Otis Gospodnetic commented on SOLR-2399: Thanks for doing all this, Stefan! I

[jira] [Updated] (LUCENE-3054) add assert to sorts catch broken comparators in tests

2011-04-29 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated LUCENE-3054: - Affects Version/s: 3.1 Btw. this is with Lucene 3.1 For full thread: http://search

[jira] [Updated] (SOLR-319) changes SynonymFilterFactory to Analyze synonyms file

2011-04-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-319: -- Summary: changes SynonymFilterFactory to Analyze synonyms file (was: changes

[jira] [Commented] (SOLR-319) changes SynonymFilterFactory to Analyze synonyms file

2011-04-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13025361#comment-13025361 ] Otis Gospodnetic commented on SOLR-319: --- Btw., I noticed this functionality is really

MemoryIndex(Reader), IndexReader, and SegmentReader

2011-04-14 Thread Otis Gospodnetic
Hi, Short version: Should MemoryIndex (more precisely, MemoryIndexReader inside MemoryIndex.java) be using something like SegmentReader instead of extending IndexReader and hitting the exception below? Background: I'm working with some code that uses MemoryIndex and some nasty queries, some

Re: Patch for http_proxy support in solr-ruby client

2011-04-12 Thread Otis Gospodnetic
Hi, Hm, maybe you are asking where solr-ruby actually lives and is being developed? I'm not sure. I see it under solr/client/ruby/solr-ruby (no new development in ages?), but I also see an *active* solr-ruby fork over on https://github.com/bbcrd/solr-ruby . So if you want to contribute to

[jira] [Commented] (SOLR-2465) QueryElevationComponent should be reloadable w/o commit

2011-04-11 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018491#comment-13018491 ] Otis Gospodnetic commented on SOLR-2465: Related: reload synonyms without having

termInfosIndexDivisor typo in the Solr-UIMA config?

2011-04-11 Thread Otis Gospodnetic
Hi, I was looking at term index divisor and spotted this: .../lucene-solr-3.1$ ffxg -i IndexDivisor ./solr/src/test-files/solr/conf/solrconfig-termindex.xml:<int name="setTermIndexDivisor">12</int> ./solr/src/test-files/solr/conf/solrconfig-xinclude.xml:<int name="setTermIndexDivisor">12</int>

[jira] [Issue Comment Edited] (SOLR-1307) Provide a standard way to reload plugins

2011-04-11 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018531#comment-13018531 ] Otis Gospodnetic edited comment on SOLR-1307 at 4/11/11 7:37 PM

[jira] [Commented] (SOLR-1307) Provide a standard way to reload plugins

2011-04-11 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018531#comment-13018531 ] Otis Gospodnetic commented on SOLR-1307: +1 Related: http://search-lucene.com/m

[jira] [Commented] (SOLR-1853) ReplicationHandler reports incorrect replication failures

2011-04-04 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13015537#comment-13015537 ] Otis Gospodnetic commented on SOLR-1853: This issue is still open, but it looks

[jira] Commented: (SOLR-1986) Allow users to define multiple subfield types in AbstractSubTypeFieldType

2011-03-17 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13008041#comment-13008041 ] Otis Gospodnetic commented on SOLR-1986: This looks useful. Thomas or Mark, would

[jira] Commented: (SOLR-2242) Get distinct count of names for a facet field

2011-03-15 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13006807#comment-13006807 ] Otis Gospodnetic commented on SOLR-2242: Would this be more consistent

[jira] Commented: (SOLR-2429) ability to not cache a filter

2011-03-15 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13007324#comment-13007324 ] Otis Gospodnetic commented on SOLR-2429: I'm with Hoss. For many months now, I've

Re: Query click logs for custom Lucene relevance models

2011-03-02 Thread Otis Gospodnetic
-lucene.com/ - Original Message From: Andrzej Bialecki a...@getopt.org To: openrelevance-...@lucene.apache.org Sent: Wed, March 2, 2011 5:07:55 AM Subject: Re: Query click logs for custom Lucene relevance models On 3/2/11 3:39 AM, Otis Gospodnetic wrote: Hello, I'm helping

Re: dataset collection

2011-03-01 Thread Otis Gospodnetic
Itamar, Would you happen to have a screenshot that shows what ORev looks like? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Itamar Syn-Hershko ita...@code972.com To:

[jira] Commented: (SOLR-2379) Improve documentation of Analyzers and Tokenizers

2011-02-24 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12999205#comment-12999205 ] Otis Gospodnetic commented on SOLR-2379: Whatever we choose, let's stick to DRY. I

[jira] Commented: (SOLR-2286) Automatically detecting Date/Time format in the DIH

2011-02-23 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12998669#comment-12998669 ] Otis Gospodnetic commented on SOLR-2286: Adam, want to turn that into a patch? I

[jira] Commented: (SOLR-236) Field collapsing

2011-02-18 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12996550#comment-12996550 ] Otis Gospodnetic commented on SOLR-236: --- Why are people still working on this SOLR-236

[jira] Commented: (SOLR-1682) Implement CollapseComponent

2011-02-18 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12996556#comment-12996556 ] Otis Gospodnetic commented on SOLR-1682: Are there known trunk patches that make

[jira] Commented: (SOLR-2066) Search Grouping: support distributed search

2011-02-18 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12996682#comment-12996682 ] Otis Gospodnetic commented on SOLR-2066: Harish - I haven't looked at this patch

[jira] Resolved: (SOLR-1996) Possible edismax phrase query bug with query parametr like: q=(aaa+bbb)+OR+otherField:(zzz)^30

2011-01-21 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-1996. Resolution: Invalid Assignee: Otis Gospodnetic Possible edismax phrase query bug

[jira] Commented: (SOLR-1996) Possible edismax phrase query bug with query parametr like: q=(aaa+bbb)+OR+otherField:(zzz)^30

2011-01-20 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12984127#action_12984127 ] Otis Gospodnetic commented on SOLR-1996: Rafał, was that the case? If so, we can

[jira] Resolved: (SOLR-849) Add bwlimit support to snappuller

2011-01-17 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-849. --- Resolution: Duplicate Implemented in SOLR-2099. Add bwlimit support to snappuller

[jira] Commented: (SOLR-2052) Allow for a list of filter queries and a single docset filter in QueryComponent

2010-11-05 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12928791#action_12928791 ] Otis Gospodnetic commented on SOLR-2052: Looks straightforward, but doesn't apply

Re: JIRA SOLR-* karma

2010-10-06 Thread Otis Gospodnetic
Steve, I just added steve_rowe to Solr as Committer in JIRA. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Uwe Schindler u...@thetaphi.de To: dev@lucene.apache.org Sent: Wed, October

[jira] Commented: (SOLR-2138) Solr 1.4 takes long time to load cores (memory leak?)

2010-10-04 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12917581#action_12917581 ] Otis Gospodnetic commented on SOLR-2138: Identical JVM? Solr 1.4 takes long time

[jira] Commented: (LUCENE-2660) Add alternative search-provider to Lucene site

2010-09-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12915062#action_12915062 ] Otis Gospodnetic commented on LUCENE-2660: -- Alex, I think Lucene Solr use

Re: SOLR: Check for language

2010-08-21 Thread Otis Gospodnetic
Short answer: yes, doable. Details on solr-u...@lucene... Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ From: Lalit Kumar 4 lkum...@sapient.com To: dev@lucene.apache.org dev@lucene.apache.org Sent: Sat, August 21, 2010

[jira] Commented: (LUCENE-2456) A Column-Oriented Cassandra-Based Lucene Directory

2010-08-09 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896631#action_12896631 ] Otis Gospodnetic commented on LUCENE-2456: -- Karthick, I'm interested

[jira] Commented: (SOLR-1980) Implement boundary match support

2010-08-05 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895937#action_12895937 ] Otis Gospodnetic commented on SOLR-1980: What about Span queries - no use here

[jira] Resolved: (SOLR-1977) Misplaced maven artifacts

2010-07-09 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-1977. Resolution: Fixed Misplaced maven artifacts

[jira] Issue Comment Edited: (SOLR-1961) Use Lucene's Field Cache To Retrieve Stored Fields From Memory

2010-06-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12882909#action_12882909 ] Otis Gospodnetic edited comment on SOLR-1961 at 6/26/10 11:13 PM

[jira] Commented: (SOLR-1969) Make MMapDirectory configurable in solrconfig.xml

2010-06-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12882911#action_12882911 ] Otis Gospodnetic commented on SOLR-1969: Nice! MMapDirectoryFactory needs

Re: On update are index files only appended to?

2010-06-25 Thread Otis Gospodnetic
Dennis, That's a u...@lucene question. Update does delete followed by an add. A delete just marks a document as deleted. In other words, it modifies previously created index files. Additions add new files. Once certain thresholds are reached, an add will trigger a segment merge, which will
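The behaviour described in this reply (an update is a delete followed by an add, a delete only marks a document, and a merge is triggered once a threshold is reached) can be sketched with a toy model. This is a minimal illustration, not Lucene's API or file format: the class ToyIndex, its methods, the merge_threshold parameter, and the one-segment-per-add simplification are all assumptions made for the example.

```python
class ToyIndex:
    """Toy model of segment-based indexing. Hypothetical; not Lucene's API."""

    def __init__(self, merge_threshold=3):
        # Assumed threshold: merge once this many segments exist.
        self.merge_threshold = merge_threshold
        # Each segment holds its documents plus a set of deleted positions.
        self.segments = []

    def add(self, key, text):
        # Simplification: every add writes a brand-new one-document segment.
        self.segments.append({"docs": [(key, text)], "deleted": set()})
        if len(self.segments) >= self.merge_threshold:
            self._merge()

    def delete(self, key):
        # A delete does not remove anything; it only marks matching documents
        # as deleted inside the segments that already exist.
        for segment in self.segments:
            for position, (doc_key, _) in enumerate(segment["docs"]):
                if doc_key == key:
                    segment["deleted"].add(position)

    def update(self, key, text):
        # An update is a delete followed by an add.
        self.delete(key)
        self.add(key, text)

    def _merge(self):
        # Merging rewrites the live documents into a single new segment,
        # which is the point at which deleted documents are dropped.
        live = [
            doc
            for segment in self.segments
            for position, doc in enumerate(segment["docs"])
            if position not in segment["deleted"]
        ]
        self.segments = [{"docs": live, "deleted": set()}]

    def live_docs(self):
        # Documents a search would still see.
        return [
            doc
            for segment in self.segments
            for position, doc in enumerate(segment["docs"])
            if position not in segment["deleted"]
        ]
```

In this sketch, adding a document and then updating it leaves two segments, with the old version still stored but marked as deleted in the first one. A later add that reaches the threshold merges everything into one segment without the deleted entry.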

Re: can lucene search more than one word

2010-06-21 Thread Otis Gospodnetic
Hi, The words "have" and "on" are probably in your list of stopwords and are getting removed from the query/documents. Please use u...@lucene list for questions about Lucene usage. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/
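The stopword effect mentioned in this reply can be illustrated with a small sketch. It is not Lucene's analyzer code: the function analyze, the whitespace tokenization, and the contents of STOPWORDS are assumptions chosen for the example. The point is only that a term on the stopword list disappears at analysis time, so a query made up entirely of such terms has nothing left to match.

```python
# Hypothetical stopword list for illustration; a real analyzer's list differs.
STOPWORDS = {"a", "an", "and", "have", "on", "the", "to"}


def analyze(text, stopwords=STOPWORDS):
    """Lowercase, split on whitespace, and drop stopwords (toy analyzer)."""
    return [token for token in text.lower().split() if token not in stopwords]
```

With this list, analyzing the query text "have on" yields no terms at all, while "Lucene on disk" keeps only the two non-stopword terms.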

[jira] Commented: (LUCENE-2503) light/minimal stemming for euro languages

2010-06-17 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12879936#action_12879936 ] Otis Gospodnetic commented on LUCENE-2503: -- Man are you fast! Does the English

[jira] Commented: (LUCENE-2425) An Anti-Merging Multi-Directory Indexing Framework

2010-06-02 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12874445#action_12874445 ] Otis Gospodnetic commented on LUCENE-2425: -- Karthick, it looks like your May 1st

Re: Solr updateRequestHandler and performance vs. atomicity

2010-06-02 Thread Otis Gospodnetic
While preparing material for http://blog.sematext.com/2010/06/02/lucene-digest-may-2010-3/ I came across something that looks relevant: https://issues.apache.org/jira/browse/LUCENE-2456 ...where the author wrote this: In conclusion, this directory attempts to marry the rich search-based

Re: Lucid find doesn't

2010-04-25 Thread Otis Gospodnetic
Hi, Those who need to search Solr/Lucene/etc. archives (lists, wiki, site, etc.) can also use Sematext's search-lucene.com: http://search-lucene.com/ The new d...@lucene is not there yet, but it's coming: http://blog.sematext.com/2010/04/19/poll-handling-lucene-dev-merge/ Otis

[jira] Commented: (SOLR-209) Multifields and multivalued facets

2010-04-19 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12858657#action_12858657 ] Otis Gospodnetic commented on SOLR-209: --- It's been a year since you asked, Shalin, so

[jira] Commented: (LUCENE-2393) Utility to output total term frequency and df from a lucene index

2010-04-14 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12857107#action_12857107 ] Otis Gospodnetic commented on LUCENE-2393: -- I think creating a small index

[jira] Resolved: (NUTCH-570) Improvement of URL Ordering in Generator.java

2010-04-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved NUTCH-570. Resolution: Won't Fix Improvement of URL Ordering in Generator.java

[jira] Commented: (NUTCH-570) Improvement of URL Ordering in Generator.java

2010-04-07 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854665#action_12854665 ] Otis Gospodnetic commented on NUTCH-570: I'm tempted to close this issue as Won't

[jira] Commented: (NUTCH-570) Improvement of URL Ordering in Generator.java

2010-03-30 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12851461#action_12851461 ] Otis Gospodnetic commented on NUTCH-570: Serykh, what does your version of the patch

[jira] Commented: (SOLR-896) Solr Query Parser Plugin for Mark Miller's Qsol Parser

2010-03-27 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12850508#action_12850508 ] Otis Gospodnetic commented on SOLR-896: --- This looks super straightforward. The only

Re: #lucene IRC log [was: RE: lucene and solr trunk]

2010-03-23 Thread Otis Gospodnetic
Uh, the IRC logs... Do people really think making those *searchable* would be useful? I think they'd be *extremely* noisy and hard to interpret without a person really just sequentially reading them. Lots of people talking at the same time, multiple topics, lots of very short intertwined

Re: [DISCUSS] Nutch as a top level project (TLP)?

2010-03-20 Thread Otis Gospodnetic
Personally, I don't see the advantage of Nutch going for a TLP. It's not like new committers are having a hard time getting in today, it's not like they are being proposed and rejected. I also don't feel like Nutch lacks exposure/visibility -- lots of people know about it. It's just that

Re: lucene and solr trunk

2010-03-17 Thread Otis Gospodnetic
+1 for this structure and this set of steps. Otis - Original Message From: Chris Hostetter hossman_luc...@fucit.org To: solr-dev@lucene.apache.org Sent: Tue, March 16, 2010 6:46:19 PM Subject: Re: lucene and solr trunk : Otis, yes, I think so, eventually. But that's gonna

[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.

2010-03-16 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated NUTCH-740: --- Assignee: (was: Otis Gospodnetic) Configuration option to override default language

[jira] Commented: (SOLR-1822) SEVERE: Unable to move index file from: tempfile to: indexfile

2010-03-16 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12846099#action_12846099 ] Otis Gospodnetic commented on SOLR-1822: When Solr starts, doesn't it create

[jira] Commented: (SOLR-1375) BloomFilter on a field

2010-03-16 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12846139#action_12846139 ] Otis Gospodnetic commented on SOLR-1375: Heh, with the Lucene/Solr merge taking

Re: lucene and solr trunk

2010-03-16 Thread Otis Gospodnetic
Hi, Check out the dir structure mentioned here: http://markmail.org/message/gwpmaevw7tavqqge Isn't that what we want? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem search :: http://search-hadoop.com/ - Original Message From: Mark Miller

[jira] Commented: (SOLR-1553) extended dismax query parser

2010-03-01 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12839907#action_12839907 ] Otis Gospodnetic commented on SOLR-1553: What does "u" in "uf" stand for? extended

[jira] Commented: (SOLR-1375) BloomFilter on a field

2010-02-25 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12838446#action_12838446 ] Otis Gospodnetic commented on SOLR-1375: {quote} When new segments are created

[jira] Resolved: (SOLR-1788) Remove duplicate field in schema.xml

2010-02-25 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-1788. Resolution: Won't Fix Please email questions to solr-user list. Remove duplicate field

[jira] Commented: (SOLR-1719) stock TokenFilterFactory for flattening positions

2010-01-19 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12802342#action_12802342 ] Otis Gospodnetic commented on SOLR-1719: Does PositionFilterFactory fix the problem

[jira] Resolved: (SOLR-577) added support for boosting fields and documents to python solr interface

2010-01-15 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-577. --- Resolution: Won't Fix Closing per comment. added support for boosting fields and documents

[jira] Resolved: (SOLR-216) Improvements to solr.py

2010-01-15 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic resolved SOLR-216. --- Resolution: Won't Fix Closing per comment Improvements to solr.py

[jira] Commented: (SOLR-758) Enhance DisMaxQParserPlugin to support full-Solr syntax and to support alternate escaping strategies.

2010-01-15 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12800816#action_12800816 ] Otis Gospodnetic commented on SOLR-758: --- Is this still needed with enhanced dismax now

Re: New MEAP: Mahout in Action

2010-01-14 Thread Otis Gospodnetic
+1 Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message From: Grant Ingersoll gsing...@apache.org To: mahout-dev@lucene.apache.org Sent: Thu, January 14, 2010 9:14:09 PM Subject: Re: New MEAP: Mahout in Action +1. (BTW, great read so far, I've

Re: Compound File Default

2010-01-12 Thread Otis Gospodnetic
I think what has changed is that a lot more people hit this problem, and a number of people provided answers, so it's much easier now for a new person to learn what to do when this limit is hit. At the same time, seeing how some people benchmark systems without tuning them and then publish

Re: Lucene 2.9.0 Near Real Time Indexing and lock timeouts

2010-01-12 Thread Otis Gospodnetic
John, Yes, you should get 2.9.0 or 3.0.0, their indexing is faster. Still, even with 2.4.0 you shouldn't run into problems if you are really using just 1 IndexWriter. Still, I'd try upgrading first. Oh, and java-user is the place to ask. Otis -- Sematext -- http://sematext.com/ -- Solr -

Re: Lucene 2.9.0 Near Real Time Indexing and Service Crashes/restarts

2010-01-12 Thread Otis Gospodnetic
John, you should have a look at Zoie. I just finished adding LinkedIn's case study about Zoie to Lucene in Action 2, so this is fresh in my mind. :) Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message From: jchang jchangkihat...@gmail.com To:

Re: Compound File Default

2010-01-12 Thread Otis Gospodnetic
Subject: Re: Compound File Default Otis Gospodnetic wrote: At the same time, seeing how some people benchmark systems without tuning them and then publish their results, cfs may be safer. Though at the same time you get nailed with a 10-15% indexing speed hit. -- - Mark http

NYC Search in the Cloud meetup: Jan 20

2010-01-12 Thread Otis Gospodnetic
Hello, If Search Engine Integration, Deployment and Scaling in the Cloud sounds interesting to you, and you are going to be in or near New York next Wednesday (Jan 20) evening: http://www.meetup.com/NYC-Search-and-Discovery/calendar/12238220/ Sorry for dupes to those of you subscribed to

Re: Compound File Default

2010-01-11 Thread Otis Gospodnetic
+1. I never liked having the compound format be the default, since increasing the max # of open file handles is a well documented thing, at least in the UNIX world. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message From: Grant Ingersoll

[jira] Commented: (LUCENE-2127) Improved large result handling

2010-01-07 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797776#action_12797776 ] Otis Gospodnetic commented on LUCENE-2127: -- +1 for Aaron's patch in a separate

[jira] Commented: (SOLR-773) Incorporate Local Lucene/Solr

2009-12-21 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12793300#action_12793300 ] Otis Gospodnetic commented on SOLR-773: --- Dave - useful, thanks! Do you think creating

[jira] Commented: (LUCENE-1910) Extension to MoreLikeThis to use tag information

2009-12-15 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12790889#action_12790889 ] Otis Gospodnetic commented on LUCENE-1910: -- * I'll second Mark's suggestion

[jira] Commented: (SOLR-1632) Distributed IDF

2009-12-11 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12789379#action_12789379 ] Otis Gospodnetic commented on SOLR-1632: I didn't look at the patch, but from your

[jira] Commented: (SOLR-1632) Distributed IDF

2009-12-10 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12789120#action_12789120 ] Otis Gospodnetic commented on SOLR-1632: What about this approach: http

java.net.URL synchronization

2009-12-09 Thread Otis Gospodnetic
Hello, Has anyone seen this: http://www.supermind.org/blog/580/java-net-url-synchronization-bottleneck ? Is this something that needs to be addressed in Nutch (and thus in Bixo, and thus in the common crawler project)? Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-12-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785473#action_12785473 ] Otis Gospodnetic commented on LUCENE-2091: -- +1 for skipping BM25 and going

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-12-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785690#action_12785690 ] Otis Gospodnetic commented on LUCENE-2091: -- Joaquin - could you please explain

[jira] Commented: (SOLR-1277) Implement a Solr specific naming service (using Zookeeper)

2009-12-03 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785694#action_12785694 ] Otis Gospodnetic commented on SOLR-1277: How about this idea for the what to do

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783530#action_12783530 ] Otis Gospodnetic commented on LUCENE-2091: -- Has anyone compared this particular

[jira] Issue Comment Edited: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783530#action_12783530 ] Otis Gospodnetic edited comment on LUCENE-2091 at 11/30/09 4:21 AM

Sentence detection/extraction as Tokenizer?

2009-11-27 Thread Otis Gospodnetic
Hello, The contrib/wordnet package contains an AnalyzerUtil class with a method that extracts sentences from text/String. It is super-simplistic, so probably not very accurate, but I am wondering if *conceptually* it would make sense to have a Tokenizer that extracts sentences? I suppose

NMF for Taste

2009-11-27 Thread Otis Gospodnetic
Hello, Recently, I read Matrix Factorization Techniques for Recommender Systems from http://research.yahoo.com/node/2859 . I was wondering what you think about this vs. what we have in Taste now? It looks like Collective Intelligence talks about this on p232-239 + 302... but I haven't read

Distributing index over N disks

2009-11-24 Thread Otis Gospodnetic
Hello, Would it make sense and be possible to spread different index files over multiple disks (without resorting to putting an index on a RAID)? For example, what if the index files didn't live in a single index dir, but were organized by their type in a shallow dir tree, like this:

Re: Whither Query Norm?

2009-11-24 Thread Otis Gospodnetic
I'm late to the thread, and although it looks like the discussion is over, I'll inline a Q for Jake. I should add in my $0.02 on whether to just get rid of queryNorm() altogether: -1 from me, even though it's confusing, because having that call there (somewhere, at least) allows you to

Re: Whither Query Norm?

2009-11-24 Thread Otis Gospodnetic
Hello, Regarding that monstrous term-idf map. Is this something that one could use to adjust the scores in http://wiki.apache.org/solr/DistributedSearch#Distributed_Searching_Limitations scenario? Say a map like that was created periodically for each shard and distributed to all other nodes

contrib-db javadoc 404s

2009-11-18 Thread Otis Gospodnetic
Not sure why, but this contrib's javadoc is missing: http://lucene.apache.org/java/2_9_1/api/contrib-db/index.html Also, the name db always bugged me a little. Would it make more sense to call it contrib-bdb instead of contrib-db? Otis

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-17 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12779116#action_12779116 ] Otis Gospodnetic commented on MAHOUT-165: Yes, Wolfgang contributed MemoryIndex

Re: Low priority: analytics for lucene.apache.org/mahout?

2009-11-13 Thread Otis Gospodnetic
I'm the same. http://markmail.org/message/iluezhazv7m43k5s Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR - Original Message From: Sean Owen sro...@gmail.com To: mahout-dev@lucene.apache.org Sent:

[jira] Commented: (SOLR-1553) extended dismax query parser

2009-11-11 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12776661#action_12776661 ] Otis Gospodnetic commented on SOLR-1553: I think you need to click on Issue Links

[jira] Commented: (SOLR-1550) statistics for request handlers should report std dev

2009-11-09 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12775163#action_12775163 ] Otis Gospodnetic commented on SOLR-1550: Haven't tried the patch yet, just had

[jira] Commented: (SOLR-1537) Dedupe Sharded Search Results by Shard Order or Score

2009-11-05 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12774053#action_12774053 ] Otis Gospodnetic commented on SOLR-1537: The ID here being the uniqueKey? i.e

[jira] Commented: (SOLR-1536) Support for TokenFilters that may modify input documents

2009-11-05 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12774057#action_12774057 ] Otis Gospodnetic commented on SOLR-1536: Is this better than writing a custom

Avro in Solr

2009-11-03 Thread Otis Gospodnetic
Hello, Avro is still young, from what I know, but I'm wondering if anyone has any thoughts on whether there is a place or need for Avro in Solr? http://www.cloudera.com/blog/2009/11/02/avro-a-format-for-big-data/ Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene,

Re: Avro in Solr

2009-11-03 Thread Otis Gospodnetic
and name of fields in any document is completely arbitrary in Solr. Is it possible to represent such a data structure in Avro? On Wed, Nov 4, 2009 at 3:43 AM, Otis Gospodnetic wrote: Hello, Avro is still young, from what I know, but I'm wondering if anyone has any thoughts on whether
