Hi,
I just noticed that over on http://lucene.apache.org/solr/ we are back to Lucid
Find being the only search provider. 5 months ago we added search-lucene.com
there, but now it's gone. Google Analytics shows that search-lucene.com was
removed from there on June 4. This is when Lucene 3.2
[
https://issues.apache.org/jira/browse/SOLR-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13044184#comment-13044184
]
Otis Gospodnetic commented on SOLR-2568:
Nagendra, will you be providing a patch
[
https://issues.apache.org/jira/browse/SOLR-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-2568.
Resolution: Duplicate
Actually, I'm closing this because we already have SOLR-2566
Hi,
I think I already replied to this one on general@ or some other place. Did you
try the suggestion? Please send any future replies to
solr-u...@lucene.apache.org instead of this dev@ list, which is for development
of Lucene/Solr itself.
Otis
Sematext :: http://sematext.com/ :: Solr
I suggest you stick this on the Wiki and make it more official if people +1
this
approach. Otherwise it will get forgotten and will be hard to reference.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original
Andrew, you can get to Boilerplate author's email address on
http://code.google.com/p/boilerpipe/
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
From: Andrew Bisson andrew.bis...@gossinteractive.com
To:
[
https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13028417#comment-13028417
]
Otis Gospodnetic commented on SOLR-2399:
Stefan - I only looked at the screenshot
[
https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13028429#comment-13028429
]
Otis Gospodnetic commented on SOLR-2399:
Right, so if you look at the names
[
https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027850#comment-13027850
]
Otis Gospodnetic commented on SOLR-2399:
Thanks for doing all this, Stefan!
I
[
https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic updated LUCENE-3054:
-
Affects Version/s: 3.1
Btw. this is with Lucene 3.1
For full thread: http://search
[
https://issues.apache.org/jira/browse/SOLR-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic updated SOLR-319:
--
Summary: changes SynonymFilterFactory to Analyze synonyms file (was:
changes
[
https://issues.apache.org/jira/browse/SOLR-319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13025361#comment-13025361
]
Otis Gospodnetic commented on SOLR-319:
---
Btw., I noticed this functionality is really
Hi,
Short version:
Should MemoryIndex (more precisely, MemoryIndexReader inside MemoryIndex.java)
be using something like SegmentReader instead of extending IndexReader and
hitting the exception below?
Background:
I'm working with some code that uses MemoryIndex and some nasty queries, some
Hi,
Hm, maybe you are asking where solr-ruby actually lives and is being developed?
I'm not sure. I see it under solr/client/ruby/solr-ruby (no new development in
ages?), but I also see an *active* solr-ruby fork over on
https://github.com/bbcrd/solr-ruby . So if you want to contribute to
[
https://issues.apache.org/jira/browse/SOLR-2465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018491#comment-13018491
]
Otis Gospodnetic commented on SOLR-2465:
Related: reload synonyms without having
Hi,
I was looking at term index divisor and spotted this:
.../lucene-solr-3.1$ ffxg -i IndexDivisor
./solr/src/test-files/solr/conf/solrconfig-termindex.xml:int
name=setTermIndexDivisor12/int
./solr/src/test-files/solr/conf/solrconfig-xinclude.xml:int
name=setTermIndexDivisor12/int
[
https://issues.apache.org/jira/browse/SOLR-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018531#comment-13018531
]
Otis Gospodnetic edited comment on SOLR-1307 at 4/11/11 7:37 PM
[
https://issues.apache.org/jira/browse/SOLR-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13018531#comment-13018531
]
Otis Gospodnetic commented on SOLR-1307:
+1
Related:
http://search-lucene.com/m
[
https://issues.apache.org/jira/browse/SOLR-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13015537#comment-13015537
]
Otis Gospodnetic commented on SOLR-1853:
This issue is still open, but it looks
[
https://issues.apache.org/jira/browse/SOLR-1986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13008041#comment-13008041
]
Otis Gospodnetic commented on SOLR-1986:
This looks useful. Thomas or Mark, would
[
https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13006807#comment-13006807
]
Otis Gospodnetic commented on SOLR-2242:
Would this be more consistent
[
https://issues.apache.org/jira/browse/SOLR-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13007324#comment-13007324
]
Otis Gospodnetic commented on SOLR-2429:
I'm with Hoss. For many months now, I've
-lucene.com/
- Original Message
From: Andrzej Bialecki a...@getopt.org
To: openrelevance-...@lucene.apache.org
Sent: Wed, March 2, 2011 5:07:55 AM
Subject: Re: Query click logs for custom Lucene relevance models
On 3/2/11 3:39 AM, Otis Gospodnetic wrote:
Hello,
I'm helping
Itamar,
Would you happen to have a screenshot that shows that ORev looks like?
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
From: Itamar Syn-Hershko ita...@code972.com
To:
[
https://issues.apache.org/jira/browse/SOLR-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12999205#comment-12999205
]
Otis Gospodnetic commented on SOLR-2379:
Whatever we choose, let's stick to DRY. I
[
https://issues.apache.org/jira/browse/SOLR-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12998669#comment-12998669
]
Otis Gospodnetic commented on SOLR-2286:
Adam, want to turn that into a patch?
I
[
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12996550#comment-12996550
]
Otis Gospodnetic commented on SOLR-236:
---
Why are people still working on this SOLR-236
[
https://issues.apache.org/jira/browse/SOLR-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12996556#comment-12996556
]
Otis Gospodnetic commented on SOLR-1682:
Are there known trunk patches that make
[
https://issues.apache.org/jira/browse/SOLR-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12996682#comment-12996682
]
Otis Gospodnetic commented on SOLR-2066:
Harish - I haven't looked at this patch
[
https://issues.apache.org/jira/browse/SOLR-1996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-1996.
Resolution: Invalid
Assignee: Otis Gospodnetic
Possible edismax phrase query bug
[
https://issues.apache.org/jira/browse/SOLR-1996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12984127#action_12984127
]
Otis Gospodnetic commented on SOLR-1996:
Rafał, was that the case? If so, we can
[
https://issues.apache.org/jira/browse/SOLR-849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-849.
---
Resolution: Duplicate
Implemented in SOLR-2099.
Add bwlimit support to snappuller
[
https://issues.apache.org/jira/browse/SOLR-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12928791#action_12928791
]
Otis Gospodnetic commented on SOLR-2052:
Looks straight forward, but doesn't apply
Steve, I just added steve_rowe to Solr as Committer in JIRA.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
From: Uwe Schindler u...@thetaphi.de
To: dev@lucene.apache.org
Sent: Wed, October
[
https://issues.apache.org/jira/browse/SOLR-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12917581#action_12917581
]
Otis Gospodnetic commented on SOLR-2138:
Identical JVM?
Solr 1.4 takes long time
[
https://issues.apache.org/jira/browse/LUCENE-2660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12915062#action_12915062
]
Otis Gospodnetic commented on LUCENE-2660:
--
Alex, I think Lucene Solr use
Short answer: yes, doable. Details on solr-u...@lucene...
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
From: Lalit Kumar 4 lkum...@sapient.com
To: dev@lucene.apache.org dev@lucene.apache.org
Sent: Sat, August 21, 2010
[
https://issues.apache.org/jira/browse/LUCENE-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896631#action_12896631
]
Otis Gospodnetic commented on LUCENE-2456:
--
Karthick, I'm interested
[
https://issues.apache.org/jira/browse/SOLR-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895937#action_12895937
]
Otis Gospodnetic commented on SOLR-1980:
What about Span queries - no use here
[
https://issues.apache.org/jira/browse/SOLR-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-1977.
Resolution: Fixed
Misplaced maven artifacts
[
https://issues.apache.org/jira/browse/SOLR-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12882909#action_12882909
]
Otis Gospodnetic edited comment on SOLR-1961 at 6/26/10 11:13 PM
[
https://issues.apache.org/jira/browse/SOLR-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12882911#action_12882911
]
Otis Gospodnetic commented on SOLR-1969:
Nice!
MMapDirectoryFactory needs
Dennis,
That's a u...@lucene question.
Update does delete followed by an add.
A delete just marks a document as deleted. In other words, it modifies
previously created index files.
Additions add new files. Once certain thresholds are reached, an add will
trigger a segment merge, which will
Hi,
Words have and on are probably in your list of stopwords and getting removed
from the query/documents.
Please use u...@lucene list for questions about Lucene usage.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
[
https://issues.apache.org/jira/browse/LUCENE-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12879936#action_12879936
]
Otis Gospodnetic commented on LUCENE-2503:
--
Man are you fast!
Does the English
[
https://issues.apache.org/jira/browse/LUCENE-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12874445#action_12874445
]
Otis Gospodnetic commented on LUCENE-2425:
--
Karthick, it looks like your May 1st
While preparing material for
http://blog.sematext.com/2010/06/02/lucene-digest-may-2010-3/ I came across
something that looks relevant:
https://issues.apache.org/jira/browse/LUCENE-2456
...where the author wrote this:
In conclusion, this directory attempts to marry the rich search-based
Hi,
Those who need to search Solr/Lucene/etc. archives (lists, wiki, site, etc.)
can also use Sematext's search-lucene.com:
http://search-lucene.com/
The new d...@lucene is not there yet, but it's coming:
http://blog.sematext.com/2010/04/19/poll-handling-lucene-dev-merge/
Otis
[
https://issues.apache.org/jira/browse/SOLR-209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12858657#action_12858657
]
Otis Gospodnetic commented on SOLR-209:
---
It's been a year since you asked, Shalin, so
[
https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12857107#action_12857107
]
Otis Gospodnetic commented on LUCENE-2393:
--
I think creating a small index
[
https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved NUTCH-570.
Resolution: Won't Fix
Improvement of URL Ordering in Generator.java
[
https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854665#action_12854665
]
Otis Gospodnetic commented on NUTCH-570:
I'm tempted to close this issue as Won't
[
https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12851461#action_12851461
]
Otis Gospodnetic commented on NUTCH-570:
Serykh, what does your version of the patch
[
https://issues.apache.org/jira/browse/SOLR-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12850508#action_12850508
]
Otis Gospodnetic commented on SOLR-896:
---
This looks super straight forward. The only
Uh, the IRC logs... Do people really think making those *searchable* would be
useful?
I think they'd be *extremely* noisy and hard to interpret without a person
really just sequentially reading them. Lots of people talking at the same
time, multiple topics, lots of very short intertwined
Personally, I don't see the advantage of Nutch going for a TLP. It's not like
new committers are having a hard time getting in today, it's not like they are
being proposed and rejected. I also don't feel like Nutch lacks
exposure/visibility -- lots of people know about it. It's just that
+1 for this structure and this set of steps.
Otis
- Original Message
From: Chris Hostetter hossman_luc...@fucit.org
To: solr-dev@lucene.apache.org
Sent: Tue, March 16, 2010 6:46:19 PM
Subject: Re: lucene and solr trunk
: Otis, yes, I think so, eventually. But that's gonna
[
https://issues.apache.org/jira/browse/NUTCH-740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic updated NUTCH-740:
---
Assignee: (was: Otis Gospodnetic)
Configuration option to override default language
[
https://issues.apache.org/jira/browse/SOLR-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12846099#action_12846099
]
Otis Gospodnetic commented on SOLR-1822:
When Solr starts, doesn't it create
[
https://issues.apache.org/jira/browse/SOLR-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12846139#action_12846139
]
Otis Gospodnetic commented on SOLR-1375:
Heh, with the Lucene/Solr merge taking
Hi,
Check out the dir structure mentioned here:
http://markmail.org/message/gwpmaevw7tavqqge
Isn't that what we want?
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Hadoop ecosystem search :: http://search-hadoop.com/
- Original Message
From: Mark Miller
[
https://issues.apache.org/jira/browse/SOLR-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12839907#action_12839907
]
Otis Gospodnetic commented on SOLR-1553:
What does u in uf stand for?
extended
[
https://issues.apache.org/jira/browse/SOLR-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12838446#action_12838446
]
Otis Gospodnetic commented on SOLR-1375:
{quote}
When new segments are created
[
https://issues.apache.org/jira/browse/SOLR-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-1788.
Resolution: Won't Fix
Please email questions to solr-user list.
Remove duplicate field
[
https://issues.apache.org/jira/browse/SOLR-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12802342#action_12802342
]
Otis Gospodnetic commented on SOLR-1719:
Does PositionFilterFactory fix the problem
[
https://issues.apache.org/jira/browse/SOLR-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-577.
---
Resolution: Won't Fix
Closing per comment.
added support for boosting fields and documents
[
https://issues.apache.org/jira/browse/SOLR-216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Otis Gospodnetic resolved SOLR-216.
---
Resolution: Won't Fix
Closing per comment
Improvements to solr.py
[
https://issues.apache.org/jira/browse/SOLR-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12800816#action_12800816
]
Otis Gospodnetic commented on SOLR-758:
---
I this still needed with enhanced dismax now
+1
Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch
- Original Message
From: Grant Ingersoll gsing...@apache.org
To: mahout-dev@lucene.apache.org
Sent: Thu, January 14, 2010 9:14:09 PM
Subject: Re: New MEAP: Mahout in Action
+1. (BTW, great read so far, I've
I think what has changed is that a lot more people hit this problem, and a
number of people provided answers, so it's much easier now for a new person to
learn what to do when this limit is hit.
At the same time, seeing how some people benchmark systems without tuning them
and then publish
John,
Yes, you should get 2.9.0 or 3.0.0, their indexing is faster. Still, even with
2.4.0 you shouldn't run into problems if you are really using just 1
IndexWriter. Still, I'd try upgrading first. Oh, and java-user is the place
to ask.
Otis
--
Sematext -- http://sematext.com/ -- Solr -
John, you should have a look at Zoie. I just finished adding LinkedIn's case
study about Zoie to Lucene in Action 2, so this is fresh in my mind. :)
Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch
- Original Message
From: jchang jchangkihat...@gmail.com
To:
Subject: Re: Compound File Default
Otis Gospodnetic wrote:
At the same time, seeing how some people benchmark systems without tuning
them
and then publish their results, cfs may be safer.
Though at the same time you get nailed with a 10-15% indexing speed hit.
--
- Mark
http
Hello,
If Search Engine Integration, Deployment and Scaling in the Cloud sounds
interesting to you, and you are going to be in or near New York next Wednesday
(Jan 20) evening:
http://www.meetup.com/NYC-Search-and-Discovery/calendar/12238220/
Sorry for dupes to those of you subscribed to
+1. I never liked having the compound format be the default, since increasing
the max # of open file handles is a well documented thing, at least in the UNIX
world.
Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch
- Original Message
From: Grant Ingersoll
[
https://issues.apache.org/jira/browse/LUCENE-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797776#action_12797776
]
Otis Gospodnetic commented on LUCENE-2127:
--
+1 for Aaron's patch in a separate
[
https://issues.apache.org/jira/browse/SOLR-773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12793300#action_12793300
]
Otis Gospodnetic commented on SOLR-773:
---
Dave - useful, thanks!
Do you think creating
[
https://issues.apache.org/jira/browse/LUCENE-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12790889#action_12790889
]
Otis Gospodnetic commented on LUCENE-1910:
--
* I'll second Mark's suggestion
[
https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12789379#action_12789379
]
Otis Gospodnetic commented on SOLR-1632:
I didn't look a the patch, but from your
[
https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12789120#action_12789120
]
Otis Gospodnetic commented on SOLR-1632:
What about this approach: http
Hello,
Has anyone seen this:
http://www.supermind.org/blog/580/java-net-url-synchronization-bottleneck ?
Is this something that needs to be addressed in Nutch (and thus in Bixo, and
thus in the common crawler project)?
Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch
[
https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785473#action_12785473
]
Otis Gospodnetic commented on LUCENE-2091:
--
+1 for skipping BM25 and going
[
https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785690#action_12785690
]
Otis Gospodnetic commented on LUCENE-2091:
--
Joaquin - could you please explain
[
https://issues.apache.org/jira/browse/SOLR-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12785694#action_12785694
]
Otis Gospodnetic commented on SOLR-1277:
How about this idea for the what to do
[
https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783530#action_12783530
]
Otis Gospodnetic commented on LUCENE-2091:
--
Has anyone compared this particular
[
https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783530#action_12783530
]
Otis Gospodnetic edited comment on LUCENE-2091 at 11/30/09 4:21 AM
Hello,
The contrib/wordnet package contains an AnalyzerUtil class with a method that
extracts sentences from text/String. It is super-simplistic, so probably not
very accurate, but I am wondering if *conceptually* it would make sense to have
a Tokenizer that extracts sentences? I suppose
Hello,
Recently, I read Matrix Factorization Techniques for Recommender Systems from
http://research.yahoo.com/node/2859 . I was wondering what you think about
this vs. what we have in Taste now?
It looks like Collective Intelligence talks about this on p232-239 + 302... but
I haven't read
Hello,
Would it make sense and be possible to spread different index files over
multiple disks (without resorting to putting an index on a RAID)?
For example, what if the index files didn't live in a single index dir, but
were organized by their type in a snallow dir tree, like this:
I'm late to the thread, and although it looks like the discussion is over, I'll
inline a Q for Jake.
I should add in my $0.02 on whether to just get rid of queryNorm() altogether:
-1 from me, even though it's confusing, because having that call there
(somewhere, at least) allows you to
Hello,
Regarding that monstrous term-idf map.
Is this something that one could use to adjust the scores in
http://wiki.apache.org/solr/DistributedSearch#Distributed_Searching_Limitations
scenario? Say a map like that was created periodically for each shard and
distributed to all other nodes
Not sure why, but this contrib's javadoc is missing:
http://lucene.apache.org/java/2_9_1/api/contrib-db/index.html
Also, the name db always bugged me a little. Would it make more sense to
call it contrib-bdb instead of contrib-db?
Otis
[
https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12779116#action_12779116
]
Otis Gospodnetic commented on MAHOUT-165:
-
Yes, Wolfgang contributed MemoryIndex
I'm the same. http://markmail.org/message/iluezhazv7m43k5s
Otis
--
Sematext is hiring -- http://sematext.com/about/jobs.html?mls
Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR
- Original Message
From: Sean Owen sro...@gmail.com
To: mahout-dev@lucene.apache.org
Sent:
[
https://issues.apache.org/jira/browse/SOLR-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12776661#action_12776661
]
Otis Gospodnetic commented on SOLR-1553:
I think you need to click on Issue Links
[
https://issues.apache.org/jira/browse/SOLR-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12775163#action_12775163
]
Otis Gospodnetic commented on SOLR-1550:
Haven't tried the patch yet, just had
[
https://issues.apache.org/jira/browse/SOLR-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12774053#action_12774053
]
Otis Gospodnetic commented on SOLR-1537:
The ID here being the uniqueKey? i.e
[
https://issues.apache.org/jira/browse/SOLR-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12774057#action_12774057
]
Otis Gospodnetic commented on SOLR-1536:
Is this better than writing a custom
Hello,
Avro is still young, from what I know, but I'm wondering if anyone has any
thoughts on whether there is a place or need for Avro in Solr?
http://www.cloudera.com/blog/2009/11/02/avro-a-format-for-big-data/
Otis
--
Sematext is hiring -- http://sematext.com/about/jobs.html?mls
Lucene,
and name of fields in any document is completely arbitrary
in Solr. Is it possible to represent such a datastructure in avro?
On Wed, Nov 4, 2009 at 3:43 AM, Otis Gospodnetic
wrote:
Hello,
Avro is still young, from what I know, but I'm wondering if anyone has any
thoughts on whether
401 - 500 of 1633 matches
Mail list logo