How to index the contents from SVN repository

2009-04-26 Thread Ashish P

Is there any way to index the contents of an SVN repository in Solr?
-- 
View this message in context: 
http://www.nabble.com/How-to-index-the-contents-from-SVN-repository-tp23240110p23240110.html
Sent from the Solr - User mailing list archive at Nabble.com.



Question about MoreLikeThis

2009-04-26 Thread jlist9
Hi, I have a question about what MoreLikeThis means - I suppose
it means get more documents that are similar to _this_ document.
So I would expect the query to always take a known document as an argument.
I wonder how I should interpret this query:

http://localhost:8983/solr/select?q=apache&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mintf=1&fl=id,score

It doesn't seem to specify a document. So what is the "This" in
MoreLikeThis in this case? Or does it mean something else, and
not a document?
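
For reference, here is a minimal sketch of how the query above is put together, one parameter at a time. The helper name build_mlt_url is made up for illustration; the parameter names and values are the ones from the URL in the message. To my understanding, with mlt=true on a regular /select request, similar documents are computed for each document matched by q rather than for one named document, but treat that as an assumption to check against your Solr version.

```python
from urllib.parse import urlencode


def build_mlt_url(base, query, mlt_fields, mindf=1, mintf=1, fl=("id", "score")):
    """Build a /select URL that switches on MoreLikeThis (hypothetical helper)."""
    params = [
        ("q", query),                       # the main query; its hits are the "seed" documents
        ("mlt", "true"),                    # enable MoreLikeThis
        ("mlt.fl", ",".join(mlt_fields)),   # fields used to judge similarity
        ("mlt.mindf", str(mindf)),          # minimum document frequency for a term
        ("mlt.mintf", str(mintf)),          # minimum term frequency for a term
        ("fl", ",".join(fl)),               # fields to return
    ]
    # safe="," keeps the commas in the field lists readable
    return base + "?" + urlencode(params, safe=",")


url = build_mlt_url("http://localhost:8983/solr/select", "apache", ["manu", "cat"])
print(url)
```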





SOLRizing advice?

2009-04-26 Thread Paul Libbrecht


Hello list,

I am surely not the only one who wishes to migrate from bare Lucene to
Solr.
There can be many different reasons, e.g. faceting, web externalization,
ease of update... What interests me here are the steps needed, in the
form of advice as to what to use.


Here are a few hints. I would love a web page grouping all of these:

- first, change references to IndexWriter/IndexReader/IndexSearcher to
  those of Solr, using the embedded Solr server

- make a first Solr schema with appropriate analyzers by defining
  particular dynamic fields

- slowly replace the query methods with Solr queries, gradually taking
  advantage of Solr features

- expose the Solr core on the web, at least for admin, by merging the
  web.xml

Does such a web page already exist?

thanks in advance

paul
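
As an illustration of the dynamic-fields step in the list above, here is a minimal sketch that turns a legacy document into a Solr XML add message. It assumes dynamic fields with the suffixes "_s" (string) and "_t" (analyzed text), as in the Solr example schema. The function name, the "kind" labels and the sample values are invented for the example and are not taken from the message.

```python
import xml.etree.ElementTree as ET

# Assumed mapping from the kind of a legacy Lucene field to a dynamic-field
# suffix. "_s" and "_t" follow the Solr example schema; adjust to your schema.
SUFFIX_BY_KIND = {"keyword": "_s", "text": "_t"}


def to_solr_add_xml(doc_id, fields):
    """Render one document as a Solr <add> message.

    fields is a list of (name, kind, value) tuples taken from the old document.
    """
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    ET.SubElement(doc, "field", name="id").text = doc_id
    for name, kind, value in fields:
        # The suffix picks the dynamic field, and so the analyzer, in the schema.
        ET.SubElement(doc, "field", name=name + SUFFIX_BY_KIND[kind]).text = value
    return ET.tostring(add, encoding="unicode")


xml_payload = to_solr_add_xml(
    "exercise-42",
    [("title", "text", "Fractions & ratios"), ("lang", "keyword", "en")],
)
print(xml_payload)
```

The resulting string would be posted to the update handler; that part is left out here because it depends on how the Solr core is exposed.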



Re: SOLRizing advice?

2009-04-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
Slowly migrating from Lucene to Solr is a very uncommon use case.
I somehow feel that a piecemeal migration is going to be more
expensive than migrating everything at once.

happy hacking...
--Noble


-- 
--Noble Paul


Re: facet results in order of rank

2009-04-26 Thread ristretto.rb
BUMP.

After waiting a bit for a comment on this, I'm assuming there's no
support for this type of feature. So we are pushing on with a completely
different implementation. Unfortunately, we have neither the time nor
the expertise to consider implementing it ourselves.

gene


On Fri, Apr 24, 2009 at 6:55 PM, ristretto.rb ristretto...@gmail.com wrote:
 Hello,

 Is it possible to order the facet results on some ranking score?
 I've had a look at the facet.sort param,
 (http://wiki.apache.org/solr/SimpleFacetParameters#head-569f93fb24ec41b061e37c702203c99d8853d5f1)
 but that seems to order the facets either by count or by index value
 (in my case, alphabetical).

 We are facing a large number of facet results for multi-term queries
 whose terms are OR'ed together. We want to keep the OR nature of our
 queries, but we want to know which facet values are likely to give
 higher-ranked results. We could AND the terms together to make the
 facet list more manageable, but we would be filtering out too many
 results. We prefer to OR the terms and let the ranking bring the good
 stuff to the top.

 For example, suppose we have an index of all known animals, and each
 doc has a field AO for animal-origin.

 Suppose we search for:  wolf grey forest Europe
 and generate facets on AO. We might get the following facet results:

 For the AO field, lots of countries of the world probably have grey or
 forest or wolf or Europe in their indexing data, so I'm asserting we'd
 get a big list here. But only some of the countries will have all 4
 terms, and those are the facets that will be the most interesting to
 drill down on. Is there a way to figure out which facet is the most
 highly ranked like this?

 This is a contrived example, not part of any real project I know
 about.  Just trying to get my point across.

 thanks
 Gene

 Gene Campbell
 Picante Solutions Limited
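
One client-side approximation of the idea in the quoted message, sketched under assumptions: fetch the top N hits with fl=AO,score, then order the AO values by the summed score of the hits that carry them. This is not a Solr feature, and it only sees the rows actually fetched, not the whole result set. The function name and the sample hits are invented for illustration.

```python
from collections import defaultdict


def rank_facets(results, facet_field, top_n=None):
    """Order facet values by the summed relevance score of the hits carrying them.

    results is a list of dicts, each with a "score" and (optionally) facet_field.
    Returns (value, total_score, hit_count) tuples, best first.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for doc in results:
        value = doc.get(facet_field)
        if value is None:
            continue  # hit without a value for this facet
        totals[value] += doc["score"]
        counts[value] += 1
    # Highest total first; ties broken alphabetically for a stable order.
    ranked = sorted(totals, key=lambda v: (-totals[v], v))
    return [(v, totals[v], counts[v]) for v in ranked[:top_n]]


hits = [
    {"id": "1", "AO": "Poland", "score": 2.0},
    {"id": "2", "AO": "Canada", "score": 0.5},
    {"id": "3", "AO": "Poland", "score": 1.5},
    {"id": "4", "AO": "Canada", "score": 0.25},
    {"id": "5", "AO": "Brazil", "score": 0.25},
]
for value, total, count in rank_facets(hits, "AO"):
    print(value, total, count)
```

Summing favours values that have many decent hits; taking the maximum score per value instead would favour values that have one excellent hit. Which of the two matches "most highly ranked" is a design choice.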



Re: How to index the contents from SVN repository

2009-04-26 Thread Ashish P

Right. But is there a way to track file updates and diffs?
Thanks,
Ashish

Noble Paul നോബിള്‍  नोब्ळ् wrote:
 
 If you can check it out into a directory using the SVN command, then
 you may use DIH to index the content.
 
 A combination of FileListEntityProcessor and PlainTextEntityProcessor
 may help.
 
 


 
 
 
 -- 
 --Noble Paul
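
The quoted suggestion uses DIH. As a rough illustration of the same checkout-then-index idea done outside DIH (a swapped-in technique, not the DIH configuration itself), the sketch below walks an already checked-out working copy, skips the .svn metadata directories, and builds a Solr XML add message. The field names "id" and "text" and both function names are assumptions; it also assumes the files are plain text.

```python
import os
import xml.etree.ElementTree as ET


def iter_working_copy_files(root):
    """Yield paths of regular files under root, skipping .svn metadata."""
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune in place so os.walk does not descend into .svn directories.
        dirnames[:] = sorted(d for d in dirnames if d != ".svn")
        for name in sorted(filenames):
            yield os.path.join(dirpath, name)


def build_add_message(root):
    """Build one <add> message with a <doc> per file (field names are assumed)."""
    add = ET.Element("add")
    for path in iter_working_copy_files(root):
        # errors="replace" keeps going on files that are not valid UTF-8.
        with open(path, encoding="utf-8", errors="replace") as handle:
            content = handle.read()
        doc = ET.SubElement(add, "doc")
        ET.SubElement(doc, "field", name="id").text = os.path.relpath(path, root)
        ET.SubElement(doc, "field", name="text").text = content
    return ET.tostring(add, encoding="unicode")
```

Calling build_add_message on the checkout directory returns a string that could be posted to the update handler. Binary files would need filtering or text extraction first.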
 
 




Re: How to index the contents from SVN repository

2009-04-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
I guess not.
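
DIH itself may not track changes, as the reply says. One possible workaround outside Solr, sketched under assumptions: run `svn diff --summarize -r OLD:NEW` between the last indexed revision and the current one, then turn its output into a list of paths to re-index and a list of paths to delete. The sketch assumes each output line starts with a status letter (A, M or D, or a blank when only properties changed) followed by the path; check this against your Subversion version. The function name is invented.

```python
def plan_index_updates(summary_text):
    """Split `svn diff --summarize` output into (paths_to_index, paths_to_delete).

    Assumed line shape: a one-letter item status, a property status column,
    padding, then the path, e.g. "M       docs/guide.txt".
    """
    to_index, to_delete = [], []
    for line in summary_text.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        status, path = line[:1], line[2:].strip()
        if status == "D":
            to_delete.append(path)   # removed in the repository
        else:
            to_index.append(path)    # added, modified, or property-only change
    return to_index, to_delete


sample = (
    "M       docs/guide.txt\n"
    "A       src/new.py\n"
    "D       src/old.py\n"
    " M      props/only.txt\n"
)
changed, removed = plan_index_updates(sample)
print(changed, removed)
```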



-- 
--Noble Paul


Re: Temporarily indexing data on lucene

2009-04-26 Thread Atlantis69

Any ideas or updates? 

Thanks,
Amit


Atlantis69 wrote:
 
 I have a list of the public profiles of my site's users in a Solr index.
 There is also a community around them, which is currently not in the
 index.

 While searching, I have to offer an option to search only my community
 (friends and friends of friends). I could do it with a database query or
 by storing the connection graph in memory, but then I lose the power of
 Solr's analyzers, tokenizers and filters.

 Alternatively, I am thinking of storing this relation temporarily in
 another Solr instance (running on a separate machine) and using it for
 search, i.e. creating this index asynchronously when the user logs in
 and destroying it when the user logs out.

 So when the user searches for a profile, the application will merge the
 results from the two indexes and return unique users.

 Is this a practical/scalable solution? If yes, what performance
 considerations should I look out for with this new Solr instance? For
 merging, should I build an application on top of Solr, or does Solr
 provide any way of merging results from multiple indexes?
 
 Thanks,
 Amit
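
The application-side merge described in the quoted message could look roughly like the sketch below. It assumes both indexes return hits as dicts sharing a unique "id" field. It keeps the first occurrence of each user and does not try to compare scores, since scores from two separate indexes are generally not comparable. The function name and sample data are invented.

```python
def merge_unique(primary, secondary, key="id"):
    """Concatenate two ranked hit lists, keeping the first hit seen per key."""
    seen = set()
    merged = []
    for hit in list(primary) + list(secondary):
        if hit[key] in seen:
            continue  # already returned from the earlier list
        seen.add(hit[key])
        merged.append(hit)
    return merged


community_hits = [{"id": "u7", "name": "Ana"}, {"id": "u3", "name": "Raj"}]
public_hits = [{"id": "u3", "name": "Raj"}, {"id": "u9", "name": "Lee"}]
for hit in merge_unique(community_hits, public_hits):
    print(hit["id"], hit["name"])
```

Putting the community hits first means friends always outrank strangers; interleaving the two lists would be a different policy.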
 
 
