Re: Solr Statistics, num docs

2010-07-15 Thread Li Li
numDocs is the total number of indexed docs. Maybe your docs have duplicate keys. When a key is duplicated, the older doc is deleted. uniqueKey is defined in schema.xml 2010/7/16 Karthik K : > Hi, > Is numDocs in solr statistics equal to the total number of documents that > are searchable on solr? I find t
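The overwrite behaviour described in this reply can be modelled with a small sketch that does not touch Solr at all. The function name `live_doc_count`, the `id` key field and the sample documents are hypothetical. The sketch only illustrates why the number of live documents can be lower than the number of documents submitted when some of them share a unique key.

```python
def live_doc_count(docs, key_field="id"):
    """Model of unique-key overwrite: a later doc with the same key
    replaces the earlier one, so only one copy stays live."""
    index = {}
    for doc in docs:
        index[doc[key_field]] = doc  # same key -> older entry is replaced
    return len(index)


# Three documents are submitted, but two of them share the key "a".
submitted = [
    {"id": "a", "title": "first"},
    {"id": "b", "title": "second"},
    {"id": "a", "title": "first, sent again"},
]
print(len(submitted), live_doc_count(submitted))
```

If the two numbers differ for real data, duplicate keys are one plausible explanation for a low numDocs.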

Solr Statistics, num docs

2010-07-15 Thread Karthik K
Hi, Is numDocs in solr statistics equal to the total number of documents that are searchable on solr? I find that this number is very low in my case compared to the total number of documents indexed. Please let me know the possible reasons for this. Thanks, Karthik

no response

2010-07-15 Thread satya swaroop
Hi all, I have a problem with Solr. When I send documents (.doc) I am not getting a response. Example: sa...@geodesic-desktop:~/Desktop$ curl "http://localhost:8080/solr/update/extract?stream.file=/home/satya/Desktop/InvestmentDecleration.doc&stream.contentType=application

Timeout in distributed search

2010-07-15 Thread marship
Hi all. Is there any way to have timeout support in distributed search? I found https://issues.apache.org/jira/browse/SOLR-502 but it looks like it is not in the main release of Solr 1.4. I have 70 cores; when I search, some respond in 0-700ms. Some return in about 2s. Some need a very long time, more

Novice seeking help to change filters to search without diacritics

2010-07-15 Thread HSingh
I am new to Solr and seeking your help to change filter from ISOLatin1AccentFilterFactory to ASCIIFoldingFilterFactory files. I am not sure what change is to be made and where exactly this change is to be made. And finally, what would replace mapping-ISOLatin1Accent.txt file? I would like Solr t

Securing Solr 1.4 in a glassfish container AS NEW THREAD

2010-07-15 Thread Sharp, Jonathan
Hi All, I am considering securing Solr with basic auth in glassfish using the container, by adding to web.xml and adding sun-web.xml file to the distributed WAR as below. If using SolrJ to index files, how can I provide the credentials for authentication to the http-client (or can someone

Re: Spatial Search - Best choice ?

2010-07-15 Thread Chris Hostetter
: Subject: Spatial Search - Best choice ? : In-Reply-To: : References: http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you chan

Securing Solr 1.4 in a glassfish container

2010-07-15 Thread Sharp, Jonathan
Hi All, I am considering securing Solr with basic auth in glassfish using the container, by adding to web.xml and adding sun-web.xml file to the distributed WAR as below. If using SolrJ to index files, how can I provide the credentials for authentication to the http-client (or can someone poin

Re: range query on TrieLongField - strange result?

2010-07-15 Thread Yonik Seeley
Yikes... confirmed! Something is very wrong here. -Yonik http://www.lucidimagination.com On Thu, Jul 15, 2010 at 8:47 PM, Yonik Seeley wrote: > Hmmm, I'll try and duplicate. > > -Yonik > http://www.lucidimagination.com > > 2010/7/15 Koji Sekiguchi : >> I see different results between SortableLo

Re: range query on TrieLongField - strange result?

2010-07-15 Thread Yonik Seeley
Hmmm, I'll try and duplicate. -Yonik http://www.lucidimagination.com 2010/7/15 Koji Sekiguchi : > I see different results between SortableLongField and TrieLongField > when I try same range query. > > This is the test data: > > > > ZERO > 0 > 0 > > > Long.MAX_VALUE-1000 > 9223372036854774807

Re: Custom comparator

2010-07-15 Thread Rodrigo Rezende
How is it possible to access TopDocs using the Solr API? Thanks, Rodrigo On Thu, Jul 15, 2010 at 8:03 PM, Erick Erickson wrote: > Hmmm, why do you need a custom collector? You can use > the form of the search that returns a TopDocs, from which you > can get the max score and the array of ScoreDoc each

range query on TrieLongField - strange result?

2010-07-15 Thread Koji Sekiguchi
I see different results between SortableLongField and TrieLongField when I try same range query. This is the test data: ZERO 0 0 Long.MAX_VALUE-1000 9223372036854774807 9223372036854774807 Long.MAX_VALUE 9223372036854775807 9223372036854775807 where f_tl is TrieLongField and f_sl is Sort

Re: Custom comparator

2010-07-15 Thread Erick Erickson
Hmmm, why do you need a custom collector? You can use the form of the search that returns a TopDocs, from which you can get the max score and the array of ScoreDoc each of which has its score. So you can just let the underlying code get the top N documents, and throw out any that don't score above
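The approach suggested here (take the top N hits, then drop the ones that score too low relative to the best hit) can be sketched in plain Python. This is not the Lucene or Solr API: the function name `filter_by_relative_score` and the (id, score) tuples are hypothetical stand-ins for the ScoreDoc array, and the 1% default comes from the original question in this thread.

```python
def filter_by_relative_score(scored_docs, top_n, min_fraction=0.01):
    """Keep the top_n hits, then discard any whose score is not above
    min_fraction of the best score (a stand-in for maxScore)."""
    top = sorted(scored_docs, key=lambda pair: pair[1], reverse=True)[:top_n]
    if not top:
        return []
    max_score = top[0][1]
    threshold = max_score * min_fraction
    return [(doc_id, score) for doc_id, score in top if score > threshold]


# Hypothetical (doc id, score) pairs standing in for search hits.
hits = [("d1", 10.0), ("d2", 5.0), ("d3", 0.05), ("d4", 0.2)]
print(filter_by_relative_score(hits, top_n=4))
```

Filtering after collection keeps the search itself unchanged, which is the point of the reply: no custom collector is needed for this part.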

Re: problem with storing??

2010-07-15 Thread Peter Karich
satya, just a side question: did you use the dismax handler? dismax won't handle q=*:*; for dismax the query should be empty (q=) to get all docs. > First, look at the SOLR admin page and see if there's anything in your > index. > > Second, examine the SOLR log files, see what comes out when you try this. > > Y

Re: How to speed up solr search speed

2010-07-15 Thread Peter Karich
Hi Scott! > I am aware these cores on same server are interfering with each other. Thats not good. Try to use only one core per CPU. With more per CPU you won't have any benefits over the single-core version, I think. > can solr use more memory to avoid disk operation conflicts? Yes, only the m

Re: problem with storing??

2010-07-15 Thread Erick Erickson
First, look at the SOLR admin page and see if there's anything in your index. Second, examine the SOLR log files, see what comes out when you try this. You really have to provide some more details other than "it didn't work" for us to do more than guess. Reviewing this might help: http://wiki.ap

RE: how to eliminating scoring from a query?

2010-07-15 Thread Nagelberg, Kallin
How about: 1. Create a date field to indicate index time. 2. Use a date filter to restrict articles to today and yesterday, such as myindexdate:"[NOW/DAY-1DAY TO NOW/DAY+1DAY]" 3. Sort on that field. -Kallin Nagelberg -----Original Message----- From: oferiko [mailto:ofer...@gmail.com] Sent: Th
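The three steps above could be turned into request parameters roughly as follows. This is a sketch under assumptions: the base URL, the function name `build_recent_query` and the ascending default are hypothetical, `myindexdate` is the field name used in the message, and the range is written without the surrounding double quotes on the assumption that they only delimited the example.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_recent_query(user_query, date_field="myindexdate", order="asc",
                       base_url="http://localhost:8983/solr/select"):
    """Build a select URL that filters on the index-time date field and
    sorts on that field instead of on score."""
    params = {
        "q": user_query,
        # Step 2: filter query restricting hits to yesterday and today.
        "fq": f"{date_field}:[NOW/DAY-1DAY TO NOW/DAY+1DAY]",
        # Step 3: sort on the date field.
        "sort": f"{date_field} {order}",
    }
    return base_url + "?" + urlencode(params)


url = build_recent_query("solr")
print(url)
# Decode the query string again to check what the server would receive.
print(parse_qs(urlsplit(url).query))
```

Putting the date restriction in fq rather than in q keeps it out of the main query string, so the user's query stays untouched.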

Re: how to eliminating scoring from a query?

2010-07-15 Thread oferiko
thanks, i want it to be the indexing order, but with a limit, something like everything that matches my query, and was indexed since yesterday, in an ascending order. Ofer On Thu, Jul 15, 2010 at 8:25 PM, Erick Erickson [via Lucene] < ml-node+970139-889457701-316...@n3.nabble.com > wrote: > B

Re: How to get search results taking into account ortographies errors ???

2010-07-15 Thread Robert Muir
I think you want to look at using solr.ASCIIFoldingFilterFactory: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ASCIIFoldingFilterFactory On Thu, Jul 15, 2010 at 12:43 PM, Ariel wrote: > Hi every body I am working with apache solr and django with spanish > documents and I woul

Re: how to eliminating scoring from a query?

2010-07-15 Thread Erick Erickson
By specifying a sort that doesn't include score. I think it's just automatic then. It wouldn't make sense to eliminate scoring *without* sorting by some other field , you'd essentially get a random ordering. Best Erick On Thu, Jul 15, 2010 at 1:43 AM, oferiko wrote: > > in http://www.lucidim

Custom re-rank

2010-07-15 Thread Rodrigo Rezende
Hello, I am doing a rerank function of solr results using Solr Function query Plugin: http://localhost:8983/solr/articles.0/select/?q={!func}myReRank(query($qq))&qq=query here Inside myReRank plugin I do the things. First, is that the best way to do that? If so, Is it possible to limit myReRank

Re: Nested Function Query Syntax

2010-07-15 Thread Rodrigo Rezende
Thank you, that works fine! On Thu, Jul 15, 2010 at 2:01 PM, Yonik Seeley wrote: > On Thu, Jul 15, 2010 at 12:49 PM, Rodrigo Rezende wrote: >> Yeah, it is redundant, but I am using that to use the solr query >> response as input of a plugin function: >> >> http://localhost:8983/solr/articles.0/s

Re: Nested Function Query Syntax

2010-07-15 Thread Yonik Seeley
On Thu, Jul 15, 2010 at 12:49 PM, Rodrigo Rezende wrote: > Yeah, it is redundant, but I am using that to use the solr query > response as input of a plugin function: > > http://localhost:8983/solr/articles.0/select/?q={!func}myFunction(query({!query > v='the query string here'})) This might be ea

Re: Nested Function Query Syntax

2010-07-15 Thread Rodrigo Rezende
Yeah, it is redundant, but I am using that to use the solr query response as input of a plugin function: http://localhost:8983/solr/articles.0/select/?q={!func}myFunction(query({!query v='the query string here'})) So in myFunction I can take the query results, with the score, and write my custom

How to get search results taking into account ortographies errors ???

2010-07-15 Thread Ariel
Hi everybody, I am working with Apache Solr and Django with Spanish documents. When a user makes a search and forgets to accent the words, I would like the search results to show both possibilities: the results without the accent and the results with the accent. Would you help me please? Regards Ari

Re: Tag generation

2010-07-15 Thread Tommaso Teofili
Hi all, in UIMA there are two components which wrap OpenCalais [1] and AlchemyAPI [2][3] services that you could use, then you could also add something else to the tagging pipeline (using existing stuff [4] or implementing your own logic). Hope this helps. Tommaso [1] : http://uima.apache.org/sand

Re: Nested Function Query Syntax

2010-07-15 Thread Yonik Seeley
On Thu, Jul 15, 2010 at 11:51 AM, Rodrigo Rezende wrote: > I solved the problem. > The correct syntax is: > > http://localhost:8983/solr/articles.0/select/?q={!func}query({!query > v='hello'})&fl=Document.title,score,&debugQuery=on query() causes a new QParser to be created. so does {!query}...

Re: Tag generation

2010-07-15 Thread Markus Jelsma
Check out OpenCalais [1]. Maybe it works for your case and language. [1]: http://www.opencalais.com/ On Thursday 15 July 2010 17:34:31 kenf_nc wrote: > A colleague mentioned that he knew of services where you pass some content > and it spits out some suggested Tags or Keywords that would be best

Re: Nested Function Query Syntax

2010-07-15 Thread Rodrigo Rezende
I solved the problem. The correct syntax is: http://localhost:8983/solr/articles.0/select/?q={!func}query({!query v='hello'})&fl=Document.title,score,&debugQuery=on Rodrigo On Thu, Jul 15, 2010 at 12:32 PM, Rodrigo Rezende wrote: > Hello, > > I am trying to use function nested query syntax w

Re:Re: How to speed up solr search speed

2010-07-15 Thread marship
Hi. Peter. And I checked my example/solr/conf/solrconfig.xml. (solr 1.4) I don't see in it. But I see it in solr website's solrconfig.xml wiki. So should I add it or the default(without it ) is ok? Thanks On 2010-07-15 17:19:57, "Peter Karich" wrote: >How does your queries look like? Do you us

Re: Tag generation

2010-07-15 Thread Olivier Dobberkau
On 15.07.2010 at 17:34, kenf_nc wrote: > A colleague mentioned that he knew of services where you pass some content > and it spits out some suggested Tags or Keywords that would be best suited > to associate with that content. > > Does anyone know if there is a contrib to Solr or Lucene that do

Re:Re: How to speed up solr search speed

2010-07-15 Thread marship
Hi. Peter. I think I am not using faceting, highlighting ... I read about them but don't know how to work with them. I am using the default "example" just change the indexed fields. For my case, I don't think solr can work as fast as 100-200ms on average. I tried some keywords on only single

Tag generation

2010-07-15 Thread kenf_nc
A colleague mentioned that he knew of services where you pass some content and it spits out some suggested Tags or Keywords that would be best suited to associate with that content. Does anyone know if there is a contrib to Solr or Lucene that does something like this? Or a third party tool that

Nested Function Query Syntax

2010-07-15 Thread Rodrigo Rezende
Hello, I am trying to use function nested query syntax with solr 1.4.1, but I am not sure if I am doing it the right way: I try this query and I get all documents with a score of 12 http://localhost:8983/solr/articles.0/select/?q={!func}product(3,4)&fl=Document.title,score,&debugQuery=on Using the s

Custom comparator

2010-07-15 Thread dan sutton
Hi, I have a requirement to have a custom comparator that keeps the top N documents (chosen by some criteria) but only if their score is more than e.g. 1% of the maxScore. Looking at SolrIndexSearcher.java, I was hoping to have a custom TopFieldCollector.java to return these via TopFieldCollector.

London open source search meet-up

2010-07-15 Thread Richard Marr
Hi all, Apologies for the cross-post. We are organising another open source search social evening in London on Wednesday the 28 July. As usual the plan is to get together and chat about search technology, from Lucene to Solr, Hadoop, Mahout, Xapian and the like - bringing together people from ac

Re: How to find first document for the ALL search

2010-07-15 Thread Ian Connor
Hi, The good news is that: /solr/select?q=*%3A*&fq=&start=1&rows=1&fl=id did work (kind of odd really) so I am reading all the documents from the bad one to a new solr using the same configuration using ruby (complete rebuild). So far so good - it has gone through 500k out of 1.7M and seem

SOLR Search Query : Exception : Software caused connection abort

2010-07-15 Thread sandeep kumar
Hi, I am trying to test SOLR search with a very big query, but when I try, it throws an exception: "Exception : Software caused connection abort". I'm using HTTP POST and the server I'm using is Tomcat. Does a SOLR query have any limitations on size or length, etc.? Pls help me and let me know soluti

Re: Query help

2010-07-15 Thread Rupert Bates
Sorry, my mistake, the example should have been as follows: ?q=ContentType:News&sort=DatePublished+desc&start=0&rows=5 ?q=ContentType:Analysis&sort=DatePublished+desc&start=0&rows=5 Rupert On 15 July 2010 13:02, kenf_nc wrote: > > Your example though doesn't show different ContentType, it shows

Re: Error in building Solr-Cloud (ant example)

2010-07-15 Thread Robert Muir
this test needs to call super.setUp() in its setUp()... On Thu, Jul 15, 2010 at 3:15 AM, Rebecca Watson wrote: > hi mark, > > jayf and i are working together :) > > i tried to apply the patch to the trunk, but the ant tests failed... > > i checked out the latest trunk: > svn checkout http://svn.a

Re: Query help

2010-07-15 Thread kenf_nc
Your example though doesn't show different ContentType, it shows a different sort order. That would be difficult to achieve in one call. Sounds like your best bet is asynchronous (multi-threaded) calls if your architecture will allow for it. -- View this message in context: http://lucene.472066.

Re: Spatial Search - Best choice ?

2010-07-15 Thread Saïd Radhouani
Thanks for the links, but this makes things even harder :) Do you have any recommendations for one pointer over another? Thanks, -S On Jul 15, 2010, at 1:08 PM, findbestopensource wrote: > Some more pointers to spatial search, > > http://www.jteam.nl/products/spatialsolrplugin.html > http://c

Re: Spatial Search - Best choice ?

2010-07-15 Thread findbestopensource
Some more pointers to spatial search, http://www.jteam.nl/products/spatialsolrplugin.html http://code.google.com/p/spatial-search-lucene/ http://sujitpal.blogspot.com/2008/02/spatial-search-with-lucene.html Regards Aditya www.findbestopensource.com On Thu, Jul 15, 2010 at 3:54 PM, Saïd Radhoua

Spatial Search - Best choice ?

2010-07-15 Thread Saïd Radhouani
Hi, Using Solr 1.4, I'm now working on adding spatial search options, such as distance-based sorting, Bounding-box filter, etc. To the best of my knowledge, there are three possible points we can start from: 1. The http://blog.jteam.nl/2009/08/03/geo-location-search-with-solr-and-lucene/ 2. Th

Re: problem with storing??

2010-07-15 Thread satya swaroop
Hi, I sent the commit after adding the documents, but the problem is the same. Regards, satya

Query help

2010-07-15 Thread Rupert Bates
Hello, I hope someone can help me with this. I have a website which is built on top of Solr and the home page is composed of 4 sections one for each type of content on the site. At the moment to populate this page, I am using 4 separate queries of the form: ?q=ContentType:News&sort=DatePublished+d

AW: problem with storing??

2010-07-15 Thread Bastian Spitzer
Hi, did you send a commit at some time after adding documents? The added docs are "pending" to be added to the index until you finally commit them. You can see your pending added documents count in the statistics page in the admin panel. Cheers -----Original Message----- From: satya swaroop [mailt

problem with storing??

2010-07-15 Thread satya swaroop
Hi all, I am new to Solr and I followed the wiki and got everything going right. But when I send any html/txt/pdf documents the response is as follows: 0576 but when I search in Solr I don't find the result. Can anyone tell me what is to be done? The curl I used for the above o/p

Re: Solr Best Version

2010-07-15 Thread Peter Karich
we are using 1.4.0 without any major problems so far. (So, I would use 1.4.1 for the next app, just to have the latest version.) the trunk is also nice to use fuzzy search performance boosts. Peter. > Hi all, > I'm going to develop a search architecture solr based and i wonder if you > could sugg

Re: How to speed up solr search speed

2010-07-15 Thread Peter Karich
What do your queries look like? Do you use faceting, highlighting, ... ? Did you try to customize the cache? Setting the HashDocSet to "0.005 of all documents" improves our search speed a lot. Did you optimize the index? 500ms seems to be slow for an 'average' search. I am not an expert but with

Solr Best Version

2010-07-15 Thread Alessandro Benedetti
Hi all, I'm going to develop a Solr-based search architecture and I wonder if you could suggest which Solr version would best suit my needs. I have 10 Solr machines which use replication, sharding and multi-core; 1 Solr server would index documents (Xml, *Pdf*, Text ...) on a *NFS* *v3* Filesyst

Re: question on wild card

2010-07-15 Thread Mark N
Thanks Erick. One more question: when "the perfect world*" is passed as the search query it is converted to "? perfect world". What does "?" mean? Since I am using the standard analyzer I thought the stop word "the" is removed. Thanks On Thu, Jul 15, 2010 at 7:01 AM, Erick Erickson wrote: > The best w

Re:RE: How to speed up solr search speed

2010-07-15 Thread marship
Hi. Thanks for replying. My document has many different fields(about 30 fields, 10 different type of documents but these are not the point ) and I have to search over several fields. I was putting all 76M documents into several lucene indexes and use the default lucene.net ParaSearch t

Re: Using stored terms for faceting

2010-07-15 Thread Peter Karich
Dear Hoss, I will try to clarify what I want to achieve :-) Assume I have the following three docs: id:1 description: bmx bike 123 id:2 description: bmx bike 321 id:3 description: a mountain bike If I query against *:* I want to get the facets and its document count ala: bike: 3 bmx: 2 I rea
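The counts asked for in this message can be reproduced with a small sketch that is independent of Solr. It assumes plain lowercase whitespace tokenization, which is an assumption and not what a Solr analyzer chain necessarily does. The function name `term_doc_counts` is hypothetical, and the three documents are the ones given above.

```python
from collections import Counter


def term_doc_counts(docs):
    """Count, for each term, how many documents contain it at least once."""
    counts = Counter()
    for text in docs.values():
        # set() so a term repeated inside one document is counted once.
        counts.update(set(text.lower().split()))
    return counts


docs = {
    1: "bmx bike 123",
    2: "bmx bike 321",
    3: "a mountain bike",
}
counts = term_doc_counts(docs)
print("bike:", counts["bike"])
print("bmx:", counts["bmx"])
```

For these three documents the sketch yields bike: 3 and bmx: 2, matching the facet counts the message expects.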

RE: How to speed up solr search speed

2010-07-15 Thread Fornoville, Tom
Is there any reason why you have to limit each instance to only 1M documents? If you could put more documents in the same core I think it would dramatically improve your response times. -----Original Message----- From: marship [mailto:mars...@126.com] Sent: Thursday, 15 July 2010 6:23 To: solr-us

Re: Error in building Solr-Cloud (ant example)

2010-07-15 Thread Rebecca Watson
hi mark, jayf and i are working together :) i tried to apply the patch to the trunk, but the ant tests failed... i checked out the latest trunk: svn checkout http://svn.apache.org/repos/asf/lucene/dev/trunk patched it with SOLR-1873, and put the two JARs into trunk/solr/lib ant compile in the

Re: Multiple cores or not?

2010-07-15 Thread Ravi Kumar Julapalli
Hi, The architecture is probably better served depending on the content and data sources. If you have multiple data sources for each of the sites.. e.g database serving up site content, Feeds serving up syndicated content , then multicore will be better served using each core by data source c