Pagination searchComponent

2014-01-15 Thread ostrowskij
*Hi, I'm trying to add searchComponent to my solr. It works fine except pagination. The results are print but pagination get data from solr quey results. Have any ideas? Xml returned:* response lst name=responseHeader int name=status0/int int name=QTime8972/int

High cpu ratio when solr sleep

2014-01-15 Thread YouPeng Yang
Hi I find that the cpu ratio is very high when the tomcat contained solr 4.6 sleep. The pid 13359 shows that my sleeping solr web container take high cpu ratio Any insights? [solr@fkapp1 ~]$ top -d -1 -u solr top - 17:30:15 up 302 days, 7:10, 5 users, load average: 4.54, 4.52, 4.47

Facets maxcount feature?

2014-01-15 Thread yriveiro
Hi, I'm wondering if Solr has some feature like face.mincount but for maxcount. I have an use case where I need to know what facets have less than n elements. I can do this adding the facet.limit=-1 parameter and fetch the whole set and client-side remove the elements that don't match the

Request to add my username to ContributorsGroup

2014-01-15 Thread Ramzi Alqrainy
My wiki username Ramzi Alqrainy -- Sincerely , *Ramzi Alqrainy* Mobile: +962 (7) 9551 6105 Email: ramzi.alqra...@gmail.com My Profile: http://jo.linkedin.com/in/ramzialqrainy -- This e-mail and any attachments to it contains information which may be

Re: Facets maxcount feature?

2014-01-15 Thread Mikhail Khludnev
Hello Yago, This condition doesn't help to reduce computation significantly for facet.method=fc nor fcs, it might help for enum, but it requires implementation efforts. Also, my feeling is that you have much more performance challenges if you have million size facets response, it's not typical

Re: High cpu ratio when solr sleep

2014-01-15 Thread Mikhail Khludnev
Hello, Invoke top for particular process displaying threads enabled. Find the hottest thread PID. invoke jstack for this process, find the suspicious thread by .. nid=0x[PID in hex] ... PROFIT! On Wed, Jan 15, 2014 at 1:38 PM, YouPeng Yang yypvsxf19870...@gmail.comwrote: Hi I find that the

Re: SolrCloud Result Grouping vs CollapsingQParserPlugin

2014-01-15 Thread Joel Bernstein
During query time, depending on the query, results can be returned from both shards. For e.g. a query q=solrgroup=truegroup.field=adskdedupgroup.ngroups=true would ideally return data from both shards and apply the grouping on shard1 based on adskdedup field. This will also ensure that

Re: need help on OpenNLP with Solr

2014-01-15 Thread rashi gandhi
Thanks Lance for clearing. One more Question: Is there a possibility of integrating boosting with payloads in LUCENE-2899 with solr? Thanks in Advance On Mon, Jan 6, 2014 at 9:46 PM, rashi gandhi gandhirash...@gmail.comwrote: Hi, Also i wanted know, Is it possible to integrate wordnet

Re: Request to add my username to ContributorsGroup

2014-01-15 Thread Erick Erickson
Done for the Solr contributor's group. Yours is the only one with a space in it, hope it works. I know of no reason it won't, just something I noticed... Let us know if you have problems. Best and thanks! Erick On Wed, Jan 15, 2014 at 6:02 AM, Ramzi Alqrainy ramzi.alqra...@gmail.com wrote: My

Creating collections from java code

2014-01-15 Thread Flavio Pompermaier
Hi to everybody, I'm trying to create a collection programmatically instead of doing via http. Is that possible? I found around to use something like CoreAdminRequest.Create createRequest = new CoreAdminRequest.Create(); createRequest.setCoreName(coreName);

Re: Creating collections from java code

2014-01-15 Thread Flavio Pompermaier
Sorry, I found the error myself..I need to specify configName as paramete rof setCollectionConfigName(): createRequest.setCollectionConfigName(configName); Best, Flavio On Wed, Jan 15, 2014 at 4:26 PM, Flavio Pompermaier pomperma...@okkam.itwrote: Hi to everybody, I'm trying to create

Re: core.properties and solr.xml

2014-01-15 Thread Steven Bower
I will open up a JIRA... I'm more concerned over the core locator stuff vs the solr.xml.. Should the specification of the core locator go into the solr.xml or via some other method? steve On Tue, Jan 14, 2014 at 5:06 PM, Alan Woodward a...@flax.co.uk wrote: Hi Steve, I think this is a great

Order of values in stored field

2014-01-15 Thread Jamie Johnson
Is there any guarantee in Solr/Lucene for the order of values in a stored field? For instance if we did the following document.addField(text, value 1); document.addField(text, value 2); document.addField(text, value 3); is there a guarantee that the document will return the field arr name=text

Re: core.properties and solr.xml

2014-01-15 Thread Alan Woodward
I think solr.xml is the correct place for it, and you can then set up substitution variables to allow it to be set by environment variables, etc. But let's discuss on the JIRA ticket. Alan Woodward www.flax.co.uk On 15 Jan 2014, at 15:39, Steven Bower wrote: I will open up a JIRA... I'm

Re: SolrCloud Result Grouping vs CollapsingQParserPlugin

2014-01-15 Thread shamik
Thanks Joel, really appreciate your help. I'll keep an eye on the 4.6.1 release. -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-Result-Grouping-vs-CollapsingQParserPlugin-tp4111331p4111486.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Order of values in stored field

2014-01-15 Thread Yonik Seeley
On Wed, Jan 15, 2014 at 10:55 AM, Jamie Johnson jej2...@gmail.com wrote: Is there any guarantee in Solr/Lucene for the order of values in a stored field? Yes, order is preserved. -Yonik http://heliosearch.org -- off-heap filters for solr

Re: core.properties and solr.xml

2014-01-15 Thread Mark Miller
I think these API’s are pretty new and deep to want to support them for users at this point. It constrains refactoring and can complicates things down the line, especially with SolrCloud. This same discussion has come up in JIRA issues before. At best, I think all the recent refactoring in this

Re: Order of values in stored field

2014-01-15 Thread Jamie Johnson
great, thanks Yonik. My testing had shown that was the case but I wasn't sure if that was just happenstance or something that was guaranteed by the API. Again I appreciate the fast response! On Wed, Jan 15, 2014 at 11:25 AM, Yonik Seeley yo...@heliosearch.comwrote: On Wed, Jan 15, 2014 at

Re: core.properties and solr.xml

2014-01-15 Thread Alan Woodward
This is true. But if we slap big warning: experimental messages all over it, then users can't complain too much about backwards-compat breaks. My intention when pulling all this stuff into the CoresLocator interface was to allow other implementations to be tested out, and other suggestions

Error while trying to index doc on replica

2014-01-15 Thread aabreur
I have a collection that has only one shard. There are two nodes of this shard, a leader and a replica. When trying to index a doc to the leader, everything goes OK: POST xx.x.xx.x/solr/ds_target/update?commit=true { add: { doc: { key1: value1, key2: value2 } } }

Re: Error while trying to index doc on replica

2014-01-15 Thread Erick Erickson
Right, the error returned to the client often isn't a complete list. Take a look at your Solr logs as you try to index this, that may give you a better idea of what happens. Best, Erick On Wed, Jan 15, 2014 at 1:05 PM, aabreur alexandre.ab...@vtex.com.br wrote: I have a collection that has only

Re: Error while trying to index doc on replica

2014-01-15 Thread aabreur
Thanks Erick.. When tailing the log i get several similar errors: INFO - 2014-01-15 18:28:50.250; org.apache.solr.update.processor.LogUpdateProcessor; [ds_target] webapp=/solr path=/update params={distrib.from=http://172.0.60.97:80/solr/ds_target/update.distrib=TOLEADERwt=javabinversion=2} {} 0

Re: core.properties and solr.xml

2014-01-15 Thread Mark Miller
What’s the benefit? So you can avoid having a simple core properties file? I’d rather see more value than that prompt exposing something like this to the user. It’s a can of warms that I personally have not seen a lot of value in yet. Whether we mark it experimental or not, this adds a burden,

ANNOUNCE: ApacheCon NA 2014 Travel Assistance Applications now open!

2014-01-15 Thread Chris Hostetter
(Note: cross-posted to various lucene user lists, if you have replies please keep them on general@lucene, but pleast note that specific questions should be addressed to travel-assista...@apache.org) - - - Forwarded Announcement - - - The Travel Assistance Committee (TAC) are pleased

Re: Error while trying to index doc on replica

2014-01-15 Thread aabreur
Solved. It was a 'misconfigured' solrconfig.xml Had this: requestHandler name=/update class=solr.JsonUpdateRequestHandler/ Fixed backing to default and adding a path for json updates: requestHandler name=/update class=solr.UpdateRequestHandler / requestHandler name=/update/json

Search Suggestion Filtering

2014-01-15 Thread Hamish Campbell
Hi all, I'm looking into options for filtering the search suggestions dictionary. Using Solr 4.6.0, Suggester component and fst.FuzzyLookupFactory using a field based dictionary, we're indexing records for a multi-tenanted SaaS platform. SearchHandler records are always filtered by the

Re: Indexing URLs from websites

2014-01-15 Thread Teague James
I am still unsuccessful in getting this to work. My expectation is that the index-anchor plugin should produce values for the field anchor. However this field is not showing up in my Solr index no matter what I try. Here's what I have in my nutch-site.xml for plugins:

SolrException Error when indexing new documents at scale in SolrCloud -

2014-01-15 Thread cwhi
I have a SolrCloud installation with about 2 million documents indexed in it. It's been buzzing along without issue for the past 8 days, but today started throwing errors on document adds that eventually resulted in out of memory exceptions. There is nothing funny going on. There are a few

Analyzers versus Tokenizers/TokenFilters

2014-01-15 Thread Benson Margulies
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters never mentions an Analyzer class. http://wiki.apache.org/solr/SolrPlugins talks about subclasses of SolrAnalyzer as ways of delivering an entire analysis chain and still 'minding the gap'. Anyone care to offer a comparison of the

Re: SolrException Error when indexing new documents at scale in SolrCloud -

2014-01-15 Thread Shawn Heisey
On 1/15/2014 2:43 PM, cwhi wrote: I have a SolrCloud installation with about 2 million documents indexed in it. It's been buzzing along without issue for the past 8 days, but today started throwing errors on document adds that eventually resulted in out of memory exceptions. There is nothing

Re: Analyzers versus Tokenizers/TokenFilters

2014-01-15 Thread Ahmet Arslan
Hi Benson, Using lucene analyzer in schema.xlm should be last resort. For very specific reasons : if you have an existing analyzer, etc. Ahmet On Wednesday, January 15, 2014 11:52 PM, Benson Margulies ben...@basistech.com wrote: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters

Re: SolrException Error when indexing new documents at scale in SolrCloud -

2014-01-15 Thread cwhi
Hi Shawn, Thanks for the quick reply. I did notice the exception you pointed out and had some thoughts about it maybe being the client library I'm using to connect to Solr (C# SolrNet) disconnecting too early, but that doesn't explain it eventually running out of memory altogether. A large

Re: SolrException Error when indexing new documents at scale in SolrCloud -

2014-01-15 Thread Shawn Heisey
On 1/15/2014 3:10 PM, cwhi wrote: Thanks for the quick reply. I did notice the exception you pointed out and had some thoughts about it maybe being the client library I'm using to connect to Solr (C# SolrNet) disconnecting too early, but that doesn't explain it eventually running out of memory

Re: Search Suggestion Filtering

2014-01-15 Thread Tomás Fernández Löbbe
I think your use case is the one described in LUCENE-5350, maybe you want to take a look to the patch and comments there. Tomás On Wed, Jan 15, 2014 at 12:58 PM, Hamish Campbell hamish.campb...@koordinates.com wrote: Hi all, I'm looking into options for filtering the search suggestions

admin gui right side not loading

2014-01-15 Thread Andreas Owen
I'm using solr 4.3.1 and have installed it on a win 2008 server. Solr is working, for example import search. But the admin guis right side isn't loading and I get a javascript error for several d3-objects. The last error is: Load timeout for modules: lib/order!lib/jquery.autogrow

Re: admin gui right side not loading

2014-01-15 Thread Shawn Heisey
On 1/15/2014 5:20 PM, Andreas Owen wrote: I'm using solr 4.3.1 and have installed it on a win 2008 server. Solr is working, for example import search. But the admin guis right side isn't loading and I get a javascript error for several d3-objects. Could one of these be the problem?

Re: Search Suggestion Filtering

2014-01-15 Thread Hamish Campbell
Thanks Tomás, I'll take a look. Still interested to hear from anyone about using queries to populate the list - I'm willing to give up a bit of performance for the flexibility it would provide. On Thu, Jan 16, 2014 at 1:06 PM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: I think your

Re: Search Suggestion Filtering

2014-01-15 Thread Areek Zillur
Hey Hamish, You might want to check this out LUCENE-5402 . I added support for index-time pruning for suggesters that consumes from the index itself. I plan to add this support to file-based suggesters as well. In order to use this functionality from Solr, more changes are required. I am planning

Re: Analyzers versus Tokenizers/TokenFilters

2014-01-15 Thread Benson Margulies
Ahmet, So, this is an interesting difference between Lucene (and ES) and Solr. In Lucene, the idea seems to be that you package up a reusable analysis chain as an analyzer. Saying 'use analyzer X' is less complex than saying 'use tokenizer T and filters F1, F2, ...'. thanks, benson On Wed, Jan

solr cloud + hdfs issue

2014-01-15 Thread longsan
Hi, i'm newer for solr cloud. i met a question here. we use hdfs as file storage service and use Solr cloud to provide indexing service and query service. But solr cloud will replicate index/data files between leader and replica. And the hdfs replicates the files too. i think it wil waste disk

Re: solr cloud + hdfs issue

2014-01-15 Thread Otis Gospodnetic
Hi, I think this is a known issue and I don't know of anyone working on changing this. Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr Elasticsearch Support * http://sematext.com/ On Wed, Jan 15, 2014 at 9:46 PM, longsan longsan...@sina.com wrote: Hi, i'm newer for

Re: Analyzers versus Tokenizers/TokenFilters

2014-01-15 Thread Otis Gospodnetic
But the latter gives users the flexibility of putting together any T+F1FN chains they want and easily adding their own custom Fx to the mix. Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr Elasticsearch Support * http://sematext.com/ On Wed, Jan 15, 2014 at 9:45 PM,

Re: High cpu ratio when solr sleep

2014-01-15 Thread Otis Gospodnetic
I wonder if this would be a good addition to Solr Admin functionality? Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr Elasticsearch Support * http://sematext.com/ On Wed, Jan 15, 2014 at 6:29 AM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: Hello, Invoke top

changing term frequency (tf) at query time

2014-01-15 Thread Karan jindal
Hi all, Can I change the value of term frequency for certain files while at query time? For ex: if there is a scenario in which I want to ignore term frequency of *title* field, can I modify solr/lucene code to always gives term frequency for title as 1. One possible way of doing it can be this

Re: High cpu ratio when solr sleep

2014-01-15 Thread YouPeng Yang
Hi Thanks for the reply. I get the information as following: [solr@fkapp1 ~]$ ps mp 13359 -o THREAD,tid USER %CPU PRI SCNT WCHAN USER SYSTEM TID solr 217 -- - - - - solr 0.0 21- 184466-

Alternatives to GATE?

2014-01-15 Thread Philippe de Rochambeau
Hello, can anyone suggest alternatives to GATE (http://gate.ac.uk/download/)? I would like to index place and person names in PDFs using gazetteers (ie, dictionaries) and normalize dates ( (eg, December 1st, 2001 will be indexed as 20011201) and feed the indexes to SOLR? GATE is a great tool,

Re: High cpu ratio when solr sleep

2014-01-15 Thread YouPeng Yang
Hi By the ways,after I restart the web container ,the ratio returns normal. So when does the sutuation come out? Regards 2014/1/16 YouPeng Yang yypvsxf19870...@gmail.com Hi Thanks for the reply. I get the information as following: