Troubles with Solr admin+Alfresco searching

2011-11-11 Thread VladislavLysov
Hello!!! I'm use Solr+Alfresco. I was install Alfresco 4.0b and uploaded a few files(with .txt extension) through the Share. After that a was opened Solr admin at this URL http:\\localhost:\solr\alfresco\admin http:\\localhost:\solr\alfresco\admin . Then I typed my query(for example -

Re: InvalidTokenOffsetsException when using MappingCharFilterFactory, DictionaryCompoundWordTokenFilterFactory and Highlighting

2011-11-11 Thread Vadim Kisselmann
Hi Edwin, Chris it´s an old bug. I have big problems too with OffsetExceptions when i use Highlighting, or Carrot. It looks like a problem with HTMLStripCharFilter. Patch doesn´t work. https://issues.apache.org/jira/browse/LUCENE-2208 Regards Vadim 2011/11/11 Edwin Steiner

License Info

2011-11-11 Thread Rahul R
Hello, Since Apache Solr is governed by Apache License 2.0 - does it mean that all jar files bundled within Solr are also governed by the same License ? Do I have to worry about checking the License information of all bundled jar files in my commercial Solr powered application ? Even if I use

getting solr to expand Acronym

2011-11-11 Thread Tiernan OToole
Dont know if this is posible, but i need to ask anyway... Say we have a list of Acronyms in a database (CD, DVD, CPU) and also a list of their not so short names (Compact Disk, Digital Versitile Disk, Central Processing Unit) but they are not linked in any particular way (lost of items, some with

Does not work MappingCharFilterFactory?

2011-11-11 Thread 八鍬 元宜
I have created something of a war and ant Solr revision number 1199405. I put the index was created in Tomcat then the solr war. Search was not hit. solr schema settings are as follows. fieldType name=text_kwd class=solr.TextField positionIncrementGap=1001 analyzer charFilter

Re: One field must match with edismax

2011-11-11 Thread roySolr
Thanks, it was that easy. I was thinking about a variant of the mm option in dismax but this works great! -- View this message in context: http://lucene.472066.n3.nabble.com/One-field-must-match-with-edismax-tp3496232p3499312.html Sent from the Solr - User mailing list archive at Nabble.com.

writing to shards

2011-11-11 Thread Ankita Patil
Hi, I have an index which has the shard parameter pointing to 2 shards(server1, server2). While reading from the shards you query on the index with the shard parameter and it will retrieve the documents from both the shards. Now I want to add the documents to their respective shards depending

Solr 3.3 Sorting is not working for long fields

2011-11-11 Thread rajini maski
Hi, I have upgraded my Solr from 1.4.1 to 3.3.Now I tried to sort on a long field and documents are not getting sorted based on that. Sort is working when we do sorting on facet ex:facet=on facet.sort=studyid But when do simple sort on documents , sort=studyid, sort doesn't

Re: Preventing DIH to stop on error

2011-11-11 Thread O. Klein
O. Klein wrote: org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' value: url_example is not a directory Processing Document # 1 at org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:123) I added onError=skip and

Using solr during optimization

2011-11-11 Thread Kalika Mishra
Hi, I would like to optimize solr core which is in Reader Writer mode. Since the Solr cores are huge in size (above 100 GB) the optimization takes hours to complete. When the optimization is going on say. on the Writer core, the application wants to continue using the indexes for both query and

How to mix solr query info into the apache httpd logging (reverseproxy)?

2011-11-11 Thread alex_mass
Hello we are using solr beyong a httpd reverse proxy. We'd like to mix up solr log information together with apache: * info coming from apache (user_track module to follow user over several session) etc. * solr info (nbhits ...) The idea is that we have all our logs at the same place with a

Problems with type mismatch on SolrJ annotations

2011-11-11 Thread Anil Kumar YV
Hi, I am also facing similar issue. Did you find any solution for the same? Please share it. Thanks, Anil.

Re: Using solr during optimization

2011-11-11 Thread Mark Miller
Do a you have something forcing you to optimize, or are you just doing it for the heck of it? On Nov 11, 2011, at 7:50 AM, Kalika Mishra wrote: Hi, I would like to optimize solr core which is in Reader Writer mode. Since the Solr cores are huge in size (above 100 GB) the optimization takes

Re: Distributed indexing

2011-11-11 Thread Erick Erickson
As you can see, there are a lot of related issues. Solr Cloud is where this action is happening, and there's a lot of active development work going on in this area. But the short form is you'll have to wait for that release (you can pitch in and help if you'd like) The release date for all

Re: Dynamic adding of shards

2011-11-11 Thread Erick Erickson
You can always put the shards in the URL. I don't know whether reloading the core will accomplish this, you might check that out. See: http://wiki.apache.org/solr/CoreAdmin#RELOAD Best Erick On Thu, Nov 10, 2011 at 12:20 AM, Ankita Patil ankita.pa...@germinait.com wrote: Hi, One way to add

Re: Solr Indexing Time varying each time I index

2011-11-11 Thread Erick Erickson
We've seen around a 10-15% decrease in performance on average in a virtualized environment as a first approximation, which doesn't explain your results but might give you a place to start. I'm pretty sure Solr isn't an issue, but my question is how much RAM is on your underlying hardware? And how

Re: Facet counts incorrect on filtered field when using groups

2011-11-11 Thread Erick Erickson
Hmmm, something's not making sense here. First, grouping shouldn't affect faceting as far as I understand, they're two entirely different constructs. Why would you expect specifying grouping to affect facet counts? Second, what is this all about? fq={!tag=CC}CountryCode:MA It looks like you're

Re: Core reload vs servlet container restart

2011-11-11 Thread Erick Erickson
Hmmm, may be a bug, can you reliably reproduce this? About the admin/schema that is simply a file transfer from the schema file, it's not reconstructed from the current internal representation in Solr. So the fact that your change shows up there is unrelated to whether the Solr instance knows

Re: Using solr during optimization

2011-11-11 Thread Kalika Mishra
Hi Mark, We are performing almost 11,000 updates a day, we have around 50 million docs in the index (i understand we will need to shard) the core seg will get fragmented over a period of time. We will need to do optimize every few days or once in a month; do you have any reason not to optimize

RE: I cannot get elevate.xml to work with Apache Solr (1.41)

2011-11-11 Thread Michael Herchel
Thanks Koji! Using the elevate request handler worked. Is there a way to use the QueryElevationComponent in the default search handler? I added the following: bool name=enableElevationtrue/bool bool name=forceElevationtrue/bool to requestHandler name=standard class=solr.SearchHandler, so

Re: keeping master server indexes in sync after failover recovery

2011-11-11 Thread Erick Erickson
You've pretty much nailed getting back in sync, although there's a third way, just re-index all the changed content to M1 since M1 went down. Whether this is viable depends a lot on your particular situation. You'd want to disable replication while the reindexing happened. Note that you have one

Re: Using solr during optimization

2011-11-11 Thread Mark Miller
I would not optimize - it's very expensive. With 11,000 updates a day, I think it makes sense to completely avoid optimizing. That should be your default move in any case. If you notice performance suffers more than is acceptable (good chance you won't), then I'd use a lower merge factor. It

Re: Core reload vs servlet container restart

2011-11-11 Thread Mark Miller
What version of Solr? When you look at the logs, does a new SolrCore look like it comes up right away? It sounds like perhaps the old SolrCore is still serving requests - how long do you wait before trying to restart jetty? Anything interesting in the logs around that time? - Mark Miller

RE: [Solr-3.4] Norms file size is large in case of many unique indexed fields in index

2011-11-11 Thread Ivan Hrytsyuk
Thank you guys for responses. Some background on the task: The problem we are trying to solve with Solr is the following. We have to provide a full-text search over documents that partially consist of fields that are always there and partially of additional metadata as key-value pairs where

Missing Results

2011-11-11 Thread dennis leary
In our implementation we are using dataimport handler to bring records in via database. In our schema we have a field called designation and it is of type string and is multivalued. We run a simple query q=designations:My Designation and it returns documents that match but depending on the

Missing Results

2011-11-11 Thread dennis leary
In our implementation we are using dataimport handler to bring records in via database. In our document we have a field called designation and it is of type string and is multivalued. We run a simple query q=designations:My Designation and it returns documents that match but depending on the

How to read values from dataimport.properties in a production environment

2011-11-11 Thread Nico Luna
I'm trying to know the values stored in the dataimport.properties file in a production environment using the solr admin feature, so I copied the same behaviour as: [Schema] and [Config] but changing the contentType property (from contentType=text/xml to contentType=text):

a newbie question reagarding keyword count in each document

2011-11-11 Thread zeek
Hi All, I am realtively new to Solr/Lucene and need some help. - I am basically storing documents where each document represents an Entity (a thing, a place etc) - each Entity has some unique features that i need to store in a filed(s) - also, i need to store the mention of those features (based

Re: getting solr to expand Acronym

2011-11-11 Thread lboutros
Hi, I'm not sure to see what you mean, but perhaps synonyms could solve your problem ? http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory Ludovic. - Jouve France. -- View this message in context:

Re: Distributed indexing

2011-11-11 Thread Otis Gospodnetic
Hi, I think at least a part of the question here is whether what's in JIRA is the latest stuff or if Mark or Yonik or ... have some patches that they could put in JIRA, so that others can work off of them.  For example, I just looked at these 3 issues:

Re: getting solr to expand Acronym

2011-11-11 Thread Otis Gospodnetic
Tiernan, I don't think you can do it through any Solr configs, though I imagine you could do it with a custom Token Filter that keeps track of the context. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/

Re: License Info

2011-11-11 Thread Otis Gospodnetic
Rahul, You should check licenses of those other projects' jars independently of Solr. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ From: Rahul R rahul.s...@gmail.com To:

Re: Distributed indexing

2011-11-11 Thread Mark Miller
There is a distrib indexing branch called solrcloud. Anyone that wants to contribute should check that branch out. I have various code I have not put up yet for review because I'm still working on it - but the main action happens on that branch. I'm not working on every issue at the moment

RE: getting solr to expand Acronym

2011-11-11 Thread Brandon Ramirez
Could this be simulated through synonyms? Could you define CD as a synonym of Compact Disc or vice versa? I'm not sure if that would work, just brainstorming here... Brandon Ramirez | Office: 585.214.5413 | Fax: 585.295.4848 Software Engineer II | Element K | www.elementk.com

Re: Distributed indexing

2011-11-11 Thread Mark Miller
On Nov 9, 2011, at 12:52 PM, Rafał Kuć wrote: Hello! I was looking for a way to implement distributed indexing in Solr. From looking at the https://issues.apache.org/jira/browse/SOLR-2358 there was some work done to enable Solr to distribute the documents to shards without the need of 3rd

Re: Distributed indexing

2011-11-11 Thread Rafał Kuć
Hello! Thanks for the answer :) -- Regards, Rafał Kuć On Nov 9, 2011, at 12:52 PM, Rafał Kuć wrote: Hello! I was looking for a way to implement distributed indexing in Solr. From looking at the https://issues.apache.org/jira/browse/SOLR-2358 there was some work done to enable Solr to

Group.ngroup parameter memory consumption

2011-11-11 Thread Rafał Kuć
Hello! I was wondering if there is a way for calculating the memory consumption of group.ngroups parameter. I know the the answer can be 'that depends', but what I'm actually wondering about is what the memory consumption depends on - number of documents returned by a query or number of groups ?

Re: Group.ngroup parameter memory consumption

2011-11-11 Thread Martijn v Groningen
The ngroup option collects per search the number of unique groups matching the query. Based on the collected groups it returns the count. So it depends of the number of groups matching the query. To get more in detail: per unique group a ByteRef instance is created to represent a group and this

Highlighting is not working for wildcard searches when TermVectors (hl.useFastVectorHighlighter) is enabled

2011-11-11 Thread Shyam Bhaskaran
Hi, Highlighting is not working for wildcard searches when TermVectors (hl.useFastVectorHighlighter) is enabled. I wanted to use FastVectorHighlighter to improve the performance of search results but when hl.useFastVectorHighlighter is enabled highlighting does not work in case of wildcard

Re: I cannot get elevate.xml to work with Apache Solr (1.41)

2011-11-11 Thread Koji Sekiguchi
(11/11/11 23:16), Michael Herchel wrote: Thanks Koji! Using the elevate request handler worked. Is there a way to use the QueryElevationComponent in the default search handler? I added the following: bool name=enableElevationtrue/bool bool name=forceElevationtrue/bool torequestHandler

Re: Group.ngroup parameter memory consumption

2011-11-11 Thread Rafał Kuć
Hello! Thanks, that's what I was looking for :) -- Regards, Rafał Kuć http://solr.pl The ngroup option collects per search the number of unique groups matching the query. Based on the collected groups it returns the count. So it depends of the number of groups matching the query. To get