Re: charfilter doesn't do anything

2013-09-08 Thread Andreas Owen
Yes, but that filters HTML and not the specific tag I want. On 7. Sep 2013, at 7:51 PM, Erick Erickson wrote: Hmmm, have you looked at: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Not quite the body, perhaps, but might it help? On Fri, Sep

Re: subindex

2013-09-08 Thread Peyman Faratin
Hi Erick it makes sense. Thank you for this. peyman On Sep 5, 2013, at 4:11 PM, Erick Erickson erickerick...@gmail.com wrote: Nope. You can do this if you've stored _all_ the fields (with the exception of _version_ and the destinations of copyField directives). But there's no way I know

Re: charfilter doesn't do anything

2013-09-08 Thread Jack Krupansky
I tried this and it seems to work when added to the standard Solr example in 4.4: <field name="body" type="text_html_body" indexed="true" stored="true" /> <fieldType name="text_html_body" class="solr.TextField" positionIncrementGap="100"> <analyzer> <charFilter class="solr.PatternReplaceCharFilterFactory"

Expunge deleting using excessive transient disk space

2013-09-08 Thread Manuel Le Normand
Hi, In order to delete part of my index I run a delete by query that is intended to erase 15% of the docs. I added these params to the solrconfig.xml: <mergePolicy class="org.apache.lucene.index.TieredMergePolicy"> <int name="maxMergeAtOnce">2</int> <int name="maxMergeAtOnceExplicit">2</int> <double

Profiling Solr Lucene for query

2013-09-08 Thread Manuel Le Normand
Hello all, Looking at the slowest 10% of my queries, I see very bad performance (~60 sec per query). These queries have lots of conditions on my main field (more than a hundred), including phrase queries and rows=1000. I do return only ids, though. I can quite firmly say that this bad performance is due

Solr suggest - How to define solr suggest as case insensitive

2013-09-08 Thread Mysurf Mail
My suggest (spellchecker) is returning case-sensitive answers. (I use it to autocomplete - dog and Dog return different phrases.) My suggest is defined as follows - in solrconfig - <searchComponent class="solr.SpellCheckComponent" name="suggest"> <lst name="spellchecker"> <str name="name">suggest</str>

Re: unknown _stream_source_info while indexing rich doc in solr

2013-09-08 Thread Nutan
Error got resolved, thanks a lot Sir. I have been trying for days to resolve it. On Fri, Sep 6, 2013 at 11:36 PM, Chris Hostetter-3 [via Lucene] ml-node+s472066n4088604...@n3.nabble.com wrote: : it shows type as undefined for dynamic field ignored_* , and I am using That means the

Re: Indexing pdf files - question.

2013-09-08 Thread Nutan Shinde
Error got resolved; the solution was that the dynamic field definition must be within the fields tag. On Sun, Sep 8, 2013 at 3:31 AM, Furkan KAMACI furkankam...@gmail.com wrote: Could you show us logs you get when you start your web container? 2013/9/4 Nutan Shinde nutanshinde1...@gmail.com My solrconfig.xml is:

Re: Expunge deleting using excessive transient disk space

2013-09-08 Thread Erick Erickson
Right, but you should have at least as much free space as your total index size, and I don't see the total index size (but I'm just glancing). I'm not entirely sure you can precisely calculate the maximum free space you have relative to the amount needed for merging, some of the people who wrote
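The rule of thumb in this reply (free disk space at least equal to the total index size) can be checked with a small script. This is only a minimal sketch in Python, not something from the thread: the function names are hypothetical, and the index directory you pass in is an assumption about your own install.

```python
import os
import shutil


def dir_size_bytes(path):
    """Sum the sizes of all files under path (the index directory)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                # A segment file may disappear mid-walk while merges run.
                pass
    return total


def has_merge_headroom(index_size, free_bytes):
    """Rule of thumb from the reply: free space >= total index size."""
    return free_bytes >= index_size


def check_index_dir(index_dir):
    """Compare the index size with the free space on the same filesystem."""
    size = dir_size_bytes(index_dir)
    free = shutil.disk_usage(index_dir).free
    return has_merge_headroom(size, free)
```

Calling check_index_dir on the core's data/index directory before running an expunge-deletes commit gives a quick yes/no answer. It does not predict the exact transient space a merge will need, which the reply itself says may not be precisely calculable.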

Dynamic Field

2013-09-08 Thread anurag.jain
Hi all, I am using Solr dynamic fields. I am storing data in the following format: id batch_* job_* So for a doc, data is stored like:

Re: Some highlighted snippets aren't being returned

2013-09-08 Thread Eric O'Hanlon
Hi again Everyone, I didn't get any replies to this, so I thought I'd re-send in case anyone missed it and has any thoughts. Thanks, Eric On Aug 7, 2013, at 1:51 PM, Eric O'Hanlon elo2...@columbia.edu wrote: Hi Everyone, I'm facing an issue in which my solr query is returning highlighted

Re: Some highlighted snippets aren't being returned

2013-09-08 Thread Bill Bell
Zip up all your configs Bill Bell Sent from mobile On Sep 8, 2013, at 3:00 PM, Eric O'Hanlon elo2...@columbia.edu wrote: Hi again Everyone, I didn't get any replies to this, so I thought I'd re-send in case anyone missed it and has any thoughts. Thanks, Eric On Aug 7, 2013, at

Re: Dynamic Field

2013-09-08 Thread Jack Krupansky
2. Flatten your data. 3. Use dynamic and multivalued fields only in moderation. 1. First, tell us how your application intends to use and query your data. That will be a guide to how your data should be stored. -- Jack Krupansky -Original Message- From: anurag.jain Sent: Sunday,

SOLR index Recovery availability

2013-09-08 Thread atuldj.jadhav
Hi Team, Need your suggestions/views on the approach I have in place for SOLR availability and recovery. I am running *SOLR 3.5* and have around *30k* documents indexed in my SOLR core. I have configured SOLR to hold *5k* documents in each segment at a time. I periodically commit and optimize my SOLR

Re: SOLR index Recovery availability

2013-09-08 Thread Walter Underwood
This sounds very complicated for only 30K documents. Put them all on one server, give it enough memory so that the index can all be in file buffers. If there is a disaster, reindex everything. That should only take a few minutes. And don't optimize. wunder On Sep 8, 2013, at 3:01 PM,

multiple update processor chains.

2013-09-08 Thread mike st. john
Is it possible to have multiple update processor chains run by default? I've tried adding multiple update.chains for the UpdateRequestHandler but it didn't seem to work. Wondering if it's even possible. Thanks msj

Data import

2013-09-08 Thread Luís Portela Afonso
Hi, Is it possible to disable document update when running the data import full-import command? Thanks

RE: Some highlighted snippets aren't being returned

2013-09-08 Thread Bryan Loofbourrow
Eric, Your example document is quite long. Are you setting hl.maxAnalyzedChars? If you don't, the highlighter you appear to be using will not look past the first 51,200 characters of the document for snippet candidates. http://wiki.apache.org/solr/HighlightingParameters#hl.maxAnalyzedChars --
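As an illustration of the suggestion above, here is a minimal Python sketch that builds a select URL with hl.maxAnalyzedChars raised above the 51,200 default. The hl, hl.fl and hl.maxAnalyzedChars parameters are the ones documented on the linked wiki page; the base URL, field name, query text and the chosen limit are placeholders, not values from the thread.

```python
from urllib.parse import urlencode


def build_highlight_url(base_url, query, field, max_analyzed_chars):
    """Return a Solr select URL that asks for highlighting on one field."""
    params = {
        "q": query,
        "hl": "true",                 # turn highlighting on
        "hl.fl": field,               # field to build snippets from
        # Look further into long documents than the default limit.
        "hl.maxAnalyzedChars": str(max_analyzed_chars),
        "wt": "json",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical host, handler path and field name.
url = build_highlight_url(
    "http://localhost:8983/solr/select", "solr", "body", 1000000
)
```

Raising the limit makes the highlighter analyze more text per document, so it is worth testing the effect on response time with the longest documents in the index.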

Re: Tweaking boosts for more search results variety

2013-09-08 Thread Sai Gadde
Sorry for the delayed response. The limitation in this scenario is that we have 5 million indexed documents from only about 1000 sites. If results are grouped by site we will not be able to show more than a couple of pages for a lot of search keywords. Ex: Search for Solr has 1000 matches but only

Re: Data import

2013-09-08 Thread Alexandre Rafalovitch
What specifically do you mean by disabling document update? Do you mean in-place update? Or do you mean you want to run the import but not actually populate the Solr collection with processed documents? It might help to explain the business level goal you are trying to achieve. Or, specific error

Re: Solr4.4 or zookeeper 3.4.5 do not support too many collections? more than 600?

2013-09-08 Thread diyun2008
Thank you Erick. It's very useful to me. I have already started to merge the logs of the collections into 15 collections. But there's another question. If I merge 1000 collections into 1 collection, the new collection will have about 20G of data and about 30M records. In 1 solr server, I will create 15

Re: multiple update processor chains.

2013-09-08 Thread Alexandre Rafalovitch
Only one chain per handler. But then you can define any sequence inside the chain, so why do you care about multiple chains? Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps

Re: Loading a SpellCheck dynamically

2013-09-08 Thread Mr Havercamp
Hi, Thanks for the response. Per your instructions, I have set up additional request handlers for handling language-specific /selects: <!-- generic query --> <requestHandler name="/select" class="solr.SearchHandler"> <lst name="defaults"> <str name="echoParams">explicit</str>

Searching solr on school name during year

2013-09-08 Thread Rohit Kumar
Hi, Currently I have a student search which allows me to search for documents in a school. I am looking at including year search in the existing schema, which would enable users to search for students in a school during a year. I have a proposed change in the schema to add the year component to