Re: Handling acronyms

2021-01-15 Thread Shaun Campbell
d be using these acronyms at index time, not > > search time. It will make your index bigger and you'll have to re-index > > to add new synonyms (as they may apply to old documents) but this could > > be an occasional task, and in the meantime you could use query-time > >

Re: Handling acronyms

2021-01-15 Thread Shaun Campbell
ntaining 9000 synonyms in Solr's synonyms.txt file seems unweildy to me. > > Cheers > > Charlie > > On 15/01/2021 09:48, Shaun Campbell wrote: > > I have a medical journals search application and I've a list of some > 9,000 > > acronyms like this: > >

Re: Handling acronyms

2021-01-15 Thread Shaun Campbell
hin the multiwords. > > As synonyms.txt: > SRN, Stroke\ Research\ Network > IGBP, isolated\ gastric\ bypass > ... > > Redards > Bernd > > > Am 15.01.21 um 10:48 schrieb Shaun Campbell: > > I have a medical journals search application and I've a list of some

Handling acronyms

2021-01-15 Thread Shaun Campbell
I have a medical journals search application and I've a list of some 9,000 acronyms like this: MSNQ=>MSNQ Multiple Sclerosis Neuropsychological Screening Questionnaire SRN=>SRN Stroke Research Network IGBP=>IGBP isolated gastric bypass TOMADO=>TOMADO Trial of Oral Mandibular Advancement Devices fo

Re: Highlighting large text fields

2021-01-12 Thread Shaun Campbell
hting more is your goal right now it appears. > > ~ David Smiley > Apache Lucene/Solr Search Developer > http://www.linkedin.com/in/davidwsmiley > > > On Tue, Jan 12, 2021 at 2:45 PM Shaun Campbell > wrote: > > > That's great David. So hl.maxAnalyzedChars isn&#x

Re: Highlighting large text fields

2021-01-12 Thread Shaun Campbell
n On Tue, 12 Jan 2021 at 19:33, David Smiley wrote: > On Tue, Jan 12, 2021 at 1:08 PM Shaun Campbell > wrote: > > > Hi David > > > > Getting closer now. > > > > First of all, a bit of a mistake on my part. I have two cores set up and > I >

Re: Highlighting large text fields

2021-01-12 Thread Shaun Campbell
t the dedicated plugin info is only for > the original and Fast Vector Highlighters. > > ~ David > > > > > > Thanks > > Shaun > > > > On Mon, 11 Jan 2021 at 20:57, David Smiley wrote: > > > > > Hello! > > > > > > I worked

Re: Highlighting large text fields

2021-01-12 Thread Shaun Campbell
d highlighting to each request? Thanks Shaun On Mon, 11 Jan 2021 at 20:57, David Smiley wrote: > Hello! > > I worked on the UnifiedHighlighter a lot and want to help you! > > On Mon, Jan 11, 2021 at 9:58 AM Shaun Campbell > wrote: > > > I've been using highlighting

Highlighting large text fields

2021-01-11 Thread Shaun Campbell
I've been using highlighting for a while, using the original highlighter, and just come across a problem with fields that contain a large amount of text, approx 250k characters. I only have about 2,000 records but each one contains a journal publication to search through. What I noticed is that so

Searching document content and mult-valued fields

2020-07-01 Thread Shaun Campbell
Hi Been using Solr on a project now for a couple of years and is working well. It's just a simple index of about 20 - 25 fields and 7,000 project records. Now there's a requirement to be able to search on the content of documents (web pages, Word, pdf etc) related to those projects. My initial t

Re: Multiple Cores

2011-06-20 Thread Shaun Campbell
I would say it all depends on what you are trying to do. Unlike a relational database, in Solr the data does not need to be normalised, you need to put everything into an index so that you can achieve whatever feature it is that you want. For example, you may search on customer and want a facette

Re: manual background re-indexing

2011-04-28 Thread Shaun Campbell
Hi Paul Would a multi-core set up and the swap command do what you want it to do? http://wiki.apache.org/solr/CoreAdmin Shaun On 28 April 2011 12:49, Paul Libbrecht wrote: > > Hello list, > > I am planning to implement a setup, to be run on unix scripts, that should > perform a full pull-and-

Re: Indexing Best Practice

2011-04-11 Thread Shaun Campbell
If it's of any help I've split the processing of PDF files from the indexing. I put the PDF content into a text file (but I guess you could load it into a database) and use that as part of the indexing. My processing of the PDF files also compares timestamps on the document and the text file so th

Re: Tips for getting unique results?

2011-04-10 Thread Shaun Campbell
t is greater than 0, index otherwise. > > Prior to Solr1.4, one needed to use true instead of count and false instead > of index. > > This parameter can be specified on a per field basis. > > > -Pete > > On Apr 8, 2011, at 2:49 AM, Shaun Campbell wrote: > > > Pete &g

Re: Tips for getting unique results?

2011-04-08 Thread Shaun Campbell
Pete Surely the default sort order for facets is by descending count order. See http://wiki.apache.org/solr/SimpleFacetParameters. If your results are really sorted in ascending order can't you sort them externally eg Java? Hope that helps. Shaun

Re: Highlighting Issue

2010-12-09 Thread Shaun Campbell
Sorry, see what you mean about fixed-length (minGramSize==maxGramSize). I see mine aren't.:( On 9 December 2010 14:26, Koji Sekiguchi wrote: > (10/12/09 22:50), Shaun Campbell wrote: > >> OK. I'd switch to FastVectorHighlighter which cured the exceptions and >> g

Re: Highlighting Issue

2010-12-09 Thread Shaun Campbell
OK. I'd switch to FastVectorHighlighter which cured the exceptions and gives me highlighting so I assumed that you could use this instead of the standard highlighter on n-grammed fields. I guess my query was how does the highlighter now highlight synonym terms? Thanks Shaun > As I said in my pr

Re: Highlighting Issue

2010-12-09 Thread Shaun Campbell
ative assets As you can see asset works well. For the "synonyms" does it just highlight the first n characters where n is the length of the input string? Can't figure out how it could do otherwise. Shaun On 9 December 2010 12:51, Koji Sekiguchi wrote: > (10/12/09 21:22

Highlighting Issue

2010-12-09 Thread Shaun Campbell
I'm trying to highlight a field and I'm getting an exception thrown, only on certain search terms though. I am fairly certain that the cause of the problem is through having synonyms on the highlighted field as I have had highlighting working in the past on other fields. The added complication is

EmbeddedSolrServer, Indexing and Core Swapping

2010-11-16 Thread Shaun Campbell
Hi I've switched my app to now use an EmbeddedSolrServer. I'm doing an index on my rebuild core and swapping cores at the end. Unfortunately, without restarting my web app I can't see the newly indexed data. I can see core swapping is working, and I can see the data after indexing without restar

Core Swapping

2010-11-16 Thread Shaun Campbell
I've got a Solr multi core system and I'm trying to swap the cores after a re-index via SolrJ using a separate HTTP Solr web server.  My application seems to be generating a URL that's not valid for my Solr Tomcat installation but I can't see why or where it's getting its data from. Core swapping

Exception being thrown indexing a specific pdf document using Solr Cell

2010-10-15 Thread Shaun Campbell
I've got an existing Spring Solr SolrJ application that indexes a mixture of documents. It seems to have been working fine now for a couple of weeks but today I've just started getting an exception when processing a certain pdf file. The exception is : ERROR: org.apache.solr.core.SolrCore - org.

Re: Swapping cores with SolrJ

2010-09-14 Thread Shaun Campbell
Hi Mitch Thanks for responding. Not actually sure what you wanted from CoreAdminResponse but I put the following in: CoreAdminRequest car = new CoreAdminRequest(); car.setCoreName("live"); car.setOtherCoreName("rebuild"); car.setAction(CoreAdminPar

Swapping cores with SolrJ

2010-09-14 Thread Shaun Campbell
I've got Solr set up now with two cores which I call live and rebuild and which point to core0 and core1 directories respectively. My solr.xml file contains: In my Spring MVC application I have Solr set up as an embedded server and have two singleton beans which I use to refer to

Multi core schema file

2010-09-08 Thread Shaun Campbell
I have a query regarding multi core schema files which I can't find the answer to. In a multi core set up does each core need it's own schema file? I have two cores with their own schema files, which are a straight copy of each other, and I just wanted to maintain one schema file. I see there's

Re: SolrJ and Multi Core Set up

2010-09-03 Thread Shaun Campbell
't need to add anything with SolrQuery. > > URL url = new URL(new URL(solrBaseUrl), coreName); > CommonsHttpSolrServer server = new CommonsHttpSolrServer(url); > > Concerning the "default" core thing - I wouldn't know about that. > > > Cheers, > Chantal >

SolrJ and Multi Core Set up

2010-09-03 Thread Shaun Campbell
I'm writing a client using SolrJ and was wondering how to handle a multi core installation. We want to use the facility to rebuild the index on one of the cores at a scheduled time and then use the SWAP facility to switch the "live" core to the newly rebuilt core. I think I can do the SWAP with C