Re: Autocomplete with Solr 3.1

2011-07-28 Thread scorpking
Nobody can help me -- View this message in context: http://lucene.472066.n3.nabble.com/Autocomplete-with-Solr-3-1-tp3202214p3206095.html Sent from the Solr - User mailing list archive at Nabble.com.

Collpasing MultiValue fields

2011-07-28 Thread FatMan Corp
Hello, I understand collapsing is not yet possible for multi value fields, but still wonder what is the best way to solve the issue I am having. I have the following document data fields: 1. Title (max 200 chars) 2. Abstract (max 2000 chars) 3. Body (can be quite long) 4. Author (multi valued)

I can't pass the unit test when compile from apache-solr-3.3.0-src

2011-07-28 Thread Bing Yu
I just goto apache-solr-3.3.0/solr and run 'ant test' I find that the junit test will always fail, and told me ’BUILD FAILED‘ but if I type 'ant dist', I can get a apache-solr-3.3-SNAPSHOT.war with no warning. Is it a problem just me? my server:Centos 5.6 64bit/apache-ant-1.8.2 /junit and jdk

Re: Dealing with keyword stuffing

2011-07-28 Thread Pranav Prakash
On Thu, Jul 28, 2011 at 08:31, Chris Hostetter hossman_luc...@fucit.orgwrote: : Presumably, they are doing this by increasing tf (term frequency), : i.e., by repeating keywords multiple times. If so, you can use a custom : similarity class that caps term frequency, and/or ensures that the

Index time boosting with DIH

2011-07-28 Thread Bürkle , David
Can someone point me to an example for using index time boosting with the DataImportHandler.

Re: Index time boosting with DIH

2011-07-28 Thread Shalin Shekhar Mangar
On Thu, Jul 28, 2011 at 3:56 PM, Bürkle, David david.buer...@irix.chwrote: Can someone point me to an example for using index time boosting with the DataImportHandler. You can use the special flag variable $docBoost to add a index time boost.

Re: Dealing with keyword stuffing

2011-07-28 Thread Gora Mohanty
On Thu, Jul 28, 2011 at 3:48 PM, Pranav Prakash pra...@gmail.com wrote: [...] I am not sure how to use SweetSpotSimilarity. I am googling on this, but any useful insights are so much appreciated. Replace the existing DefaultSimilarity class in schema.xml (look towards the bottom of the file)

Reusing SolrServer instances when swapping cores

2011-07-28 Thread Michael Szalay
Hi all We work with two cores (active and passive) and swap them when the reindexing was finished. Is it allowed to reuse the same instance of the SolrServer (both Embedded and Common)? I.E. do they point to the other core after the swapping? Regards Michael -- Michael Szalay Senior

Re: please help explaining debug output

2011-07-28 Thread Erick Erickson
IDF is the frequency of the term in that field for the entire index, not the specific document. So it means that the term is in that field for some document somewhere, but not in that particular document I believe... Which leads me to wonder if the document is getting indexed as you expect,

Re: Exact match not the first result returned

2011-07-28 Thread Brian Lamb
That's a clever idea. I'll put something together and see how it turns out. Thanks for the tip. On Wed, Jul 27, 2011 at 10:55 PM, Chris Hostetter hossman_luc...@fucit.orgwrote: : With your solution, RECORD 1 does appear at the top but I think thats just : blind luck more than anything else

Possible to use quotes in dismax qf?

2011-07-28 Thread O. Klein
I want to do a dismax search to search for original query and this query as a phrasequery: q=sail boat needs to be converted to dismax query q=sail boat sail boat qf=title^10 content^2 What is best way to do this? -- View this message in context:

Re: slave data files way bigger than master

2011-07-28 Thread Erick Erickson
My utter and complete shot in the dark is that the slave isn't getting its data from the master you think it is. I know it's a silly comment, but I've chased my tail this way more than once G... None of the files match. None of the dates match, etc. I'm assuming that bouncing the slave doesn't

Re: how to get solr core information using solrj

2011-07-28 Thread Jiang mingyuan
hi Stefan, thanks for your advice,i wrote a jsp file to obtain those information. witch looks like : CoreContainer cores=(CoreContainer)request.getAttribute(org.apache.solr.CoreContainer); then cores.getCores() get core informations. later I translate infos to json format. at client side.I

Re: how to get solr core information using solrj

2011-07-28 Thread Jiang mingyuan
HI Erick: At the page you have show me, I found some useful methods. But it seems like not contains method about how to obtain core names,core paths. so,I followed the solr index page's method,wrote a jsp page ,like: CoreContainer

Re: Solr DataImport with multiple DBs

2011-07-28 Thread Erick Erickson
Often, the easiest solution when DIH gets really complex is to do one of two things: 1 Use SolrJ instead. You can do complex things more easily much of the time with DIH. 2 You could consider using a custom Transformer in conjunction with your primary delta query to access the second

Re: what data type for geo fields?

2011-07-28 Thread Peter Wolanin
Thanks for the feedback. I'll have look more at how geohash works. Looking at the sample schema more closely, I see: fieldType name=double class=solr.TrieDoubleField precisionStep=0 omitNorms=true positionIncrementGap=0/ So in fact double is also Trie, but just with precisionStep 0 in the

Re: what data type for geo fields?

2011-07-28 Thread Yonik Seeley
On Thu, Jul 28, 2011 at 10:24 AM, Peter Wolanin peter.wola...@acquia.com wrote: Thanks for the feedback.  I'll have look more at how geohash works. Looking at the sample schema more closely, I see:  fieldType name=double class=solr.TrieDoubleField precisionStep=0 omitNorms=true

Re: Possible to use quotes in dismax qf?

2011-07-28 Thread Juan Grande
Hi, You can use the pf parameter of the DismaxQParserPlugin: http://wiki.apache.org/solr/DisMaxQParserPlugin#pf_.28Phrase_Fields.29 This parameter receives a list of fields using the same syntax as the qf parameter. After determining the list of matching documents, DismaxQParserPlugin will boost

RE: Solr DataImport with multiple DBs

2011-07-28 Thread Dyer, James
Would it be possible to just run two sepearate deltas, one that updates records that changed in ds1 and another that updates records that changed in ds2 ? Of course this would be inefficient if a lot of records typically change in both places at the same time. With this approach, you might

Re: colocated term stats

2011-07-28 Thread Jonathan Rochkind
Not sure if this will do what you want, but one way might be using facets. Take the term you are interested in, and apply it as an fq. Now the result set will include only documents that include that term. So also request facets for that result set, the top 10 facets are the top 10 terms

Re: Exact match not the first result returned

2011-07-28 Thread Jonathan Rochkind
Keep in mind that if you use a field type that includes spaces (eg StrField, or KeywordTokenizer), then if you're using dismax or lucene query parsers, the only way to find matches in this field on queries that include spaces will be to do explicit phrase searches with double quotes. These

Re: Possible to use quotes in dismax qf?

2011-07-28 Thread Jonathan Rochkind
It's not clear to me why you would try to do that, I'm not sure it makes a lot of sense. You want to find all documents that have sail boat as a phrase AND have sail somewhere in them AND have boat somewhere in them? That's exactly the same as just all documents that have sail boat as a

about the Solr request filter

2011-07-28 Thread 于浩
Hello,Dear friends, I have got an problem in developing with solr. In My Application ,It must sends multiple query to solr server after the page is loaded. Then I found a problem: some request will return statusCode:0 and QTime:0, The solr has accepted the request, but It does not return a

Re: Store complete XML record (DIH XPathEntityProcessor)

2011-07-28 Thread Chantal Ackermann
Hi g, have a look at the PlainTextEntityProcessor: http://wiki.apache.org/solr/DataImportHandler#PlainTextEntityProcessor you will have to call the URL twice that way, but I don't think you can get the complete document (the root element with all structure) via xpath - so the

Exception in thread main org.apache.solr.common.SolrException: No such core: core1

2011-07-28 Thread automata
Hi I am very new with Solr, infact just started today so forgive my lack of knowledge on the subject. Everything went fine until the point where I started to get the exception Exception in thread main org.apache.solr.common.SolrException: No such core: core1 and I am stuck at the same point for

Re: Possible to use quotes in dismax qf?

2011-07-28 Thread O. Klein
I removed the post as it might confuse people. But because of analysers combining 2 words in a phrase query using shingles and positionfilter and the usage of dismax, I need q to be the original query plus the original query as phrasequery. That way the combined words are also highlighted and do

question about exception in faceting

2011-07-28 Thread Koji Sekiguchi
If I got an exception during faceting (e.g. undefined field), Solr doesn't return HTTP 400 but 200 with the exception stack trace in arr name=exception .../arr tag. Why is it implemented so? I checked Solr 1.1 and saw the same behavior. Except FacetComponent, HighlightComponent for example, if I

Re: Store complete XML record (DIH XPathEntityProcessor)

2011-07-28 Thread solruser@9913
Thanks Chantal I am ok with the second call and I already tried using that. Unfortunatly It reads the whole file into a field. My file is as below example xml record ... /record record ... /record record ... /record /xml Now the XPATH does the 'for

ShingleFilterFactory class error

2011-07-28 Thread Pradeep Pujari
Hi, I am trying to create shingles with minShingleSize = 10, but it also returns bi-grams too. Heres is my schema defn filter class=solr.ShingleFilterFactory minShingleSize=10 maxShingleSize=25 outputUnigrams=false outputUnigramsIfNoShingles=false tokenSeparator= /

RE: ShingleFilterFactory class error

2011-07-28 Thread Steven A Rowe
Pradeep, As indicated on the wiki http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory, the minShingleSize option is not available in Solr versions prior to 3.1. What version of Solr are you using? (By the way, I am only replying on solr-user@lucene.apache.org

field with repeated data in index

2011-07-28 Thread Mark juszczec
Hello all I created an index consisting of orders and the names of the salesmen who are responsible for the order. As you can imagine, the same name can be associated with many different orders. No problem. Until I try to do a faceted search on the salesman name field. Right now, I have the

RE: field with repeated data in index

2011-07-28 Thread Dyer, James
You need to index the field you want to facet on. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Mark juszczec [mailto:mark.juszc...@gmail.com] Sent: Thursday, July 28, 2011 3:50 PM To: solr-user@lucene.apache.org Subject: field with repeated

Re: field with repeated data in index

2011-07-28 Thread Mark juszczec
James Wow. That was fast. Thanks! But I thought you couldn't index a field that has duplicate values? Mark On Thu, Jul 28, 2011 at 4:53 PM, Dyer, James james.d...@ingrambook.comwrote: You need to index the field you want to facet on. James Dyer E-Commerce Systems Ingram Content Group

RE: field with repeated data in index

2011-07-28 Thread Dyer, James
I'm not sure what you're getting at when you mention duplicate values, but pretty much any way I interpret it, its allowed. The only case it wouldn't be is if the field is your primary key and you try to index a second document with the same key as an existing document. In that case the

[WARNING] Index corruption and crashes in Apache Lucene Core / Apache Solr with Java 7

2011-07-28 Thread Uwe Schindler
Hello Apache Lucene Apache Solr users, Hello users of other Java-based Apache projects, Oracle released Java 7 today. Unfortunately it contains hotspot compiler optimizations, which miscompile some loops. This can affect code of several Apache projects. Sometimes JVMs only crash, but in several

solr.TrieFloatField with multiValued=false treated as `UnInverted multi-valued field`

2011-07-28 Thread RaVbaker
Hi! I have problem with coding own SearchComponent. My schema.xml is: ... fieldType name=decimal class=solr.TrieFloatField precisionStep=2 omitNorms=true positionIncrementGap=0 / ... field name=price_min type=decimal indexed=true stored=true multiValued=false / ... When I use

Index

2011-07-28 Thread GAURAV PAREEK
Hi All, How we can check the particular;ar file is not INDEX in solr ? Regards, Gaurav

Re: Index

2011-07-28 Thread Jonathan Rochkind
I have no idea what you mean. A file on your disk? What does INDEX in solr mean? Be more specific and clear, perhaps provide an example, and maybe someone can help you. On 7/28/2011 5:45 PM, GAURAV PAREEK wrote: Hi All, How we can check the particular;ar file is not INDEX in solr ?

Re: Index

2011-07-28 Thread Nicholas Chase
Do you mean, how can you check whether it has been indexed by solr, and is searchable? Nick On 7/28/2011 5:45 PM, GAURAV PAREEK wrote: Hi All, How we can check the particular;ar file is not INDEX in solr ? Regards, Gaurav

Re: Index

2011-07-28 Thread GAURAV PAREEK
Yes NICK you are correct ? how can you check whether it has been indexed by solr, and is searchable? On Fri, Jul 29, 2011 at 3:27 AM, Nicholas Chase nch...@earthlink.netwrote: Do you mean, how can you check whether it has been indexed by solr, and is searchable? Nick On 7/28/2011

Re: question about exception in faceting

2011-07-28 Thread Koji Sekiguchi
Correction: Except FacetComponent, HighlightComponent for example, if I use a bad regex pattern for RegexFragmenter, HighlightComponent throws an exception then Solr return 400. Solr returns 500 in this case actually. I think it should be 400 (bad request). koji -- Check out Query Log

Re: question about exception in faceting

2011-07-28 Thread Chris Hostetter
: If I got an exception during faceting (e.g. undefined field), Solr doesn't : return HTTP 400 but 200 with the exception stack trace in arr name=exception : .../arr tag. Why is it implemented so? I checked Solr 1.1 and saw the same behavior. super historic, pre-apache, code ... the idea at

Re: question about exception in faceting

2011-07-28 Thread Koji Sekiguchi
(11/07/29 8:52), Chris Hostetter wrote: : If I got an exception during faceting (e.g. undefined field), Solr doesn't : return HTTP 400 but 200 with the exception stack trace inarr name=exception : .../arr tag. Why is it implemented so? I checked Solr 1.1 and saw the same behavior. super