RE: Re: How to import data with a different date format

2010-09-08 Thread Markus Jelsma
less it's some sort of ID So it might be worth it to do some searching earlier rather than later .... Best Erick On Wed, Sep 8, 2010 at 12:34 PM, Markus Jelsma wrote: > No. The Datefield [1] will not accept it any other way. You could, however, > fool your boss and dump your dates i

RE: Re: Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma
if it's documented anywhere, try both I guess. But apparently it does exist in 1.4. Jonathan Markus Jelsma wrote: > Hi, > > I have an index with several collections. Every document has a collection > field that specifies the collection it belongs to. To make querying easier

RE: Re: Re: Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma
iants on a specific fq value 2010 at 1:32 PM, Markus Jelsma wrote: > Interesting! I haven't met the appends method before and i'll be sure to give > it a try tomorrow. Try, the wiki [1] is not very clear on what it really does. Here's a comment from the example solrcon

RE: Re: Re: Invariants on a specific fq value

2010-09-08 Thread Markus Jelsma
Excellent! You already made my day for tomorrow! I'll check it's behavior with fq parameters specifying the a filter for the same field! -Original message- From: Chris Hostetter Sent: Wed 08-09-2010 21:04 To: solr-user@lucene.apache.org; Subject: RE: Re: Re: Invariants on a specific fq v

RE: svn branch issues

2010-09-09 Thread Markus Jelsma
 http://svn.apache.org/repos/asf/lucene/dev/branches/   -Original message- From: Mark Allan Sent: Thu 09-09-2010 10:44 To: solr-user@lucene.apache.org; Subject: svn branch issues Hi all, As I've mentioned in the past, I've created some custom field types   which make use of the Abstract

Re: svn branch issues

2010-09-09 Thread Markus Jelsma
e? > Cheers > Mark > > On 9 Sep 2010, at 10:47 am, Markus Jelsma wrote: > > http://svn.apache.org/repos/asf/lucene/dev/branches/ > > > > -Original message- > > From: Mark Allan > > Sent: Thu 09-09-2010 10:44 > > To: solr-user@lucene.apache.org

Re: Indexing checksum of field value

2010-09-09 Thread Markus Jelsma
custom tokenizer be a good way to > implement it? > > /Staffan > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: Inconsistent search results with multiple keywords

2010-09-09 Thread Markus Jelsma
reated,changed,score,path,url,uid,name&start=0&facet.sort=true&q=keyword1 > +keyword2+keyword3+keyword4&bf=recip(rord(created),4,10704,10704)^200.0&fac > et.field=im_cck_field_author&facet.field=type&facet.field=im_vid_1=&indent= > on&start=0&version=2.

Re: Garbled facets even in a zero hit search

2010-09-09 Thread Markus Jelsma
> > The schema is as following: > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: how to normalize a query

2010-09-09 Thread Markus Jelsma
malized a query so that e.g. q="one two" > equals q="two one" > > Cheers, > Dan > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Re: Inconsistent search results with multiple keywords

2010-09-09 Thread Markus Jelsma
Indeed, it's the dismax, i missed it! My bad..   -Original message- From: Ahmet Arslan Sent: Thu 09-09-2010 20:37 To: solr-user@lucene.apache.org; Subject: Re: Inconsistent search results with multiple keywords > yes, my schema.xml file have  defaultOperator="AND"/> which > is why I th

RE: roadmap

2010-09-09 Thread Markus Jelsma
You should check Jira's roadmap [1] instead. It shows a clear picture of what has been done since the 1.4.1 release and pending issues for the 3.x branch and others.   [1]: https://issues.apache.org/jira/browse/SOLR?report=com.atlassian.jira.plugin.system.project:roadmap-panel     -Orig

RE: Re: Re: Invariants on a specific fq value

2010-09-09 Thread Markus Jelsma
ssage- From: Yonik Seeley Sent: Wed 08-09-2010 19:38 To: solr-user@lucene.apache.org; Subject: Re: Re: Invariants on a specific fq value 2010 at 1:32 PM, Markus Jelsma wrote: > Interesting! I haven't met the appends method before and i'll be sure to give > it a try tomorrow.

RE: Help on spelling.

2010-09-09 Thread Markus Jelsma
I don't see you passing spellcheck parameters in the query string. Are they configured as default in your search handler?   -Original message- From: Gregg Hoshovsky Sent: Thu 09-09-2010 22:40 To: solr-user@lucene.apache.org; Subject: Help on spelling. I am trying to use the spellchecker

RE: How to Update Value of One Field of a Document in Index?

2010-09-10 Thread Markus Jelsma
The MoreLikeThis component actually can accept external input: http://wiki.apache.org/solr/MoreLikeThisHandler#Using_ContentStreams   -Original message- From: Jonathan Rochkind Sent: Fri 10-09-2010 18:59 To: solr-user@lucene.apache.org; Subject: RE: How to Update Value of One Field of a

RE: multivalued fields in result

2010-09-11 Thread Markus Jelsma
Yes, you'll get what is stored and asked for.   -Original message- From: Jason Chaffee Sent: Sat 11-09-2010 05:27 To: solr-user@lucene.apache.org; Subject: multivalued fields in result Is it possible to return multivalued files in the result?   I would like to have a multivalued field

RE: Re: solr.DateField: org.apache.solr.common.SolrException: Error while creating field

2010-09-14 Thread Markus Jelsma
It would be a nice feature if Solr supports queries with time zone support on an index where all times are UTC. There is some chatter about this in SOLR-750 but i haven't found an issue that would add support for time zone queries.   Did i do a lousy search or is the issue missing as of yet?  

Re: Handling Aggregate Records/Roll-up in Solr

2010-09-16 Thread Markus Jelsma
he records passed on a property or do we need to just flatten > the representation of the shirt in the data model. > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Re: Get all results from a solr query

2010-09-16 Thread Markus Jelsma
Not according to the wiki; http://wiki.apache.org/solr/CommonQueryParameters#rows   But you could always create an issue for this one.   -Original message- From: Christopher Gross Sent: Thu 16-09-2010 22:50 To: solr-user@lucene.apache.org; Subject: Re: Get all results from a solr quer

Re: Search the mailinglist?

2010-09-17 Thread Markus Jelsma
t; but there isnt any convinient way to search through the archive. > > Thanks for your help > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: Re: Calculating distances in Solr using longitude latitude

2010-09-20 Thread Markus Jelsma
Hi,   In the early Solr 1.3 times we had an index with leisure-time objects that included geographical coordinates. Based on certain conditions we had to display a specific list of nearby objects. We simply implemented some Great Circle calculations such as the distance between points [1] and

RE: Calgary Solr Consultant?

2010-09-23 Thread Markus Jelsma
Companies and people that offer support should be listed on the wiki, although just a few take the effort to edit the wiki: http://wiki.apache.org/solr/Support   -Original message- From: Ryan Courtnage Sent: Thu 23-09-2010 20:22 To: solr-user@lucene.apache.org; Subject: Calgary Solr C

RE: Grouping in solr ?

2010-09-23 Thread Markus Jelsma
http://wiki.apache.org/solr/FieldCollapsing https://issues.apache.org/jira/browse/SOLR-236   -Original message- From: Papp Richard Sent: Thu 23-09-2010 21:29 To: solr-user@lucene.apache.org; Subject: Grouping in solr ? Hi all,  is it possible somehow to group documents?  I have servic

RE: Search a URL

2010-09-23 Thread Markus Jelsma
Try setting generateWordParts=1 in your WDF. Also, having a WhitespaceTokenizer makes little sense for URL's, there should be no whitespace in a URL, the StandardTokenizer can tokenize a URL. Anyway, the problem is your WDF.   -Original message- From: Max Lynch Sent: Thu 23-09-2010 23:00

Re: Search a URL

2010-09-24 Thread Markus Jelsma
die. > > Read 'Hot, Flat, and Crowded' > Laugh at http://www.yert.com/film.php > > --- On Thu, 9/23/10, Markus Jelsma wrote: > > From: Markus Jelsma > > Subject: RE: Search a URL > > To: solr-user@lucene.apache.org > > Date: Thursday, September 23,

Re: Can Solr do approximate matching?

2010-09-24 Thread Markus Jelsma
ld see how similar ones were > >>> answered. > >>> > >>> And my question is, does Solr support some "find similar" > >>> functionality. For example, in my mind, sentence "I like tasty > >>> strawberries" is 'similar' to a sentence such as "I like yummy > >>> strawberries", just because both have a few of the same words. > >>> > >>> So, to end my long winded query, how would I implement a "find top ten > >>> similar items to this one" functionality? > >>> > >>> Thanks! > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

Re: upgrade index from 2.9 to 3.x

2010-09-24 Thread Markus Jelsma
; when I try to start a server with my old index. > > Thanks in advance, > Mike > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

RE: how are you using Solr?

2010-09-26 Thread Markus Jelsma
http://wiki.apache.org/solr/PublicServers http://www.lucidimagination.com/developer/Community/Application-Showcase-Wiki   -Original message- From: Girish Pandit Sent: Sun 26-09-2010 14:16 To: solr-user@lucene.apache.org; Subject: how are you using Solr? I am trying to understand the wid

RE: spellcheck on multiple fields?

2010-09-27 Thread Markus Jelsma
You can use copyField to get multiple fields in the field you use for spell checking, don't forget to set it to multiValued.   -Original message- From: Savannah Beckett Sent: Mon 27-09-2010 10:08 To: solr-user@lucene.apache.org; Subject: spellcheck on multiple fields? Is it possible to

RE: Solr Deduplication and Field Collpasing

2010-09-28 Thread Markus Jelsma
You could create a custom update processor that adds a digest field for newly added documents that do not have the digest field themselves. This way, the documents that are not added by Nutch get a proper non-empty digest field so the deduplication processor won't create the same empty hash and

RE: Re: Solr Deduplication and Field Collpasing

2010-09-28 Thread Markus Jelsma
r with new indexing again if I don't have to. Is there a way I can update the digest field with the value from the corresponding I'd field using solr? Thanks Raj ----- Original Message - From: Markus Jelsma To: solr-user@lucene.apache.org Sent: Tue Sep 28 18:19:17 2010 S

RE: Re: Solr Deduplication and Field Collpasing

2010-09-28 Thread Markus Jelsma
 Correction, Java heap size should be RAM buffer size if i'm not too mistaken.   -Original message- From: Markus Jelsma Sent: Wed 29-09-2010 01:17 To: solr-user@lucene.apache.org; Subject: RE: Re: Solr Deduplication and Field Collpasing If you can set the digest field for your

RE: Missing facet values for zero counts

2010-09-29 Thread Markus Jelsma
I'm afraid you'd have to add the missing countries in your application. If it's not in the index, it will not be returned. You last question is possible, the facet.query parameter allows you to rely on other conditions to generate a facet count. But if the missing countries are not in the index,

RE: updating the solr index

2010-09-30 Thread Markus Jelsma
Updates will not show up if they weren't committed, either through a manual commit or auto commit.   -Original message- From: Vicedomine, James (TS) Sent: Thu 30-09-2010 21:04 To: solr-user@lucene.apache.org; Subject: updating the solr index Sometimes with I update the solr index (for

RE: Automatic xslt to responses ??

2010-09-30 Thread Markus Jelsma
You can add a default setting to your request handler. Read about defaults, appends and invariants in requesthandlers defined in your solrconfig.xml.   -Original message- From: Sharma, Raghvendra Sent: Thu 30-09-2010 19:17 To: solr-user@lucene.apache.org; Subject: Automatic xslt to resp

RE: can i have more update processors with solr

2010-09-30 Thread Markus Jelsma
Almost, you can define a updateRequestProcessorChain that houses multiple update processors.           true   title_signature   true   title   org.apache.solr.update.processor.Lookup3Signature           true   content_signature   true   content   org.

RE: parsedquery is different from querystrin

2010-09-30 Thread Markus Jelsma
We cannot really give an answer without knowing your fieldType and query. We can see that the blackberry => blackberri is caused by a stemmer you have, perhaps a porter or snowball stemmer. Anyway, that's normal.   -Original message- From: abhayd Sent: Thu 30-09-2010 20:32 To: solr-user@

RE: Is Solr right for my business situation ?

2010-09-30 Thread Markus Jelsma
Recent versions supports sharding and handles distribution of your query and result set merging. The problem, it won't help you to join on separate `tables`. The fields you query need to be present in each shard or you'll end up with an HTTP 400 - undefined field error.   Indeed, there is no e

RE: PatternReplaceFilterFactory creating empty string as a term

2010-10-05 Thread Markus Jelsma
I'm not sure if this is the best approach but a LengthFilter will stop blank terms. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.LengthFilterFactory   -Original message- From: Shawn Heisey Sent: Wed 06-10-2010 00:25 To: solr-user@lucene.apache.org; Subject: PatternRe

RE: PatternReplaceFilterFactory creating empty string as a term

2010-10-05 Thread Markus Jelsma
and some memory.   -Original message----- From: Markus Jelsma Sent: Wed 06-10-2010 00:29 To: solr-user@lucene.apache.org; Subject: RE: PatternReplaceFilterFactory creating empty string as a term I'm not sure if this is the best approach but a LengthFilter will stop blank terms. http://

RE: Re: Umlaut in facet name attribute

2010-10-05 Thread Markus Jelsma
It is a good practice (for many cases as seen on the list) to search (usually with fq) on analzyed fields but return the facet list based on the unanalyzed counterparts.   -Original message- From: Savvas-Andreas Moysidis Sent: Wed 06-10-2010 00:46 To: solr-user@lucene.apache.org; Subjec

Re: Invalid boolean value for query with exclamation

2010-10-07 Thread Markus Jelsma
kenFilters Cheers -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Markus Jelsma
t;> > >> > >> I have restarted/reindexed everything but I still cannot search > >> > >> hoot > >> > >> and get back the company named Shooter. searching shooter is fine. > >> > >> I have followed other example

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Markus Jelsma
ng else here? > > Cheers > > On Oct 8, 2010, at 9:02 AM, Allistair Crossley wrote: > > Oh my. I am basically being a total monkey. Every time I was changing my > > schema.xml to try new things out I was then reindexing our staging > > server's index instead of m

Re: Missing content stream

2010-10-08 Thread Markus Jelsma
ot;post"> > > > > > > > I selected one of the sample xml documents that is provided with the Solr > install for my file. When I submit the file, I get a "Missing Content > Stream" error. Note the "." after localhost just allows me

Re: Help removing myself from this mailing list

2010-10-08 Thread Markus Jelsma
emove my email address. Any help would be really appreciated! > > Steve -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Disable (or prohibit) per-field overrides

2010-10-11 Thread Markus Jelsma
Hi, Anyone knows useful method to disable or prohibit the per-field override features for the search components? If not, where to start to make it configurable via solrconfig and attempt to come up with a working patch? Cheers, -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in

Re: How to manage different indexes for different users

2010-10-11 Thread Markus Jelsma
'm wondering whether I'd have to append their user ID as I index a file. > I'm not sure which approach to follow. Is there a sample or a doc I can read > to understand how to approach this problem? > > Thanks in advance. > > -- > Regards, > > Tharind

Re: How to manage different indexes for different users

2010-10-11 Thread Markus Jelsma
Well, set the user ID for each document and use a filter query to filter only on field:. On Mon, 11 Oct 2010 23:25:29 +0530, Tharindu Mathew wrote: On Mon, Oct 11, 2010 at 10:48 PM, Markus Jelsma wrote: Then you probably read on how to create [1] the new core. Keep in mind, you might need

Re: Disable (or prohibit) per-field overrides

2010-10-11 Thread Markus Jelsma
ct 2010 16:12:35 -0400, Erick Erickson wrote: Have you looked at "invariants" in solrconfig.xml? Best Erick On Mon, Oct 11, 2010 at 12:23 PM, Markus Jelsma wrote: Hi, Anyone knows useful method to disable or prohibit the per-field override features for the search components? If no

Re: Faceting and omitNorms=true

2010-10-12 Thread Markus Jelsma
; > "faceting: indexed='true'" >-- http://wiki.apache.org/solr/FieldOptionsByUseCase > > "Only full-text fields or fields that need an index-time boost need >norms." > http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/conf/sch > ema.xml > > > Thank you, > Paolo -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: Faceting and omitNorms=true

2010-10-12 Thread Markus Jelsma
quick reply. > Do you have a recommendation|suggestion about positionIncrementGap? > > Paolo > > Markus Jelsma wrote: > > You can omit norms in field that you only facet on. The matrix only lists > > mandatory values for those parameters. > > > > On Tu

Re: useFastVectorHighlighter creates fragments with cut off terms, incomplete

2010-10-12 Thread Markus Jelsma
I believe it was this one https://issues.apache.org/jira/browse/LUCENE-1824 > The natives have been getting restless since I turned off the highlighter > on our application quite a few months ago.In an attempt to rectify > that I've migrated to Solr 3.1 (found it quite stable so far) in hopes

Spellcheck issues in 3.1

2010-10-13 Thread Markus Jelsma
ay.jetty.HttpConnection.handle(HttpConnection.java:404) at org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228) at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582) Cheers, -- Markus Jelsma - CTO - Openindex http://www.linked

Re: Spellcheck issues in 3.1

2010-10-13 Thread Markus Jelsma
Nice, that's the trick to remember. On Wednesday, October 13, 2010 12:16:07 pm Robert Muir wrote: > > SEVERE: java.lang.NoSuchMethodError: > > org.apache.lucene.analysis.standard.StandardFilter.(Lorg/apache/luc > > ene/util/Version;Lorg/apache/lucene/analysis/TokenStrea

Re: Faceting and first letter of fields

2010-10-14 Thread Markus Jelsma
Here's a very recent thread on the matter: http://lucene.472066.n3.nabble.com/facet-method-enum-vs-fc-td1681277.html > Thanks Yonik. I hadn't actually been using "enum" on facets with a > small number of unique values; the wiki page doesn't give much guidance > on when each is called for. Do you

Re: Disable (or prohibit) per-field overrides

2010-10-17 Thread Markus Jelsma
Hi, Thanks for the suggestion and pointer. We've implemented it using a single regex in Nginx for now. Cheers, > : Anyone knows useful method to disable or prohibit the per-field override > : features for the search components? If not, where to start to make it > : configurable via solrconfig

Re: SolrJ new javabin format

2010-10-17 Thread Markus Jelsma
Well, in Nutch we simply replace the two jars and it all still works. > The CHANGES.txt file in branch_3x says that the javabin format has > changed in Solr 3.1, so you need to update SolrJ as well as Solr. Is > the SolrJ included in 3.1 compatible with both 3.1 and 1.4.1? If not, > that's goi

Re: Disable (or prohibit) per-field overrides

2010-10-18 Thread Markus Jelsma
yes: > > http://wiki.apache.org/solr/SearchHandler#Configuration > > Markus Jelsma wrote: > > Hi, > > > > Thanks for the suggestion and pointer. We've implemented it using a > > single regex in Nginx for now. > > > > Cheers, > > >

Re: Removing Common Web Page Header and Footer from All Content Fetched by Nutch

2010-10-19 Thread Markus Jelsma
mon header is less than a certain value. Same applies > > to the footer content common to all pages. > > > > Is there a way to ignore certain "stop phrase" so to speak in the Nutch > > configuration based on levenshtein distance or jaro winkler distance so &g

Re: Uppercase and lowercase queries

2010-10-19 Thread Markus Jelsma
om: class="solr.StrField" sortMissingLast="true" omitNorms="true"> > > TO: > > omitNorms="true"> > > > > > > > > > > > > But it still doesnt show any results when I query "boston"...why? -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: Uppercase and lowercase queries

2010-10-19 Thread Markus Jelsma
t; > > > > > > > > > > > > > > > > > > > > > > > > > But it still doesnt show any results when I query "boston"...why? > > -- > > View this message in context: > > http://lucene.472066.n3.nabble.com/Uppercase-and-lowercase-queries-tp1731 > > 349p1731349.html Sent from the Solr - User mailing list archive at > > Nabble.com. -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: **SPAM** Re: boosting injection

2010-10-19 Thread Markus Jelsma
query > > > > q=title:history AND author:joyce > > > > is it possible to "transform" at runtime this query in this way: > > > > q=title:history^10 AND author:joyce^5 > > > > ? > > > > Best regards, > > Andrea -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: Mulitple facet - fq

2010-10-20 Thread Markus Jelsma
; and "corporate" ones, I tried > fq=category:corporate&fq=category:personal > > It looks easy, but I can't find the solution. > > > -- > > Yavuz Selim YILMAZ -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: facet Prefix (or term prefix)

2010-10-22 Thread Markus Jelsma
t have to code this in my service layer having received all > facets from SOLR (without the prefix) > > Thanks for any help. > > > > > If you wish to view the St. James's Place email disclaimer, please use the > link below > > http://www.sj

Re: How to use AND as opposed to OR as the default query operator.

2010-10-25 Thread Markus Jelsma
t/pittaya/using-apache-solr on Slide number 52 > that this OR behavior is configurable. > > Could you please tell me where this configuration is located? I could not > locate it in schema.xml. > > Swapnonil Mukherjee > +91-40092712 > +91-9007131999 -- Markus Jelsma - CTO

Re: documentCache clarification

2010-10-27 Thread Markus Jelsma
LazyFieldLoading set to true and in my request > parameters specify "fl=id", then the number of fields per document > shouldn't affect the memory usage of the document cache, right? > > Thanks, > --jay -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: Stored or indexed?

2010-10-27 Thread Markus Jelsma
http://wiki.apache.org/solr/FieldOptionsByUseCase] > Hi all- > > I've read through the documentation, but I'm still a little confused about > the tag, in terms of the indexed and stored attributes. If I have > something marked as indexed="true", why would I ever want stored="false"? > Are there

Re: Start parameter and result grouping

2010-10-31 Thread Markus Jelsma
Ah, seems you're just one day behind. SOLR-2207, paging with field collapsing, has just been resolved: https://issues.apache.org/jira/browse/SOLR-2207 > Hi, > > I'm trying to implement paging when grouping is on. > > Start parameter works, but the result contains all the documents that were >

Re: Start parameter and result grouping

2010-10-31 Thread Markus Jelsma
Oh, and see the just updated wiki page as well: http://wiki.apache.org/solr/FieldCollapsing > Ah, seems you're just one day behind. SOLR-2207, paging with field > collapsing, has just been resolved: > https://issues.apache.org/jira/browse/SOLR-2207 > > > Hi, > > > > I'm trying to implement pagin

Re: Solr in virtual host as opposed to /lib

2010-11-01 Thread Markus Jelsma
No, he didn't make a mistake but you did. Next time, please start a new thread not by conveniently replying to an existing thread and just changing the subject. Now we have two threads in thread. :) > I don't think you read the entire thread. I'm assuming you made a mistake. > > -Original M

Re: Highlighting and maxBooleanClauses limit

2010-11-02 Thread Markus Jelsma
nk it is highlighter bug. Highlighter should skip the range query > when user set requireFieldMatch to true, because your range query is for > another field. If so, please open a jira issue. > > Koji -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Negative or zero value for fieldNorm

2010-11-03 Thread Markus Jelsma
uns on a Solr 1.4.1 instance with Java 6 under the hood. Help or directions are appreciated =) Cheers, -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

Re: Negative or zero value for fieldNorm

2010-11-03 Thread Markus Jelsma
oc in the result set) and does doc 1462 have a very very small fieldNorm? debugOutput for doc 7: 0.0 = fieldNorm(field=title, doc=7) Luke on the title field of doc 7. 1.0 Thanks for your reply! > -Yonik > http://www.lucidimagination.com > > > > On Wed, Nov 3, 2010 at 2:30

Re: Negative or zero value for fieldNorm

2010-11-04 Thread Markus Jelsma
, why does the title field return a fieldNorm=0 for many queries? And a subquestion, does the luke request handler return boost values for documents? I know i get boost values for fields but i haven't seen boost values for documents. Cheers, On Wednesday 03 November 2010 20:44:48 Markus Je

Re: Negative or zero value for fieldNorm

2010-11-04 Thread Markus Jelsma
ocument but all i ever see returned is boost=1.0. Is this correct? Anyway, i'm looking at Nutch now for reasons why i sends a zero boost on a docuement. On Thursday 04 November 2010 14:16:22 Yonik Seeley wrote: > On Thu, Nov 4, 2010 at 8:04 AM, Markus Jelsma > > wrote: > >

Re: Negative or zero value for fieldNorm

2010-11-04 Thread Markus Jelsma
On Thursday 04 November 2010 15:12:23 Yonik Seeley wrote: > On Thu, Nov 4, 2010 at 9:51 AM, Markus Jelsma > > wrote: > > I've done some testing with the example docs and it behaves similar when > > there is a zero doc boost. Luke, however, does not show me

Re: Optimize Index

2010-11-04 Thread Markus Jelsma
Huh? That's something new for me. Optmize removed documents that have been flagged for deletion. For relevancy it's important those are removed because document frequencies are not updated for deletes. Did i miss something? > For what it's worth, the Solr class instructor at the Lucene Revoluti

Replication and ignored fields

2010-11-05 Thread Markus Jelsma
n `ignored` fieldType and set the fields that are unused on the slaves to use the ignored fieldType. ..it doesn't work and makes perfectly sense because it's just the index files that get copied over. The question, how to ignore fields with replication? Cheers, -- Markus Jelsma -

Re: Replication and ignored fields

2010-11-05 Thread Markus Jelsma
solution architect > Cominvent AS - www.cominvent.com > > On 5. nov. 2010, at 16.11, Markus Jelsma wrote: > > Hi, > > > > I've got an ordinary master/slave replication set up. The master contains > > several fields that are not used by the slaves but are used by

Re: Using Multiple Cores for Multiple Users

2010-11-09 Thread Markus Jelsma
Hi, > All, > > I have a web application that requires the user to register and then login > to gain access to the site. Pretty standard stuff...Now I would like to > know what the best approach would be to implement a "customized" search > experience for each user. Would this mean creating a sepa

RE: Reduce QueryComponent prepare time

2012-11-20 Thread Markus Jelsma
re time > > Markus, > > It's hard to suggest anything until you provide a profiler snapshot which > says what it spends time in prepare for. As far as I know in prepare it > parses queries e.g. we have a really heavy query parsers, but I don't think > it's rea

RE: Reduce QueryComponent prepare time

2012-11-21 Thread Markus Jelsma
performant* (and *might be not* for you) than current eDismax. > Nothing more useful from me. > > Bye. > > > On Tue, Nov 20, 2012 at 7:01 PM, Markus Jelsma > wrote: > > > Hi, > > > > Profiling pointed me directly to the method i already suspected: > > Exte

Recip m parameter to take function value

2012-11-21 Thread Markus Jelsma
Hi, We need the recip function's m-parameter to take other functions e.g. recip(dateField, div(1,prod(1,2)), 1,1) but ValueSourceParser want to read a float instead. How could we modifiy either Solr or Lucene as well to take functions for that parameter? I've been looking at the various extende

RE: Solr UIMA with KEA

2012-11-22 Thread Markus Jelsma
See: http://nutch.apache.org/apidocs-2.1/org/apache/nutch/crawl/AdaptiveFetchSchedule.html -Original message- > From:nutchsolruser > Sent: Fri 23-Nov-2012 06:53 > To: solr-user@lucene.apache.org > Subject: Solr UIMA with KEA > > Is there any way we can extract tags or keyphrases from

RE: Solr UIMA with KEA

2012-11-22 Thread Markus Jelsma
Sorry, wrong list :) -Original message- > From:Markus Jelsma > Sent: Fri 23-Nov-2012 08:32 > To: solr-user@lucene.apache.org > Subject: RE: Solr UIMA with KEA > > See: > http://nutch.apache.org/apidocs-2.1/org/apache/nutch/crawl/AdaptiveFetchSchedule.html > > > -Original messag

RE: Spellchecker for multiple sites (and languages?)

2012-11-26 Thread Markus Jelsma
Hi - check the new spellchecker collate options. It limits spellchecker suggestions to the fq restrictions. If you filter on specific hosts, the spellchecker will only provide suggestions that are found in that host. Same goes for language. http://wiki.apache.org/solr/SpellCheckComponent#spellc

RE: SolrCloud(5x) - Errors while recovering

2012-11-27 Thread Markus Jelsma
Seems you got this issue: https://issues.apache.org/jira/browse/SOLR-4032 -Original message- > From:deniz > Sent: Tue 27-Nov-2012 05:04 > To: solr-user@lucene.apache.org > Subject: SolrCloud(5x) - Errors while recovering > > Here is briefly what is happening: > > I have a simple SolrCl

RE: positions and qf parameter in (e)dismax

2012-11-27 Thread Markus Jelsma
Hi - no we're not getting any errors because we enabled positions on all fields that are also listed in the qf-parameter. If we don't, and send a phrase query we would get an error such as: java.lang.IllegalStateException: field "h1" was indexed without position data; cannot run PhraseQuery (te

RE: SolrCloud(5x) - Errors while recovering

2012-11-27 Thread Markus Jelsma
It only seems to happen if a node dies while indexing. -Original message- > From:deniz > Sent: Tue 27-Nov-2012 10:34 > To: solr-user@lucene.apache.org > Subject: RE: SolrCloud(5x) - Errors while recovering > > another update > > having 300K docs causes the same error... > > I think th

RE: Extreme index size reduction on 4.1-SNAPSHOT?

2012-11-27 Thread Markus Jelsma
Hi, please check this issue: https://issues.apache.org/jira/browse/LUCENE-4226 But it is enabled because of: https://issues.apache.org/jira/browse/LUCENE-4509 Since it's suddenly default you would have to completely wipe the index and reindex the data, at least i had to, because of numerous code

RE: positions and qf parameter in (e)dismax

2012-11-28 Thread Markus Jelsma
uery) encounters a phrase and the field does not have > positions, a BooleanQuery with MUST would be generated instead of the > PhraseQuery. > > -- Jack Krupansky > > -Original Message- > From: Markus Jelsma > Sent: Tuesday, November 27, 2012 4:27 AM > To:

RE: Best way to increase boost to results that 'starts with' search keyword

2012-11-30 Thread Markus Jelsma
This issue adds the SpanFirstQuery to edismax. https://issues.apache.org/jira/browse/SOLR-3925 It unfortuntately cannot produce progressively higher boosts if the term is closer to the beginning. -Original message- > From:Jack Krupansky > Sent: Fri 30-Nov-2012 18:54 > To: solr-user@

RE: Exceptions in branch_4x log

2012-11-30 Thread Markus Jelsma
Hi, try updating your check out, i think that's fixed now. https://issues.apache.org/jira/browse/SOLR-4117 -Original message- > From:Shawn Heisey > Sent: Fri 30-Nov-2012 22:21 > To: solr-user@lucene.apache.org > Subject: Exceptions in branch_4x log > > This is branch_4x, checked out 20

The shard called `properties`

2012-12-05 Thread Markus Jelsma
Hi, We're suddenly seeing a shard called `properties` in the cloud graph page when testing today's trunk with a clean Zookeeper data directory. Any idea where it comes from? We have not changed the solr.xml on any node. Thanks

RE: Synomyns.txt not working with wildcards in query

2012-12-06 Thread Markus Jelsma
Hi Query's with wildcards or fuzzy operators are called multi term queries and do not pass through the field's analyzer as you might expect. See: http://wiki.apache.org/solr/MultitermQueryAnalysis -Original message- > From:Pratyul Kapoor > Sent: Thu 06-Dec-2012 06:28 > To: solr-user

RE: Disable term frequency for some fields in solr

2012-12-06 Thread Markus Jelsma
Hi, You can either use omitTermFreqAndPositions on that field or set a custom similarity for that field that returns 1 for tf > 0. http://wiki.apache.org/solr/SchemaXml#Common_field_options http://wiki.apache.org/solr/SchemaXml#Similarity -Original message- > From:Amit Jha > Sent: T

RE: The shard called `properties`

2012-12-06 Thread Markus Jelsma
ks for the pointer to the issue. > > Unfortunately, it seems that clients have to be hard coded to realize > properties is not a shard unless we add another nested layer. > > Should be 100% harmless. > > - Mark > > On Dec 5, 2012, at 5:05 AM, Markus Jelsma wrote

RE: Minimum HA Setup with SolrCloud

2012-12-06 Thread Markus Jelsma
The quorum is the minimun, so it depends on how many you have running in the ensemble. If it's three or four, then two is the quorum and therefore the minumum. Three is regarded as a minumum in the ensemble because two makes no sense. -Original message- > From:Jack Krupansky > Sent: T

RE: Minimum HA Setup with SolrCloud

2012-12-06 Thread Markus Jelsma
-Original message- > From:Yonik Seeley > Sent: Thu 06-Dec-2012 16:01 > To: solr-user@lucene.apache.org > Subject: Re: Minimum HA Setup with SolrCloud > > On Thu, Dec 6, 2012 at 9:56 AM, Markus Jelsma > wrote: > > The quorum is the minimun, so it depends on h

<    2   3   4   5   6   7   8   9   10   11   >