Re: Solr and Zend Lucene

2009-03-06 Thread revas
The Luke request handler returns all the tokens from the index, is this correct? On 3/5/09, revas revas...@gmail.com wrote: We will be using SQLite for the db. This can be used for a CD version where we need to provide search On 3/5/09, Grant Ingersoll gsing...@apache.org wrote: On Mar 5,

search on id value- in a range

2009-03-06 Thread Neha Bhardwaj
Hi all, I wish to perform a search on the index, based on a field called id (type: Integer). I want Solr to return all the ids within a specified range, i.e. I need to fetch all records that have their id value between, say, 11-22 or 55-60. Is there any provision to do it via solr
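
For a request like the one above, the standard Solr/Lucene query syntax supports inclusive range clauses of the form field:[low TO high]. The following is a minimal sketch of building such a clause, assuming a field named id as in the question; the helper names range_query and any_of_ranges are hypothetical. One caveat, stated as an assumption about the poster's setup: in Solr of this era a plain integer field type compares values as strings, so a sortable type such as sint is usually needed for numeric ranges to behave as expected.

```python
def range_query(field, low, high):
    """Build an inclusive Solr range clause such as id:[11 TO 22]."""
    if low > high:
        raise ValueError("low must not exceed high")
    return f"{field}:[{low} TO {high}]"


def any_of_ranges(field, ranges):
    """OR several inclusive ranges together for the same field."""
    return " OR ".join(range_query(field, lo, hi) for lo, hi in ranges)


# The two ranges mentioned in the question: 11-22 or 55-60.
print(any_of_ranges("id", [(11, 22), (55, 60)]))
```

The resulting string would be passed as the q (or fq) parameter of a normal select request.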

Ignoring Whitespace

2009-03-06 Thread Gargate, Siddharth
After parsing HTML documents, Tika adds whitespace (newlines and tabs), and this content gets stored as is in Solr. If I fetch the teasers, the teasers contain this additional whitespace. How do I remove it? In Tika, in Solr, or explicitly in my own code? Thanks, Siddharth
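
One of the options raised in the question is to clean the text in the client's own code before it is sent to Solr, so that the stored value (and therefore any teaser built from it) is already normalized. A minimal sketch of that option follows; the function name collapse_whitespace is hypothetical, and the sketch makes no claim about what Tika or Solr can do on their own.

```python
import re

# One or more whitespace characters: spaces, tabs, newlines, etc.
_WHITESPACE_RUN = re.compile(r"\s+")


def collapse_whitespace(text):
    """Replace every run of whitespace with a single space and trim the ends."""
    return _WHITESPACE_RUN.sub(" ", text).strip()


# Example: text as it might come out of an HTML parser.
print(collapse_whitespace("Intro\n\n\t  to\tSolr \n"))
```

This changes only what is stored; how the field is tokenized for searching is still governed by its analyzer.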

Re: DataImportHandler and delta-import question

2009-03-06 Thread Marc Sturlese
Hey, You can download the DataImportHandler from the Solr 1.3 release; that should work, as it does not implement rollbacks. If you want to test a nightly, I think it is always best to test one of the recent ones. Tim Garafola wrote: Thanks. Can you recommend a build I can try? On Thu, Mar 5, 2009

Luke request handler

2009-03-06 Thread revas
Hi, I just want to confirm my understanding of the Luke request handler. It gives us the raw Lucene index tokens on a field-by-field basis. What should the query be to return all tokens for a field? Is there any way to return all the tokens across all fields? Regards, Revas

Re: what crawler do you use for Solr indexing?

2009-03-06 Thread Toby Cole
Hi Tony, Strangely I started looking into the Solr/Nutch integration yesterday so I might be able to help :) The documentation for it is very sparse, but the trunk of nutch does have the solr integration committed. If I remember correctly, what I had to do was... I went through one of

Re: what crawler do you use for Solr indexing?

2009-03-06 Thread Ryan McKinley
Also consider droids: http://incubator.apache.org/droids/ On Mar 5, 2009, at 6:32 PM, Tony Wang wrote: Hi, I wonder if there's any open source crawler product that could be integrated with Solr. What crawler do you guys use? or you coded one by yourself? I have been trying to find out

How can I configure different types in Solr?

2009-03-06 Thread Praveen_Kumar_J
Hi, how do I configure different custom types or schemas in Solr? Assume I have some custom types type1 and type2 (some composite classes). Can I configure these 2 types in a single schema file? I need these 2 types to be online for creating and searching data. Please provide me some sample

Re: Shard Query Problem

2009-03-06 Thread Anshul jain
Hi Chris, Thanks for the reply. Here are the request handlers from solrconfig.xml: <requestHandler name="standard" class="solr.SearchHandler" default="true"> <!-- default values for query parameters --> <lst name="defaults"> <str name="echoParams">explicit</str> <int name="rows">10</int> <str

Re: what crawler do you use for Solr indexing?

2009-03-06 Thread Tony Wang
Thank you all so much! I sincerely appreciate the help received. Tony On Fri, Mar 6, 2009 at 5:02 AM, Toby Cole toby.c...@semantico.com wrote: Hi Tony, Strangely I started looking into the Solr/Nutch integration yesterday so I might be able to help :) The documentation for it is very

CJKAnalyzer and Chinese Text sort

2009-03-06 Thread Sachin
For some reason this never made it to the mailing list, hence the re-post. - Hi All, Is there any way to sort Chinese text in Solr? We have currently set up schema.xml to use CJKAnalyzer/CJKTokenizer for analyzing/tokenizing the text, and the sort is done on a field which only uses

Re: what crawler do you use for Solr indexing?

2009-03-06 Thread Sean Timm
We too use Heritrix. We tried Nutch first but Nutch was not finding all of the documents that it was supposed to. When Nutch and Heritrix were both set to crawl our own site to a depth of three, Nutch missed some pages that were linked directly from the seed. We ended up with 10%-20% fewer pages

Re: what crawler do you use for Solr indexing?

2009-03-06 Thread Sean Timm
See http://crawler.archive.org/faq.html#new_writer For other Heritrix questions, this should probably go to the Heritrix list. -Sean Tony Wang wrote: Sean - I found Heritrix is pretty easy to set up. I am testing it on my server here http://66.197.161.133:8081, and am trying to create crawl

Search for a field whose value contains instead of begins with

2009-03-06 Thread Shyamsunder Reddy
I am indexing a document with field 'name'. The values of the field 'name' are like: Apple Corp, Apple iPod, Apple Games, Indian Apple, West Apple Corporation and The Applemint Gimlet. Now my question is how to configure the field 'name' in my Solr configuration and schema, so that when I

Re: How can I configure different types in Solr?

2009-03-06 Thread Matthew Runo
I'm not 100% sure what you mean by custom types, but if you're talking about objects then there's no reason they can't both be in your schema. Any given document does not need to have all the fields in it, so you could flatten them both into one schema if you wanted. You could also use the

Re: Search for a field whose value contains instead of begins with

2009-03-06 Thread Shalin Shekhar Mangar
On Fri, Mar 6, 2009 at 9:24 PM, Shyamsunder Reddy sjh...@yahoo.com wrote: I am indexing a document with field 'name'. The values of the field 'name' are like: Apple Corp, Apple iPod, Apple Games, Indian Apple, West Apple Corporation and The Applemint Gimlet. Now my question is how to

bug report in INFO link at solr admin page using multicore

2009-03-06 Thread Marc Sturlese
Hey there, I think I have detected a bug in the Solr admin screen. I am using multicore with various cores. When I click a core in the admin page and then click INFO, the info that appears (class, cache info...) is always that of the same core (the last one in solrconfig.xml). I don't know if there's

DIH Solr1.4

2009-03-06 Thread Wesley Small
I am evaluating the DIH in Solr 1.4-DEV and am receiving a Null Pointer Exception when the import process begins. Here are the details: [LOG MESSAGE] 2009-03-06 11:06:04,635 ERROR [STDERR] (http-0.0.0.0-20080-Processor3) Mar 6, 2009 11:06:04 AM org.apache.solr.handler.dataimport.DataImporter

Re: DIH Solr1.4

2009-03-06 Thread Shalin Shekhar Mangar
On Fri, Mar 6, 2009 at 9:56 PM, Wesley Small wesley.sm...@mtvstaff.com wrote: I am evaluating the DIH in Solr 1.4-DEV and am receiving a Null Pointer Exception when the import process begins. Here are the details: Which nightly build are you using? You can see this in the INFO page on solr

Re: DIH Solr1.4

2009-03-06 Thread Wesley Small
I am using: Solr Implementation Version: 1.4-dev 750448 - smallwes - 2009-03-05 08:01:30 On 3/6/09 11:35 AM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: Which nightly build are you using? You can see this in the INFO page on solr admin.

Re: How can I configure different types in Solr?

2009-03-06 Thread Cheng Zhang
One Solr instance has only one doc type. So if you have many types, the first option is to use multiple Solr server instances. The second option is to use multiple cores. In this case, you have one Solr server instance, but in that instance you have more than one core. If you don't want to use

Re: Shard Query Problem

2009-03-06 Thread Anshul jain
Hi, here is the whole file, if it helps: <?xml version="1.0" encoding="UTF-8" ?> <!-- Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF

Re: How can I configure different types in Solr?

2009-03-06 Thread Walter Underwood
Or you can add a type field and filter on that. I do that with type:movie and type:people. --wunder On 3/6/09 9:10 AM, Cheng Zhang zhangyongji...@yahoo.com wrote: One Solr instance has only one doc type. So if you have many types, the first option is to use multiple Solr server instances. The
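
The type-field approach described above can be sketched as a filter query (fq) added next to the user's query. This is a minimal illustration, assuming a field literally named type as in the message; the helper name typed_search_params and the sample query text are hypothetical.

```python
from urllib.parse import urlencode


def typed_search_params(user_query, doc_type):
    """Build select parameters that restrict results to one document type."""
    return urlencode([
        ("q", user_query),            # what the user searched for
        ("fq", f"type:{doc_type}"),   # filter on the type field
    ])


# Search only documents tagged as movies.
print(typed_search_params("star wars", "movie"))
```

Keeping the type restriction in fq rather than in q leaves relevance scoring of the main query untouched.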

Re: Add jdbc entity to DataImportHandler in runtime

2009-03-06 Thread Rui Pereira
Hi again, isn't 'preImportDeleteQuery' a Solr 1.4 feature? I'm unable to import only the entities I want; the whole index is deleted. In all entities I have a field that is unique per entity ('alias'), and I configured the preImportQuery in all entities like this: alias: 'entity name'. Thanks in

query filter with the Key feature

2009-03-06 Thread Marc Sturlese
Hey there, I am trying to use filter query with the key feature and I am experiencing some problems. If I do this, no problem: <str name="facet.query">{!key=Fox}title:fox</str> But if I try to do this, I get wrong results: <str name="facet.query">{!key=Fox}title:big fox</str> I have also tried: <str

Index an Oracle DATE column with a Solr DateField

2009-03-06 Thread George
Hi, I have an Oracle DATE column that I want to index with a Solr DateField. The relevant part of my schema.xml looks like this: <fieldType name="date" class="solr.DateField" sortMissingLast="true" omitNorms="true"/> <field name="plazo" type="date" indexed="true" stored="true" default="NOW" multiValued="false"/> I use

Multiple queries in single request

2009-03-06 Thread Brian Knoth
Apologies up front if this is an often-asked newbie question, but I've searched the solrhandler docs and I can't find what I'm interested in. I'd like to be able to present multiple queries to SOLR, for example: [(+a +b)] [(+a +c)] [(+a +d)] And get back a multi-result set (one for each

Re: Search for a field whose value contains instead of begins with

2009-03-06 Thread SJ
The 'name' field is defined as string. What is the best analyzer to analyze the name field text (i.e. Apple Corporation or The Applemint Gimlet) into individual tokens? Also, should I use the stored or not-stored property? <field name="NAME" type="string" indexed="true" stored="true"/> Shalin Shekhar Mangar wrote:

Re: Search for a field whose value contains instead of begins with

2009-03-06 Thread Shalin Shekhar Mangar
On Sat, Mar 7, 2009 at 12:40 AM, SJ sjh...@yahoo.com wrote: The 'name' field is defined as string. What is the best analyzer to analyze the name field text (i.e. Apple Corporation or The Applemint Gimlet) into individual tokens? Also, should I use the stored or not-stored property? <field name="NAME"

Re: Add jdbc entity to DataImportHandler in runtime

2009-03-06 Thread Shalin Shekhar Mangar
On Fri, Mar 6, 2009 at 11:12 PM, Rui Pereira ruipereira...@gmail.com wrote: Hi again, isn't 'preImportDeleteQuery' a Solr 1.4 feature? I'm unable to import only the entities I want; the whole index is deleted. In all entities I have a field that is unique per entity ('alias'), and I configured the

Re: DIH Solr1.4

2009-03-06 Thread Shalin Shekhar Mangar
On Fri, Mar 6, 2009 at 10:07 PM, Wesley Small wesley.sm...@mtvstaff.com wrote: I am using: Solr Implementation Version: 1.4-dev 750448 - smallwes - 2009-03-05 08:01:30 Thanks Wesley. I guess you were using the debug mode or the dataimport.jsp to test your changes? This bug is triggered when

Re: query filter with the Key feature

2009-03-06 Thread Marc Sturlese
Thanks for your reply. I have already solved it... At the beginning the mistake was: <str name="facet.query">{!key=Fox}title:big fox</str> title:big fox is equivalent to title:big default_field:fox. I was confused about what was happening with the word fox. Now I get it! And: We need more details on
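
The pitfall described in the message above (a field prefix applying only to the first term) is commonly avoided in Lucene query syntax by grouping the terms in parentheses, as in title:(big fox), or by quoting them as a phrase when the words must be adjacent. A minimal sketch of building such a keyed facet query follows; the helper name keyed_facet_query is hypothetical, and the {!key=...} local parameter is taken from the thread.

```python
def keyed_facet_query(key, field, terms):
    """Build a facet.query value whose field prefix applies to every term."""
    grouped = " ".join(terms)
    # Doubled braces produce literal { and } around the local parameter.
    return f"{{!key={key}}}{field}:({grouped})"


# All terms are now searched in the title field, not the default field.
print(keyed_facet_query("Fox", "title", ["big", "fox"]))
```

How the grouped terms are combined (AND or OR) still depends on the default operator configured for the query parser.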

Re: Multiple queries in single request

2009-03-06 Thread Shalin Shekhar Mangar
On Sat, Mar 7, 2009 at 12:27 AM, Brian Knoth bkn...@myperfectgig.com wrote: Apologies up front if this is an often-asked newbie question, but I've searched the solrhandler docs and I can't find what I'm interested in. I'd like to be able to present multiple queries to SOLR, for example: [(+a

Re: commit / new searcher delay?

2009-03-06 Thread Shalin Shekhar Mangar
On Fri, Mar 6, 2009 at 8:47 AM, Steve Conover scono...@gmail.com wrote: That's exactly what I'm doing, but I'm explicitly replicating, and committing. Even under these circumstances, what could explain the delay after commit before the new index becomes available? How are you explicitly

RE: Multiple queries in single request

2009-03-06 Thread Brian Knoth
Thanks for your comment. Yes, it would be simple if it were just 3 queries, but I really have about 50-60 I'd like to consolidate into a single request. I'm simply getting a count of records for each query, and right now it takes about 40 ms for each request as a single query, but multiplied by about

Re: Multiple queries in single request

2009-03-06 Thread Erik Hatcher
On Mar 6, 2009, at 2:55 PM, Shalin Shekhar Mangar wrote: On Sat, Mar 7, 2009 at 12:27 AM, Brian Knoth bkn...@myperfectgig.com wrote: Apologies up front if this is an often-asked newbie question, but I've searched the solrhandler docs and I can't find what I'm interested in. I'd like to

Re: Multiple queries in single request

2009-03-06 Thread Erik Hatcher
If you're only after the count for those queries, use a standard request with a bunch of facet.query parameters. Erik On Mar 6, 2009, at 3:12 PM, Brian Knoth wrote: Thanks for your comment. Yes, it would be simple if it was just 3 queries, I really have about 50-60 I'd like to consolidate
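
The suggestion above can be sketched as a single request that carries one facet.query parameter per count wanted. This is a minimal illustration, not the poster's actual setup: the helper name count_only_params is hypothetical, and the match-all main query with rows=0 is an assumption made so that only the counts come back.

```python
from urllib.parse import urlencode


def count_only_params(queries):
    """Build select parameters that return one facet count per query."""
    params = [
        ("q", "*:*"),       # assumed match-all main query
        ("rows", "0"),      # no documents needed, only the counts
        ("facet", "true"),  # turn faceting on
    ]
    # Repeat facet.query once for every query to be counted.
    params.extend(("facet.query", q) for q in queries)
    return urlencode(params)


# The three example queries from the original question.
print(count_only_params(["+a +b", "+a +c", "+a +d"]))
```

The counts are then read from the facet section of the response, keyed by each query string.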

RE: Multiple queries in single request

2009-03-06 Thread Brian Knoth
Shalin, Erik: Faceting is exactly what I was looking for (and didn't even know it). This is giving me precisely what I wanted. Thanks for your help! brian -Original Message- From: Erik Hatcher [mailto:e...@ehatchersolutions.com] Sent: Friday, March 06, 2009 3:39 PM To:

Re: Multiple queries in single request

2009-03-06 Thread Ryan McKinley
No but you can always fire three requests. Writing your own handler which prints data in a custom format means that you can no longer use existing solr clients for java/ruby/python etc. That's not a fair characterization of at least the Ruby client. The NamedList (err, Hash in Ruby) is

Re: port of Nutch CommonGrams to Solr for help with slow phrase queries

2009-03-06 Thread Tom Burton-West
Hi Norberto, After working a bit on trying to port the Nutch CommonGrams code, I ran into lots of dependencies on Nutch and Hadoop. Would it be possible to get more information on how you use shingles (or code)? Are you creating shingles for all two word combinations or using a list of words?

Re: Search for a field whose value contains instead of begins with

2009-03-06 Thread Shyamsunder Reddy
Thanks. I have configured the field as text and search is working fine, but the results are not properly sorted. The NAME field holds some company names in NY. Here is my search query: http://localhost:8090/solr/select?fq=TYPE:ORG&q=NAME:new%20york&sort=NAME%20desc&rows=100&fl=NAME Here is my