Re: DirectUpdateHandler and DirectUpdateHandler2

2007-12-07 Thread Chris Hostetter

: Are there any features in DUH2 that are not in DUH? I mean semantic
: differences, not just speedups.

To the best of my knowledge no, but i am not an expert on DUH or DUH2.


-Hoss



Re: Access to SolrIndexSearcher in UpdateProcessor

2007-12-07 Thread Chris Hostetter
: 
: I'm trying to implement a custom UpdateProcessor which requires access to
: SolrIndexSearcher. However, I'm constantly running into "Too many open
: files" exception. I'm confused about which is the correct way to get access
: to SolrIndexSearcher in UpdateProcessor:
: 
: 1) req.getSearcher()

that's the way i would expect should work best ... it will give you a 
consistent view of the index for the duration of the request.  i can't 
imagine why it would give you too many open files ... it should be a 
reused searcher shared by all concurrent requests, so accessing it 
shouldn't open more file handles.  Searchers are refrence counted ... 
maybe there is an edge case where they aren't getting freed up properly.

can you post a small example of the kind of UpdateProcessor you have (it 
doesn't need to do everything you are doing, just something that 
demonstrates the too many files bug)



-Hoss



Re: Field seperater for highlighting multi-value fields

2007-12-07 Thread Chris Hostetter

: The default field separator seems to be a '.' when highlighting
: multi-value fields. Can this be overridden in 1.2 to another character?

Default field seperator where?  in the response?  can you give a specific 
example of what you are talking about?




-Hoss



Re: SolrJ and MoreLikeThis / Spellchecker

2007-12-07 Thread Mike Klaas
Trunk has much more data in its spellcheck response, see http:// 
wiki.apache.org/solr/SpellCheckerRequestHandler .


-Mike

On 7-Dec-07, at 3:46 PM, Matthew Runo wrote:

I'll give it a try. Seems like the Spellcheck response type is  
pretty basic.


Thanks!

Matthew Runo
Software Developer
702.943.7833

On Dec 7, 2007, at 11:23 AM, Ryan McKinley wrote:


Matthew Runo wrote:

Hello!
Please forgive my newbie question about SolrJ, but I was unable  
to find my answer in the SOLRJ source code or the wiki (I'll add  
it if someone helps).
Would anyone be so kind as to provide a quick example of using  
the Spellcheck handler and the MoreLikeThis handler with SOLR-J?


With spellcheck, you will be in new water (I think)... you can get  
the response as a NamedList, but there is not anything that puts  
that into user friendly functions.


SolrQuery q = new SolrQuery( "foo" );
q.setQueryType( "spelling" );
q.set( "anyparam", value );
QueryResponse rsp = solr.query( q )

NamedList nl = rsp.getResponse();

you will have to pick stuff out of the NamedList manually.  If you  
want to contribute a SpellCheckRequest/Response that would be great!


For MLT, the standard QueryRequest should work.  in 1.3-dev, both  
standard and dismax support mlt queries.  Perhaps we should add  
getters and setters to SolrQuery so you don't have to call:

q.set( MoreLikeThisParams.MLT, true );


ryan







Re: SolrJ and MoreLikeThis / Spellchecker

2007-12-07 Thread Matthew Runo
I'll give it a try. Seems like the Spellcheck response type is pretty  
basic.


Thanks!

Matthew Runo
Software Developer
702.943.7833

On Dec 7, 2007, at 11:23 AM, Ryan McKinley wrote:


Matthew Runo wrote:

Hello!
Please forgive my newbie question about SolrJ, but I was unable to  
find my answer in the SOLRJ source code or the wiki (I'll add it if  
someone helps).
Would anyone be so kind as to provide a quick example of using the  
Spellcheck handler and the MoreLikeThis handler with SOLR-J?


With spellcheck, you will be in new water (I think)... you can get  
the response as a NamedList, but there is not anything that puts  
that into user friendly functions.


SolrQuery q = new SolrQuery( "foo" );
q.setQueryType( "spelling" );
q.set( "anyparam", value );
QueryResponse rsp = solr.query( q )

NamedList nl = rsp.getResponse();

you will have to pick stuff out of the NamedList manually.  If you  
want to contribute a SpellCheckRequest/Response that would be great!


For MLT, the standard QueryRequest should work.  in 1.3-dev, both  
standard and dismax support mlt queries.  Perhaps we should add  
getters and setters to SolrQuery so you don't have to call:

q.set( MoreLikeThisParams.MLT, true );


ryan





Re: spellchecker and multi-core index replication

2007-12-07 Thread Ryan McKinley


OG: Yes, I think that makes sense - distribute everything for a given core, not 
just its index.  And the spellchecker could then also have its data dir (and 
only index/ underneath really) and be replicated in the same fashion.

Right?


Yes, that was my thought.  If an arbitrary directory could be 
distributed, then you could have


 /path/to/dist/index/...
 /path/to/dist/spelling-index/...
 /path/to/dist/foo

and that would all get put into a snapshot.  This would also let you put 
multiple cores within a single distribution:


 /path/to/dist/core0/index/...
 /path/to/dist/core0/spelling-index/...
 /path/to/dist/core0/foo
 /path/to/dist/core1/index/...
 /path/to/dist/core1/spelling-index/...
 /path/to/dist/core1/foo

ryan


Re: SolrJ and MoreLikeThis / Spellchecker

2007-12-07 Thread Ryan McKinley

Matthew Runo wrote:

Hello!

Please forgive my newbie question about SolrJ, but I was unable to find 
my answer in the SOLRJ source code or the wiki (I'll add it if someone 
helps).


Would anyone be so kind as to provide a quick example of using the 
Spellcheck handler and the MoreLikeThis handler with SOLR-J?




With spellcheck, you will be in new water (I think)... you can get the 
response as a NamedList, but there is not anything that puts that into 
user friendly functions.


SolrQuery q = new SolrQuery( "foo" );
q.setQueryType( "spelling" );
q.set( "anyparam", value );
QueryResponse rsp = solr.query( q )

NamedList nl = rsp.getResponse();

you will have to pick stuff out of the NamedList manually.  If you want 
to contribute a SpellCheckRequest/Response that would be great!


For MLT, the standard QueryRequest should work.  in 1.3-dev, both 
standard and dismax support mlt queries.  Perhaps we should add getters 
and setters to SolrQuery so you don't have to call:

q.set( MoreLikeThisParams.MLT, true );


ryan


display tokens

2007-12-07 Thread s d
How can I retrieve the "analyzed tokens" (e.g. the stemmed values) of a
specific field?


SolrJ and MoreLikeThis / Spellchecker

2007-12-07 Thread Matthew Runo

Hello!

Please forgive my newbie question about SolrJ, but I was unable to  
find my answer in the SOLRJ source code or the wiki (I'll add it if  
someone helps).


Would anyone be so kind as to provide a quick example of using the  
Spellcheck handler and the MoreLikeThis handler with SOLR-J?


The response format for it is so different, I'm not quite sure that my  
normal way of looping through the result docs would work - there are  
no docs in the XML (spellchecker included, as a sample).




0
22

runing
false

Running



Thanks!

Matthew Runo
Software Developer
702.943.7833



Re: spellchecker and multi-core index replication

2007-12-07 Thread Otis Gospodnetic
Right, that makes sense and it's really what I just said in my reply to Ryan's 
email.  But currently only data/index* is being replicated, unless I'm 
misreading this part of snapshooter:

# take a snapshot using hard links into temporary location
# then move it into place atomically
if [[ "${OS}" == "SunOS" || "${OS}" == "Darwin" ]]
then
  orig_dir=$(pwd)
  mkdir ${temp}
  cd ${data_dir}/index   # here here
  find . -print|cpio -pdlmu ${temp} 1>/dev/null 2>&1
  cd ${orig_dir}
else
  cp -lr ${data_dir}/index ${temp}  # here here
fi
mv ${temp} ${name}


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Yonik Seeley <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, December 7, 2007 1:16:19 PM
Subject: Re: spellchecker and multi-core index replication

On Dec 7, 2007 12:53 PM, Otis Gospodnetic <[EMAIL PROTECTED]>
 wrote:
> Hi,
>
> I wrote the original spellchecker request handler a while back and a
 lot of people seem to be using the spellchecker stuff, fixing and
 improving it! :)
> But recently I realized I never wrote anything that takes care of the
 spellchecker index replication.  Looking at snapshooter, it's
 completely unaware of the spellchecker index -- or *any* other index in there
 -- it explicitly looks for data/index.

I originally came up with data/index with the idea of taking snapshots
of the whole data directory...

/solr/data
/solr/data/index  (the main index dir)
/solr/data/foo
/solr/snapshots  (snapshots of the complete data directory)

-Yonik





Re: spellchecker and multi-core index replication

2007-12-07 Thread Otis Gospodnetic
Hi,

- Original Message 
From: Ryan McKinley <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, December 7, 2007 1:13:02 PM
Subject: Re: spellchecker and multi-core index replication


> 
> 2) with multi-core stuff getting in, is Bill Au going to be adding
 support for multiple indices anyway?
> 

As is, you can use all existing scripts for each index.  You just need 
to run snapwhatever for each index.


OG: Are you referring to the -d param?  For example, snapshooter -d?  In case 
of spellchecker index, I believe the index lives under the main 'data' 
directory and I think that is the problem.  I think -d would work if each 
index/core had its own separate data directory, for example:
solr/main/data/index-- main index
solr/spell/data/index   -- spellchecker index

This is much like the core entries in multicore.xml now, no?


Perhaps if distribution could be handled for a directory that is not 
necessarily the index (maybe a folder with stuff + index*) then you 
could have:


  
  


Then if /path/to/data was distributed, it could have:

  /path/to/data/core0/index/...
  /path/to/data/core0/boost.xml (see SOLR-418)
  /path/to/data/core1/index/...
  ...


OG: Yes, I think that makes sense - distribute everything for a given core, not 
just its index.  And the spellchecker could then also have its data dir (and 
only index/ underneath really) and be replicated in the same fashion.

Right?

Otis





Re: spellchecker and multi-core index replication

2007-12-07 Thread Ryan McKinley




2) with multi-core stuff getting in, is Bill Au going to be adding support for 
multiple indices anyway?



As is, you can use all existing scripts for each index.  You just need 
to run snapwhatever for each index.


Perhaps if distribution could be handled for a directory that is not 
necessarily the index (maybe a folder with stuff + index*) then you 
could have:



 
 


Then if /path/to/data was distributed, it could have:

 /path/to/data/core0/index/...
 /path/to/data/core0/boost.xml (see SOLR-418)
 /path/to/data/core1/index/...
 ...

ryan


Re: spellchecker and multi-core index replication

2007-12-07 Thread Yonik Seeley
On Dec 7, 2007 12:53 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
> Hi,
>
> I wrote the original spellchecker request handler a while back and a lot of 
> people seem to be using the spellchecker stuff, fixing and improving it! :)
> But recently I realized I never wrote anything that takes care of the 
> spellchecker index replication.  Looking at snapshooter, it's completely 
> unaware of the spellchecker index -- or *any* other index in there -- it 
> explicitly looks for data/index.

I originally came up with data/index with the idea of taking snapshots
of the whole data directory...

/solr/data
/solr/data/index  (the main index dir)
/solr/data/foo
/solr/snapshots  (snapshots of the complete data directory)

-Yonik


spellchecker and multi-core index replication

2007-12-07 Thread Otis Gospodnetic
Hi,

I wrote the original spellchecker request handler a while back and a lot of 
people seem to be using the spellchecker stuff, fixing and improving it! :)
But recently I realized I never wrote anything that takes care of the 
spellchecker index replication.  Looking at snapshooter, it's completely 
unaware of the spellchecker index -- or *any* other index in there -- it 
explicitly looks for data/index.

Questions:

1) has anyone written something to handle spellchecker index replication?

2) with multi-core stuff getting in, is Bill Au going to be adding support for 
multiple indices anyway?

Thanks,
Otis

--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch




Re: Searches starting with wildcard

2007-12-07 Thread Otis Gospodnetic
Uh, this one really is a FAQ.  Reverse your words before indexing is one 
approach.  Using N-grams is another.  Allowing the leading wildcard in the 
query parser is a third one.  The details of each must be in the ML archives...

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Suhas M <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Thursday, December 6, 2007 8:54:47 AM
Subject: Searches starting with wildcard


Hi All, 

I want to perform a search for documents that contain the word Ending
 with
given KEYWORD. 

Ex: Say document contains  "RenjiMathewThomas" 

Now I want to perform a search for all the documents that contain the
 word
starting with "Renji" 

Lucene does not allow searches starting with wildcard, is there any
workaround to achieve this?

Thanks, Suhas M.

-- 
View this message in context:
 http://www.nabble.com/Searches-starting-with-wildcard-tf4956210.html#a14192824
Sent from the Solr - User mailing list archive at Nabble.com.






Re: Facet question

2007-12-07 Thread Yonik Seeley
On Dec 7, 2007 11:34 AM, Steve Barr <[EMAIL PROTECTED]> wrote:
> I'm new to Solr and getting an unexpected result.  Basically I have a
> field "location" and one document with location values "Europe",
> "France", and "United States".  When I try to do a faceted query on
> the location field, "France" returns 0 results, but the other two
> locations are found.
[...]
> http://localhost:8080/solr/select?q=France&rows=0&facet=true&facet.limit=-1&facet.field=location

This is querying for "France" in the default search field (defined in
the schema).
If you want to show documents with only France in the location field,
change q to location:France
If you want the facets for all documents, try changing q to *:*

-Yonik


Facet question

2007-12-07 Thread Steve Barr
I'm new to Solr and getting an unexpected result.  Basically I have a
field "location" and one document with location values "Europe",
"France", and "United States".  When I try to do a faceted query on
the location field, "France" returns 0 results, but the other two
locations are found.

Schema.xml:
...
   
...

Add document:
...
Europe
United States
France
...

Querying for "France" shows no results:

http://localhost:8080/solr/select?q=France&rows=0&facet=true&facet.limit=-1&facet.field=location



0
10

true
France
-1
location
0







0
0
0





But querying for "Europe" or "United States" (United%20States) does
find a match, for ex:

http://localhost:8080/solr/select?q=Europe&rows=0&facet=true&facet.limit=-1&facet.field=location



0
1

true
Europe
-1
location
0







1
1
1





Thanks for any help!

Steve


solr on websphere.. is it possible?

2007-12-07 Thread Sean Laval

Has anyone managed to run solr 1.2 on WebSphere 6.1. It ought to be possible so 
far as I can tell but currently I just get Error 404: SRVE0190E: File not 
found: {0} when I try and launch the admin page.
 
Hope someone can help.
 
Thanks,
Sean
_
Telly addicts unite!
http://www.searchgamesbox.com/tvtown.shtml

Update Rich Documents client question

2007-12-07 Thread Ricardo Borillo
Hi,

I have follow http://wiki.apache.org/solr/UpdateRichDocuments to start
working with rich document parsing and it's great. Thank's Eric for
it!!!
Now, i want to integrate this feature in my Java application, and
here's my question:

Solrj client can be used to post to
http://localhost:8080/solr/update/rich or i must do it by myself?
I see CURL command, but there's some Java client example?

Thank's in advance

-- 
Salut,

Ricardo Borillo Domenech
http://xml-utils.com


Re: How do I search in all fields without index by solr

2007-12-07 Thread Laxmilal Menaria
Ok..thanks. will try

On 12/7/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
>
> Laxmilal Menaria wrote:
> > I have tried that :
> >
> > ?q=laxmilal&qt=dismax&fl=FriendID,Title,Address,PhoneNo,Comments
> > ?q=video&qt=dismax&qf=FriendID,Title,Address,PhoneNo,Comments
> >
>
> fl is for returning fields, you need 'qf'
>
>
> http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=%28DisMax%29#head-af452050ee272a1c88e2ff89dc0012049e69e180
>
>


-- 
Thanks,
Laxmilal menaria

http://www.chambal.com/
http://www.minalyzer.com/
http://www.bucketexplorer.com/


Re: How do I search in all fields without index by solr

2007-12-07 Thread Ryan McKinley

Laxmilal Menaria wrote:

I have tried that :

?q=laxmilal&qt=dismax&fl=FriendID,Title,Address,PhoneNo,Comments
?q=video&qt=dismax&qf=FriendID,Title,Address,PhoneNo,Comments



fl is for returning fields, you need 'qf'

http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=%28DisMax%29#head-af452050ee272a1c88e2ff89dc0012049e69e180



RE: How do I search in all fields without index by solr

2007-12-07 Thread SDIS M. Beauchamp
You have to read the example solrconfig.xml bundled with a fresh install of solr

You'll find everything about dismax request handler


-Message d'origine-
De : Laxmilal Menaria [mailto:[EMAIL PROTECTED] 
Envoyé : vendredi 7 décembre 2007 09:12
À : solr-user@lucene.apache.org
Objet : Re: How do I search in all fields without index by solr

I have tried that :

?q=laxmilal&qt=dismax&fl=FriendID,Title,Address,PhoneNo,Comments
?q=video&qt=dismax&qf=FriendID,Title,Address,PhoneNo,Comments

But both are not return search results, is any configuration in config for that 
? my configuration is :





LM

On 12/7/07, SDIS M. Beauchamp <[EMAIL PROTECTED]> wrote:
>
> You can also use the dismaxrequesthandler to search across multiple 
> field
>
>
>
> -Message d'origine-
> De : Laxmilal Menaria [mailto:[EMAIL PROTECTED] Envoyé : vendredi 
> 7 décembre 2007 08:25 À : solr-user@lucene.apache.org Objet : Re: How 
> do I search in all fields without index by solr
>
> Ok, thanks.. have tried it, It working.
>
> But if I use it and may be XXX or YYY value is too long, I think many 
> server dont support long urls so it may give us problem. So is there 
> any configuration in config file for future.
>
> LM
>
> On 12/7/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> >
> > You should be able to search any field:
> > ?q=field1:XXX field2:YYY
> >
> > You can register fieldTypes directory to an analyzer using:
> >   > positionIncrementGap="100">
> > > class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
> >  
> >
> > ryan
> >
> >
> > Laxmilal Menaria wrote:
> > > thanks for fast reply, I have dump my index in solr data folder 
> > > and able
> > to
> > > search in single field only, but want to search in all fields. 
> > > also how
> > can
> > > I configure StandradAnalyzer in solr config xml.
> > >
> > > LM
> > >
> > > On 12/7/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> > >> solr should be able to read any lucene index -- even if it did 
> > >> not create it.  The hitch is that you need to make sure the 
> > >> analyzers and fieldTypes match what is in your index otherwise it 
> > >> is unlikely for the result to be what you expect.
> > >>
> > >> To get solr to use your manually created index files, just dump 
> > >> them in the data/index directory
> > >>
> > >> ryan
> > >>
> > >>
> > >> Laxmilal Menaria wrote:
> > >>> I don't want to use solr for indexing database, I want to use 
> > >>> solr for searching on existing index created by me with using my 
> > >>> sample
> > >> application.
> > >>> LM
> > >>>
> > >>> On 12/7/07, Venkatraman S <[EMAIL PROTECTED]> wrote:
> >  On Dec 7, 2007 10:17 AM, Laxmilal Menaria 
> >  <[EMAIL PROTECTED]>
> > wrote:
> > 
> > > Hello everyone,
> > >
> > > I have created a simple java application which indexes 
> > > database
> > >> tables,
> > > now
> > > I want to configure the solr on my created index. My index has 
> > > 5
> > >> fields,
> > > FriendID, Title, Address, PhoneNo and Comments.
> > >
> >  Why you want to use solr for  indexing databases??? !!!
> >  rtfm!
> > 
> >  --
> >  Venkat
> >  Blog @ http://blizzardzblogs.blogspot.com
> > 
> > >>>
> > >>>
> > >>
> > >
> > >
> >
> >
>
>
> --
> Thanks,
> Laxmilal menaria
>
> http://www.chambal.com/
> http://www.minalyzer.com/
> http://www.bucketexplorer.com/
>
>


--
Thanks,
Laxmilal menaria

http://www.chambal.com/
http://www.minalyzer.com/
http://www.bucketexplorer.com/



Re: How do I search in all fields without index by solr

2007-12-07 Thread Laxmilal Menaria
I have tried that :

?q=laxmilal&qt=dismax&fl=FriendID,Title,Address,PhoneNo,Comments
?q=video&qt=dismax&qf=FriendID,Title,Address,PhoneNo,Comments

But both are not return search results, is any configuration in config for
that ? my configuration is :





LM

On 12/7/07, SDIS M. Beauchamp <[EMAIL PROTECTED]> wrote:
>
> You can also use the dismaxrequesthandler to search across multiple field
>
>
>
> -Message d'origine-
> De : Laxmilal Menaria [mailto:[EMAIL PROTECTED]
> Envoyé : vendredi 7 décembre 2007 08:25
> À : solr-user@lucene.apache.org
> Objet : Re: How do I search in all fields without index by solr
>
> Ok, thanks.. have tried it, It working.
>
> But if I use it and may be XXX or YYY value is too long, I think many
> server dont support long urls so it may give us problem. So is there any
> configuration in config file for future.
>
> LM
>
> On 12/7/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> >
> > You should be able to search any field:
> > ?q=field1:XXX field2:YYY
> >
> > You can register fieldTypes directory to an analyzer using:
> >   > positionIncrementGap="100">
> > > class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
> >  
> >
> > ryan
> >
> >
> > Laxmilal Menaria wrote:
> > > thanks for fast reply, I have dump my index in solr data folder and
> > > able
> > to
> > > search in single field only, but want to search in all fields. also
> > > how
> > can
> > > I configure StandradAnalyzer in solr config xml.
> > >
> > > LM
> > >
> > > On 12/7/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> > >> solr should be able to read any lucene index -- even if it did not
> > >> create it.  The hitch is that you need to make sure the analyzers
> > >> and fieldTypes match what is in your index otherwise it is unlikely
> > >> for the result to be what you expect.
> > >>
> > >> To get solr to use your manually created index files, just dump
> > >> them in the data/index directory
> > >>
> > >> ryan
> > >>
> > >>
> > >> Laxmilal Menaria wrote:
> > >>> I don't want to use solr for indexing database, I want to use solr
> > >>> for searching on existing index created by me with using my sample
> > >> application.
> > >>> LM
> > >>>
> > >>> On 12/7/07, Venkatraman S <[EMAIL PROTECTED]> wrote:
> >  On Dec 7, 2007 10:17 AM, Laxmilal Menaria <[EMAIL PROTECTED]>
> > wrote:
> > 
> > > Hello everyone,
> > >
> > > I have created a simple java application which indexes database
> > >> tables,
> > > now
> > > I want to configure the solr on my created index. My index has 5
> > >> fields,
> > > FriendID, Title, Address, PhoneNo and Comments.
> > >
> >  Why you want to use solr for  indexing databases??? !!!
> >  rtfm!
> > 
> >  --
> >  Venkat
> >  Blog @ http://blizzardzblogs.blogspot.com
> > 
> > >>>
> > >>>
> > >>
> > >
> > >
> >
> >
>
>
> --
> Thanks,
> Laxmilal menaria
>
> http://www.chambal.com/
> http://www.minalyzer.com/
> http://www.bucketexplorer.com/
>
>


-- 
Thanks,
Laxmilal menaria

http://www.chambal.com/
http://www.minalyzer.com/
http://www.bucketexplorer.com/