Re: Search for FirstName with first Char uppercase followed by * not giving result; getting result with all lowercase and *

2011-01-30 Thread Ahmet Arslan
 
:When i try george* I get results. Whereas George* fetches no results.
 

Wildcard queries are not analyzed by QueryParser.






Re: solr - uima error

2011-01-30 Thread Tommaso Teofili
The issue is in the README.txt: the right class to use is
UIMAUpdateRequestProcessorFactory. Please change that in your solrconfig.
Regards,
Tommaso

2011/1/30 Darx Oman darxo...@gmail.com

 Hi
 I already copied apache-solr-uima-4.0-SNAPSHOT.jar to solr\lib
 but what is causing the error is this:

 <updateRequestProcessorChain name="uima">
   <processor class="org.apache.solr.uima.processor.UIMAProcessorFactory"/>
   <processor class="solr.LogUpdateProcessorFactory" />
   <processor class="solr.RunUpdateProcessorFactory" />
 </updateRequestProcessorChain>

 this class is not found:

 <processor class="org.apache.solr.uima.processor.UIMAProcessorFactory"/>



Re: Http Connection is hanging while deleteByQuery

2011-01-30 Thread shan2812

Hello Ravi Kiran,

I am not quite sure if it has anything to do with the number of records.
Anyway, there are around 400K docs in Solr, and my deleteByQuery() tries to
delete about 400 docs.

The thing is, it does seem to delete the records, but control never
seems to come back to the caller. It just hangs. Also note that I did not
wait until it threw an exception (let's say an hour), as I felt it
should complete in a few minutes anyway.

I tried to let it run hoping to see an exception, but for 3-4 hours nothing
really happened.


-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Http-Connection-is-hanging-while-deleteByQuery-tp2367405p2383477.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Search for FirstName with first Char uppercase followed by * not giving result; getting result with all lowercase and *

2011-01-30 Thread Mark Fletcher
Hi Ahmet,

Thanks for the reply.

I had attached the Analysis report of the query George*

It is split into the terms "George*" and "George" by the
WordDelimiterFilterFactory, and the LowerCaseFilterFactory converts them to
"george*" and "george".

When I indexed "George", it was also analyzed and finally stored as "george".
Then why is it that I don't get a match, as per the analysis report I
attached to my previous mail?

Or am I missing something basic here?

Many Thanks.

M

SolrJ (Trunk) Invalid version or the data in not in 'javabin' format

2011-01-30 Thread Em

Hello list,

I built an application that uses SolrJ to communicate with Solr.

What did I do?
Well, I deleted all the solrj-lib stuff from my application's
Webcontent-directory and inserted the solrj-lib from the freshly compiled
solr 4.0 - trunk. 
However, when trying to query Solr 4.0 it shows me a 
RuntimeException:
Invalid version or the data in not in 'javabin' format

Maybe it's interesting: Solr was empty. I do not know what it would look like
if there were something in the index.

Any ideas how to fix that problem?

Thank you


Re: Search for FirstName with first Char uppercase followed by * not giving result; getting result with all lowercase and *

2011-01-30 Thread Savvas-Andreas Moysidis
Hi Mark,

When I indexed "George", it was also analyzed and finally stored as "george".
Then why is it that I don't get a match, as per the analysis report I had

Your indexed term is george, but you search for George*, which does not go
through the same analysis process as it did when it was indexed. So, since
the terms you are searching for are not lowercased, you are trying to find
something that starts with George (capital G), which doesn't exist in your
index.

If you are not hitting Solr directly, maybe you can lowercase your input
text before feeding it to Solr?
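
The client-side lowercasing suggested above could look roughly like the following Java sketch. It assumes the field is named FirstName (taken from the thread subject) and that the index-time analyzer lowercases terms. The class and method names are hypothetical and are not part of SolrJ. Only the user's term is lowercased, not a whole query string, because lowercasing operators such as AND or OR would change their meaning.

```java
import java.util.Locale;

public class WildcardLowercase {

    // Lowercase a single user-supplied term (which may end in '*') so that it
    // can match terms that were lowercased at index time.
    public static String lowercaseTerm(String userTerm) {
        return userTerm.trim().toLowerCase(Locale.ROOT);
    }

    // Build a wildcard query against a field. The field name is passed in by
    // the caller; "FirstName" in main is an assumed field name.
    public static String buildQuery(String field, String userTerm) {
        return field + ":" + lowercaseTerm(userTerm);
    }

    public static void main(String[] args) {
        // prints FirstName:george*
        System.out.println(buildQuery("FirstName", "George*"));
    }
}
```

The resulting string would then be sent as the q parameter of the search request.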




RE: match count per shard and across shards

2011-01-30 Thread Upayavira
Brilliant. So obvious.

Upayavira

On Sat, 29 Jan 2011 18:53 -0700, Bob Sandiford
bob.sandif...@sirsidynix.com wrote:
 Or - you could add a standard field to each shard, populate with a
 distinct value for each shard, and facet on that field.  Then look at the
 facet counts of the value that corresponds to a shard, and, hey-presto,
 you're done...
 
 Bob Sandiford | Lead Software Engineer | SirsiDynix
 P: 800.288.8020 X6943 | bob.sandif...@sirsidynix.com
 www.sirsidynix.com 
 
 
  -Original Message-
  From: Upayavira [mailto:u...@odoko.co.uk]
  Sent: Saturday, January 29, 2011 6:52 PM
  To: solr-user@lucene.apache.org
  Subject: Re: match count per shard and across shards
  
  To my knowledge, the distributed search functionality is intended to be
  transparent, thus no details deriving from it are exposed (e.g. what
  docs come from which shard), so, no, I don't believe it to be possible.
  
  The only way I know right now that you could achieve it is by two (sets
  of) queries. One would be a distributed search across all shards, and
  the other would be a single hit to every shard. To fake such a facet,
  this second set of queries would only need to ask for totals, so it
  could use a rows=0.
  
  Otherwise you'd have to enhance the distributed indexing code to expose
  some of this information in its response.
  
  Upayavira
  
  On Sat, 29 Jan 2011 03:48 -0800, csj christiansonnejen...@gmail.com
  wrote:
  
   Hi,
  
   Is it possible to construct a Solr query that will return the total number
   of hits across all shards, and at the same time get the number of hits per
   shard?
  
   I was thinking along the lines of a faceted search, but I'm not deep enough
   into Solr capabilities and query parameters to figure it out.
  
   Regards,
  
   Christian Sonne Jensen
  
   --
   View this message in context:
   http://lucene.472066.n3.nabble.com/match-count-per-shard-and-across-shards-tp2369627p2369627.html
   Sent from the Solr - User mailing list archive at Nabble.com.
  
  ---
  Enterprise Search Consultant at Sourcesense UK,
  Making Sense of Open Source
  
 
 
--- 
Enterprise Search Consultant at Sourcesense UK, 
Making Sense of Open Source



Re: match count per shard and across shards

2011-01-30 Thread csj

Hi,

FYI:
I figured out a solution myself. I wanted a smart way to get the shard
count for a query (how many documents were found in each shard). The smart
part consisted of getting all these counts in just one query using faceting.
I was asking whether Solr could help with this, e.g. whether it had some
built-in shard info that I could facet on out of the box. But apparently it
does not.

But in my situation I can use my knowledge of how the shards are organised.
They are organised chronologically, and I happen to know the date
boundaries. 

My solution is simply to facet on those boundaries. This way I can query
once, include all known shards, and get each shard's count for the search.
This may have a performance penalty, but for now it is at least a simple way.
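
As a rough illustration of the approach described above, the following Java sketch builds one facet.query value per date range between known shard boundaries. The field name timestamp, the boundary values, and the class and method names are all hypothetical; facet=true and facet.query are standard Solr request parameters. Both ends of each range are inclusive here, so a document dated exactly on a boundary would be counted in two ranges.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ShardDateFacets {

    // Turn an ordered list of boundaries into one range query per shard.
    // N boundaries yield N - 1 ranges.
    public static List<String> facetQueries(String dateField, List<String> boundaries) {
        List<String> queries = new ArrayList<String>();
        for (int i = 0; i + 1 < boundaries.size(); i++) {
            queries.add(dateField + ":[" + boundaries.get(i)
                    + " TO " + boundaries.get(i + 1) + "]");
        }
        return queries;
    }

    public static void main(String[] args) {
        // Assumed boundaries: one shard per year.
        List<String> boundaries = Arrays.asList(
                "2009-01-01T00:00:00Z",
                "2010-01-01T00:00:00Z",
                "2011-01-01T00:00:00Z");
        System.out.println("facet=true");
        for (String q : facetQueries("timestamp", boundaries)) {
            System.out.println("facet.query=" + q);
        }
    }
}
```

Each facet.query then comes back with its own count in the facet section of the response, next to the overall hit count for the search.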

Christian Sonne Jensen


Re: SolrJ (Trunk) Invalid version or the data in not in 'javabin' format

2011-01-30 Thread Koji Sekiguchi

(11/01/31 3:11), Em wrote:


Hello list,

I build an application that uses SolrJ to communicate with Solr.

What did I do?
Well, I deleted all the solrj-lib stuff from my application's
Webcontent-directory and inserted the solrj-lib from the freshly compiled
solr 4.0 - trunk.
However, when trying to query Solr 4.0 it shows me a
RuntimeException:
Invalid version or the data in not in 'javabin' format


I've just committed a small change so that you can see the version difference
(I'll open the JIRA issue later because JIRA is under maintenance right now):

Index: solr/src/common/org/apache/solr/common/util/JavaBinCodec.java
===================================================================
--- solr/src/common/org/apache/solr/common/util/JavaBinCodec.java   (revision 1065245)
+++ solr/src/common/org/apache/solr/common/util/JavaBinCodec.java   (working copy)
@@ -96,7 +96,8 @@
     FastInputStream dis = FastInputStream.wrap(is);
     version = dis.readByte();
     if (version != VERSION) {
-      throw new RuntimeException("Invalid version or the data in not in 'javabin' format");
+      throw new RuntimeException("Invalid version (expected " + VERSION +
+          ", but " + version + ") or the data in not in 'javabin' format");
     }
     return readVal(dis);
   }

Can you try the latest trunk and see the version difference?
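
To illustrate what the more detailed message reports, here is a minimal standalone Java sketch (not Solr's code) that inspects the first byte of a response body in the same spirit as the check in the patch. The class name, the method name and the expected version value 2 are assumptions for illustration only. One case that is easy to recognize is a first byte of 60, the ASCII code for '<', which suggests the body is XML or HTML rather than javabin.

```java
import java.nio.charset.StandardCharsets;

public class VersionByteCheck {

    // Compare the first byte of a response body with the expected version byte.
    // expectedVersion is supplied by the caller; the value used in main is an assumption.
    public static String describeFirstByte(byte[] body, byte expectedVersion) {
        if (body.length == 0) {
            return "empty response";
        }
        byte version = body[0];
        if (version == expectedVersion) {
            return "version matches: " + version;
        }
        String message = "Invalid version (expected " + expectedVersion
                + ", but " + version + ")";
        if (version == '<') {
            message += "; the body starts with '<', so it may be XML or HTML";
        }
        return message;
    }

    public static void main(String[] args) {
        byte[] xmlBody = "<response/>".getBytes(StandardCharsets.US_ASCII);
        System.out.println(describeFirstByte(xmlBody, (byte) 2));
    }
}
```

If the two numbers differ in some other way, a mismatch between the SolrJ jar and the server version is one possible explanation worth checking.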

Koji
--
http://www.rondhuit.com/en/


Re: solr - uima error

2011-01-30 Thread Darx Oman
Thanks, Tommaso,
Solr now starts OK.

But what I really don't understand is how my text gets annotated by UIMA,
because when I started indexing, none of the new UIMA fields got any data.

Could you please point me to some articles that explain how this
works?