Re: SOLR SpellChecker and German Umlauts

2009-07-01 Thread Kraus, Ralf | pixelhouse GmbH

Michael Ludwig wrote:

Kraus, Ralf | pixelhouse GmbH wrote:

When I search for ONE word with a German umlaut like
kräuterkeckse (the right word is kräuterkekse), the spellchecker
gives me two corrections:

Spellcheck for kr = kren
Spellcheck for uterkeksse = butterkekse

WHY does SOLR break this ONE word apart?


Hi Ralf,

please read the following threads to understand the issue. In short,
you need to specify your query in spellcheck.q as well.

THANKS! :-)


Greets -Ralf-
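
The advice in this message (pass the query in spellcheck.q as well) could look roughly like the request below. This is only a sketch: the host, port, and handler path are assumptions, and the parameter values are the ones quoted in this thread. Whether the umlaut arrives intact may also depend on the servlet container decoding the URL as UTF-8.

```python
from urllib.parse import urlencode

# Hypothetical base URL; adjust host, port, and handler path to your setup.
BASE_URL = "http://localhost:8983/solr/select"


def spellcheck_url(user_input, dictionary="jarowinkler"):
    """Build a query URL that sends the raw input in spellcheck.q as well as q."""
    params = [
        ("q", user_input),
        ("spellcheck", "true"),
        ("spellcheck.q", user_input),  # the raw word handed to the spellchecker
        ("spellcheck.dictionary", dictionary),
        ("spellcheck.count", "1"),
    ]
    # urlencode percent-encodes the umlaut as UTF-8 bytes.
    return BASE_URL + "?" + urlencode(params)


url = spellcheck_url("kräuterkeckse")
print(url)
```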


SOLR SpellChecker and German Umlauts

2009-06-30 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

I really need some help with the SOLR SpellChecker and German umlauts.
So far I am really satisfied with the JAROWINKLER algorithm.

Now my problem :-)

When I search for ONE word with a German umlaut like
kräuterkeckse (the right word is kräuterkekse), the spellchecker
gives me two corrections:

Spellcheck for kr = kren
Spellcheck for uterkeksse = butterkekse

WHY does SOLR break this ONE word apart?

My config is:

spellcheck = 'true';
spellcheck.dictionary = 'jarowinkler'
spellcheck.onlyMorePopular = 'true'
spellcheck.build = 'false'
spellcheck.count = 1

Greets -Ralf-


Re: Big Problem with special characters

2009-04-21 Thread Kraus, Ralf | pixelhouse GmbH

Otis Gospodnetic wrote:

Try debugQuery=true and see if the resulting query string makes sense.


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

Thx for the hint...

My problem was the WhitespaceTokenizer :-( After I changed back to
StandardTokenizer everything was fine!


Greets -Ralf-
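
Why the tokenizer change mattered can be illustrated outside Solr. The two functions below are rough stand-ins written for this note, not Lucene's WhitespaceTokenizer or StandardTokenizer: the first splits on whitespace only, the second keeps runs of word characters. The sample string is made up.

```python
import re


def whitespace_tokens(text):
    """Split on whitespace only; punctuation stays attached to the token."""
    return text.split()


def word_tokens(text):
    """Keep runs of word characters; a rough stand-in, not Lucene's StandardTokenizer."""
    return re.findall(r"\w+", text)


sample = "Tempo (speed)"
print(whitespace_tokens(sample))  # ['Tempo', '(speed)']
print(word_tokens(sample))        # ['Tempo', 'speed']
```

With whitespace-only splitting the indexed token is "(speed)", so a query for speed has nothing to match unless a later filter strips the parentheses.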



Big Problem with special characters

2009-04-20 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

first some details about my SOLR installation:

schema.xml

   <fieldType name="text_test" class="solr.TextField" positionIncrementGap="100">

     <analyzer type="index">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
       <filter class="solr.LengthFilterFactory" min="2" max="50" />
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="german" />
     </analyzer>

     <analyzer type="query">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="german" />
     </analyzer>
   </fieldType>

Search:
qf=name^2.0+name2^1.5+name3
wt=phps
rows=30
start=0
sort=score+desc
fl=*,score
q=speed
qt=dismax

When I have a string like "(speed)" in name3 or name2 and search for
speed, SOLR doesn't find it at all :-( If I search for "(speed)",
everything is fine!


Greets -Ralf-



Using Lucene MultiFieldQueryParser with SOLR

2009-04-17 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

I am looking for a way to use the Lucene MultiFieldQueryParser in my
SOLR installation.

Is there a chance to change the solrQueryParser?

In my old Lucene setup I used to combine many different types of
QueryParser in my query...


Or is there a chance to get MultiFieldQueryParser functions in SOLR?

Greets -Ralf-


Re: Using Lucene MultiFieldQueryParser with SOLR

2009-04-17 Thread Kraus, Ralf | pixelhouse GmbH

Marc Sturlese wrote:

I think there's no search handler that uses MultiFieldQueryParser in Solr. But
check DismaxRequestHandler, it will probably do the job. You can specify all the
fields you want to search in and it will build the query using boolean
queries. It also includes many more features:
http://wiki.apache.org/solr/DisMaxRequestHandler


Is there a chance to combine request handlers?
I need to use some additional normal boolean and integer queries!

Greets -Ralf-


Re: Using Lucene MultiFieldQueryParser with SOLR

2009-04-17 Thread Kraus, Ralf | pixelhouse GmbH

Marc Sturlese wrote:

I think there's no search handler that uses MultiFieldQueryParser in Solr. But
check DismaxRequestHandler, it will probably do the job. You can specify all the
fields you want to search in and it will build the query using boolean
queries. It also includes many more features:
http://wiki.apache.org/solr/DisMaxRequestHandler

THX A LOT !

You really made my day !

Greets -Ralf-


Re: Using Lucene MultiFieldQueryParser with SOLR

2009-04-17 Thread Kraus, Ralf | pixelhouse GmbH

Marc Sturlese wrote:

Well, dismax has a q.alt parameter where you can specify a query in Lucene
syntax. The query must be empty to use q.alt:
http://.../select?q=&q.alt=phone_number:1234567
This would search in the field phone_number independently of what fields you
have configured in the dismax handler.

Now I use the fq parameter in combination with q.alt... Runs fine
so far :-)

The fq parameter carries my additional query conditions :-)

Greets -Ralf-
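
The combination described in this message (dismax, q left out, q.alt as the fallback query, fq for the extra condition) might be assembled as below. A sketch only: the base URL is an assumption, and the filter value is simply the phone_number example quoted above.

```python
from urllib.parse import urlencode

# Hypothetical host and handler path.
BASE_URL = "http://localhost:8983/solr/select"


def dismax_filter_url(filter_query, rows=30):
    """Build a dismax request that relies on q.alt and narrows results with fq."""
    params = [
        ("qt", "dismax"),
        ("q.alt", "*:*"),      # fallback query, used because q is not sent
        ("fq", filter_query),  # Lucene-syntax filter applied to the result set
        ("rows", str(rows)),
    ]
    return BASE_URL + "?" + urlencode(params)


url = dismax_filter_url("phone_number:1234567")
print(url)
```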




Re: Using Lucene MultiFieldQueryParser with SOLR

2009-04-17 Thread Kraus, Ralf | pixelhouse GmbH

Marc Sturlese wrote:

The only problem I found with q.alt is that it doesn't allow highlighting (or
at least it didn't show it for me). If you find out how to do it let me
know.

I use highlighting only with the normal query!
My q.alt is *:*

But it's really sad that dismax doesn't support wildcards :-(

Greets -Ralf-


Re: Disable logging in SOLR

2009-04-15 Thread Kraus, Ralf | pixelhouse GmbH

Bill Au wrote:

Have you tried setting logging level to OFF from Solr's admin GUI:
http://wiki.apache.org/solr/SolrAdminGUI
  

Thx for the hint!

But after I restart my Tomcat it's all reset to the defaults? :-(

Greets -Ralf-


Re: Disable logging in SOLR

2009-04-15 Thread Kraus, Ralf | pixelhouse GmbH

Mark Miller wrote:

Kraus, Ralf | pixelhouse GmbH wrote:

Hi,

is there a way to disable all logging output in SOLR?
I mean the output text like:

INFO: [core_de] webapp=/solr path=/update params={wt=json} status=0 QTime=3736


greets -Ralf-

You probably do not want to totally disable logging in Solr. More
likely, you're looking to make Solr less chatty by not logging the INFO
level. Solr is a bit chatty by default, mostly, I think, because that
can be very useful and is often worth the likely very small
performance hit of all the extra logging. At the least, though, I think
you want to leave Severe/Error logging on in most cases, and possibly
WARN.


It's easy enough to change the logging levels, though. Solr 1.3 uses
java.util.logging and Solr 1.4 uses SLF4J defaulting to
java.util.logging.


So you can either change the system-level properties file in your JDK
folder, or you can use a param at startup:
-Djava.util.logging.config.file=/path/to/my/logging.properties

That's exactly the way I chose yesterday ;-)

Thx

Greets -Ralf-


Disable logging in SOLR

2009-04-14 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

is there a way to disable all logging output in SOLR?
I mean the output text like:

INFO: [core_de] webapp=/solr path=/update params={wt=json} status=0 QTime=3736


greets -Ralf-



Re: Combination of solr.xml and solrconfig.xml

2009-03-24 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

question ;-)

<!DOCTYPE config SYSTEM "http://java.sun.com/dtd/web-app_2_3.dtd" [
   <!ENTITY default_solrconfig SYSTEM "/var/lib/tomcat5.5/webapps/solr/default_solrconfig.xml">
]>

Is there a chance to set the home directory using a variable? For
example a Unix environment variable?


Greets -Ralf-

No chance ?

Greets -Ralf-


BOOSTing FIELDS in SCHEMA.XML

2009-03-23 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

is there a chance to set boosting directly in my schema.xml?

   <field name="Name1" type="text" indexed="true" stored="true" multivalued="true" omitNorms="false" boost="2" />
   <field name="Name2" type="text" indexed="true" stored="true" multivalued="true" omitNorms="false" boost="1.5f" />
   <field name="Name3" type="text" indexed="true" stored="true" multivalued="true" omitNorms="false" boost="0.5f" />

   <field name="Name4" type="text" indexed="true" stored="true" />

I want to do it directly in the index and not in my query!

Greets -Ralf-


Combination of solr.xml and solrconfig.xml

2009-03-19 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

I have a little question concerning the combination of solr.xml and
solrconfig.xml!
I am running a multicore configuration to keep separate index files
for multiple languages...


My structure is like this:

\solr (solr.xml)
\solr\core_de (solrconfig.xml for german profile)
\solr\core_uk (solrconfig.xml for english profile)

My problem is that there is a lot of redundant code in the two
solrconfig.xml files, and I wonder
if I can move this code to solr.xml and leave only the core-specific
lines in each solrconfig.xml?


Is there any chance to combine solrconfig.xml files? Or any other
suggestions?


Greets,

Ralf


Re: Combination of solr.xml and solrconfig.xml

2009-03-19 Thread Kraus, Ralf | pixelhouse GmbH

Giovanni De Stefano wrote:

Hello Ralf,

try to look here: http://wiki.apache.org/solr/CoreAdmin

It says: "You can also specify properties in solr.xml which can be used in
the solrconfig.xml and schema.xml files." I believe this might help.
  

thx !!

I tried this:

(My solr.xml:)

<solr persistent="true" sharedLib="lib">
 <config>
   <updateHandler class="solr.DirectUpdateHandler2" />
   etc. . . . .
 </config>

 <cores adminPath="/admin/cores">
   <core name="core_uk" instanceDir="core_uk">
     <config>
       <dataDir>//x/x</dataDir>
     </config>
   </core>
 </cores>
</solr>

Doesn't seem to work :-(

Greets -Ralf-


Re: Combination of solr.xml and solrconfig.xml

2009-03-19 Thread Kraus, Ralf | pixelhouse GmbH

Giovanni De Stefano wrote:

Hello Ralf,

I believe you want to do something like:

<solr persistent="true" sharedLib="lib">
  <cores adminPath="/admin/cores">
    <core name="core0" instanceDir="core0">
      <property name="dataDir" value="/data/core0" />
    </core>
    <core name="core1" instanceDir="core1" />
  </cores>
</solr>


Okay, I will try to explain it better :-)

- my two solrconfig.xml files are around 200 lines long
- ONLY one line differs between these files:
  <dataDir>/www/solr//x</dataDir>
- I want to move the other 199 lines to my solr.xml so I don't have
  these lines duplicated.


Greets -Ralf-



Re: Combination of solr.xml and solrconfig.xml

2009-03-19 Thread Kraus, Ralf | pixelhouse GmbH

Giovanni De Stefano wrote:

On the other hand, if your files have 199 lines in *common*, then yes, you
could/should move those 199 common lines into solr.xml.


YES! That's my case.

But how could I move these lines (solrconfig.xml)

<config>
 <dataDir>/www/solr//xx</dataDir>
 <updateHandler class="solr.DirectUpdateHandler2" />
 <abortOnConfigurationError>${solr.abortOnConfigurationError:true}</abortOnConfigurationError>

 <indexDefaults>
  <useCompoundFile>false</useCompoundFile>
  <mergeFactor>2</mergeFactor>
  <ramBufferSizeMB>32</ramBufferSizeMB>
  <maxMergeDocs>2147483647</maxMergeDocs>
  <maxFieldLength>1</maxFieldLength>
  <writeLockTimeout>1000</writeLockTimeout>
  <commitLockTimeout>1</commitLockTimeout>
  <lockType>single</lockType>
 </indexDefaults>

 <mainIndex>
  <useCompoundFile>false</useCompoundFile>
  <mergeFactor>2</mergeFactor>
  <ramBufferSizeMB>32</ramBufferSizeMB>
  <maxMergeDocs>2147483647</maxMergeDocs>
  <maxFieldLength>1</maxFieldLength>
  <unlockOnStartup>false</unlockOnStartup>
 </mainIndex>

etc.

to my solr.xml

???

Greets -Ralf-


Re: Combination of solr.xml and solrconfig.xml

2009-03-19 Thread Kraus, Ralf | pixelhouse GmbH

Erik Hatcher wrote:

Unfortunately Solr doesn't offer that kind of reusability (yet).

But you could just go old school and use XML entity includes to pull 
in a fragment of a file, and externalize the common pieces that way. 
Not as pretty, but oughta work.


thx a lot ! This sounds perfect... (yet)

Greets -Ralf-
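
The "old school" XML entity include suggested in this message could be generated per core along these lines. This is a hedged sketch, not a tested Solr setup: the function name, the entity name "common", and both paths are made up for illustration, and the shared fragment would have to be a well-formed external entity (elements only, no DOCTYPE of its own).

```python
def core_config(data_dir, shared_fragment):
    """Return a per-core solrconfig.xml that pulls shared settings in via an external entity."""
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        "<!DOCTYPE config [\n"
        f'  <!ENTITY common SYSTEM "{shared_fragment}">\n'
        "]>\n"
        "<config>\n"
        f"  <dataDir>{data_dir}</dataDir>\n"
        "  &common;\n"  # expands to the content of the shared fragment when parsed
        "</config>\n"
    )


# Hypothetical paths, one wrapper file per core.
for core in ("core_de", "core_uk"):
    print(core_config(f"/data/{core}", "/etc/solr/common_solrconfig.xml"))
```

Each core then keeps only its own dataDir line, and the roughly 199 shared lines live once in the fragment file.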



SPELLCHECK Problems

2009-02-11 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

My SOLRCONFIG.XML

   <requestHandler name="/spellCheckCK" class="solr.SearchHandler">
     <lst name="defaults">
     </lst>
     <arr name="last-components">
       <str>spellcheck</str>
     </arr>
   </requestHandler>

   <searchComponent name="spellcheck" class="solr.SpellCheckComponent">

     <str name="queryAnalyzerFieldType">textSpell</str>

     <lst name="spellchecker">
       <str name="name">default</str>
       <str name="field">spell</str>
       <str name="spellcheckIndexDir">./spellchecker1</str>
       <str name="buildOnOptimize">true</str>
     </lst>

     <lst name="spellchecker">
       <str name="name">jarowinkler</str>
       <str name="field">spell</str>
       <!-- Use a different Distance Measure -->
       <str name="distanceMeasure">org.apache.lucene.search.spell.JaroWinklerDistance</str>
       <str name="spellcheckIndexDir">./spellchecker2</str>
       <str name="buildOnOptimize">true</str>
     </lst>

     <lst name="spellchecker">
       <str name="classname">solr.FileBasedSpellChecker</str>
       <str name="name">file</str>
       <str name="sourceLocation">dictionary.txt</str>
       <str name="characterEncoding">UTF-8</str>
       <str name="spellcheckIndexDir">./spellcheckerFile</str>
       <str name="buildOnOptimize">true</str>
     </lst>

   </searchComponent>

My Schema.xml

<field name="RezeptNameSpellCheck" type="textSpell" indexed="true" stored="true" multiValued="true"/>

<copyField source="RezeptName" dest="RezeptNameSpellCheck" />

Search:
spellcheck=true
wt=phps
rows=30
start=0
sort=score+desc
spellcheck.build=true
spellcheck.extendedResults=false
spellcheck.count=1
q=sudeln
spellcheck.onlyMorePopular=true
spellcheck.dictionary=file

Now my problems :-)

If I use the file-based checker with spellcheck.dictionary=file I get very
bad suggestions :-( If I use default or jarowinkler I don't get any
suggestions at all :-(

What's the problem?

Greets,

Ralf


Re: SPELLCHECK Problems

2009-02-11 Thread Kraus, Ralf | pixelhouse GmbH

Grant Ingersoll wrote:

What does your textSpell FieldType look like?

Spelling is definitely something that needs tuning, so you might have
to play with some of the knobs like accuracy, etc.


As for JaroWinkler, and I suppose the default, your field is spell,
but based on your configuration, I gather you really want it to be
RezeptNameSpellCheck. I am guessing that if you point Luke at
those two spell checking indexes, you're going to find that they are
empty.



Hey! Thx a lot... That indeed was my problem :-)

Greets,

Ralf


Need help with DictionaryCompoundWordTokenFilterFactory

2009-02-06 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

Now I have run into another problem using the
solr.DictionaryCompoundWordTokenFilterFactory :-(
If I search for the German word Spargelcremesuppe, which contains
Spargel, Creme and Suppe, SOLR finds way too many results.
That's because SOLR finds EVERY entry with any one of the three words in
it :-(


Here is my schema.xml

   <fieldType name="text_text" class="solr.TextField" positionIncrementGap="100">

     <analyzer>
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.DictionaryCompoundWordTokenFilterFactory"
               dictionary="dictionary.txt"
               minWordSize="5"
               minSubwordSize="2"
               maxSubwordSize="15"
               onlyLongestMatch="true" />
       <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>
   </fieldType>

Any help?

Greets,

Ralf Kraus
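
The effect described in this message can be reproduced with a much simplified decompounder. The function below is a stand-in written for this note, not the Lucene filter itself: it scans each start position and keeps the longest dictionary word found there, using the same size limits as the schema above. The three-word dictionary is made up.

```python
def decompound(word, dictionary, min_subword=2, max_subword=15):
    """Return dictionary words found inside `word`, longest match per start position."""
    lowered = word.lower()
    found = []
    for start in range(len(lowered)):
        longest = None
        for length in range(min_subword, max_subword + 1):
            end = start + length
            if end > len(lowered):
                break
            candidate = lowered[start:end]
            if candidate in dictionary:
                longest = candidate  # lengths grow, so the last hit is the longest
        if longest is not None:
            found.append(longest)
    return found


dictionary = {"spargel", "creme", "suppe"}  # hypothetical dictionary.txt content
tokens = ["spargelcremesuppe"] + decompound("Spargelcremesuppe", dictionary)
print(tokens)  # ['spargelcremesuppe', 'spargel', 'creme', 'suppe']
```

Because the same analyzer runs at query time, the query itself becomes these four terms, and with OR semantics any document containing just one of them matches. One approach sometimes used, not confirmed in this thread, is to apply the compound filter only in the index analyzer.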


Re: Problem with setting solr.solr.home property

2009-02-03 Thread Kraus, Ralf | pixelhouse GmbH

Manupriya wrote:

Hi,

Till now I was working with the jetty server bundled with the SOLR
distribution. But I want to deploy solr.war to another jetty server. Here I
am facing some problem with solr/home. Whenever I start the jetty server, I

Try extracting solr.war and editing its web.xml!

Greets -Ralf-



Re: SOLR Problem with special chars

2009-01-21 Thread Kraus, Ralf | pixelhouse GmbH

Otis Gospodnetic wrote:

Ralf,

Can you paste the part of your schema.xml where you defined the relevant field?

Otis


Sure!

   <fieldType name="text_normal" class="solr.TextField" positionIncrementGap="100">

     <analyzer type="index">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>

     <analyzer type="query">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>

   </fieldType>


Greets


Re: SOLR Problem with special chars

2009-01-21 Thread Kraus, Ralf | pixelhouse GmbH

Otis Gospodnetic wrote:

Now it works:

   <fieldType name="text_new" class="solr.TextField" positionIncrementGap="100">

     <analyzer type="index">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
       <filter class="solr.WordDelimiterFilterFactory"
               generateWordParts="1"
               generateNumberParts="1"
               catenateWords="1"
               catenateNumbers="1"
               catenateAll="0"
               splitOnCaseChange="1"/>
       <filter class="solr.LengthFilterFactory" min="3" max="50" />
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
       <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt" />
     </analyzer>

     <analyzer type="query">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
       <filter class="solr.WordDelimiterFilterFactory"
               generateWordParts="1"
               generateNumberParts="1"
               catenateWords="0"
               catenateNumbers="0"
               catenateAll="0"
               splitOnCaseChange="1"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
       <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt" />
     </analyzer>
   </fieldType>


Greets,

Ralf


SOLR Problem with special chars

2009-01-20 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

The string in my DB is like "Kellogs, Corn- (Flakes)"

When I search for Kellogs or Corn or Flakes I can't find the entry
in my index :-(

Is there something I am missing?

Greets,

--
Ralf Kraus


Re: Need help with SolrIndexSearcher CoreContainer

2008-11-17 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

After 5-6 searches I run out of memory :-(

Examples:

   String homeDir  = "/var/lib/tomcat5.5/webapps/solr";
   File configFile = new File( homeDir, "solr.xml" );
   CoreContainer myCoreContainer = new CoreContainer( homeDir, configFile );

   mySolrCore = myCoreContainer.getCore("core_de");
   RefCounted<SolrIndexSearcher> temp_search = mySolrCore.getSearcher();

   SolrIndexSearcher searcher = temp_search.get();

Has no one ever worked directly with CoreContainer and SolrIndexSearcher?

Greets -Ralf-


Need help with SolrIndexSearcher CoreContainer

2008-11-12 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

I want to use a SolrIndexSearcher for some special searches in my app...
I start up my Solr with two cores in it (core_de & core_uk).

But when I try this, my Solr server generates a completely new core
instead of using the existing one...

After 5-6 searches I run out of memory :-(

Examples:

   String homeDir  = "/var/lib/tomcat5.5/webapps/solr";
   File configFile = new File( homeDir, "solr.xml" );
   CoreContainer myCoreContainer = new CoreContainer( homeDir, configFile );

   mySolrCore = myCoreContainer.getCore("core_de");
   RefCounted<SolrIndexSearcher> temp_search = mySolrCore.getSearcher();

   SolrIndexSearcher searcher = temp_search.get();

Greets -Ralf-


Re: Problem with SolrJ and SolrIndexSearcher

2008-11-11 Thread Kraus, Ralf | pixelhouse GmbH

Otis Gospodnetic wrote:

Hi Ralf,

Your subject mentions SolrJ, but I don't see any SolrJ code in your email.
It looks as if you are messing with Solr's own code.


Thx for your help...

Greets


Problem with SolrJ and SolrIndexSearcher

2008-11-11 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

I want to use a SolrIndexSearcher for some special searches in my app...
I start up my Solr with two cores in it (core_de & core_uk).

But when I try this, my Solr server generates a completely new core
instead of using the existing one...

After 5-6 searches I run out of memory :-(

Examples:

   String homeDir  = "/var/lib/tomcat5.5/webapps/solr";
   File configFile = new File( homeDir, "solr.xml" );

   CoreContainer myCoreContainer = new CoreContainer( homeDir, configFile );

   mySolrCore = myCoreContainer.getCore("core_de");

   RefCounted<SolrIndexSearcher> temp_search = mySolrCore.getSearcher();

   SolrIndexSearcher searcher = temp_search.get();

Greets -Ralf-


Re: Performance Lucene / Solr

2008-10-31 Thread Kraus, Ralf | pixelhouse GmbH

Hey,

I think it will have the disadvantage of being a lot slower though...

How were you handling things with Lucene? You must have used Java 
then? If you even want to get close to that performance I think you 
need to use non http embedded solr.

I am using this:

- I wrote a Java JSP file to get an EmbeddedSolrServer
- Now I call this JSP file from my PHP script and the JSP makes my
  search request to SOLR
- after that I generate a CSV file from the JSP and read it from PHP

It's the same way I did it with the LUCENE engine I used before.
But now the performance is 10% of the previous LUCENE speed :-(

Greets -Ralf-


Re: Performance Lucene / Solr

2008-10-31 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

Thx a lot for the tip!

But when I try it I get

 HTTP/1.1 500 null java.lang.NullPointerException at org.apache.solr.common.util.StrUtils.splitSmart(StrUtils.java:37)


My request is:
INFO: [core_de] webapp=/solr path=/select/ params={wt=phps&query=Tools&records=30&start_record=0} status=500 QTime=1


Exception in SOLR:
SCHWERWIEGEND: java.lang.NullPointerException
   at org.apache.solr.common.util.StrUtils.splitSmart(StrUtils.java:37)
   at org.apache.solr.search.OldLuceneQParser.parse(LuceneQParserPlugin.java:104)
   at org.apache.solr.search.QParser.getQuery(QParser.java:88)
   at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:82)
   at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:148)
   at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
   at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204)
   at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
   at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
   at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:202)
   at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
   at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
   at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
   at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
   at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
   at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
   at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148)
   at org.apache.coyote.http11.Http11AprProcessor.process(Http11AprProcessor.java:833)
   at org.apache.coyote.http11.Http11AprProtocol$Http11ConnectionHandler.process(Http11AprProtocol.java:639)
   at org.apache.tomcat.util.net.AprEndpoint$Worker.run(AprEndpoint.java:1285)
   at java.lang.Thread.run(Thread.java:595)

Greets -Ralf-


Re: Performance Lucene / Solr

2008-10-31 Thread Kraus, Ralf | pixelhouse GmbH

Hi,


And rows instead of records, and start instead of start_record.  :)

Erik



You're my man :-)

Greets -Ralf-
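
A request using the parameter names Solr expects (q, rows, start) rather than query, records, and start_record might be built as follows. A sketch only: the host and port are assumptions, while the core name and the phps writer are the ones mentioned in this thread.

```python
from urllib.parse import urlencode

# Hypothetical host and port; core_de is the core named in this thread.
BASE_URL = "http://localhost:8983/solr/core_de/select"


def search_url(text, rows=30, start=0):
    """Build a select URL with Solr's standard paging parameters."""
    params = [
        ("q", text),            # not "query"
        ("rows", str(rows)),    # not "records"
        ("start", str(start)),  # not "start_record"
        ("wt", "phps"),
    ]
    return BASE_URL + "?" + urlencode(params)


url = search_url("Tools")
print(url)
```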


Re: Performance Lucene / Solr

2008-10-31 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

  <queryResponseWriter name="phps" class="org.apache.solr.request.PHPSerializedResponseWriter"/>


Then in PHP, hit Solr directly like this:

$response = unserialize(file_get_contents($url));

Where $url is something like http://localhost:8983/solr/select?q=*:*

Now SOLR is 2 times faster than LUCENE = Strike!
Hello weekend, I am coming :-)

Greets -Ralf-


Performance Lucene / Solr

2008-10-30 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

I have been evaluating Solr 1.3 for about 3 weeks now... My goal is to migrate
from Lucene to Solr because of the much better plugins and search functions.

Right now I am stress testing the performance by sending 2500 search
requests via the JSON protocol from my PHPUnit test case.

All search requests are different, so caching doesn't help me.
Right now our old Lucene JSPs are about 4 times faster than my SOLR
solution :-(


Any chance I can tweak my solrconfig.xml?

Greets -Ralf-


Re: Performance Lucene / Solr

2008-10-30 Thread Kraus, Ralf | pixelhouse GmbH

Mark Miller wrote:

Right now I am stress testing the performance by sending 2500 search
requests via the JSON protocol from my PHPUnit test case.
All search requests are different, so caching doesn't help me.
Right now our old Lucene JSPs are about 4 times faster than my SOLR
solution :-(


Well, with Lucene it is an API call in the same JVM in the same web
application. With Solr, you are making HTTP calls across the network,
serializing requests and de-serializing responses. So the comparison
is not exactly apples to apples.

Look at what Solr offers -- replication, caching, plugins etc. Will you
really need to go over 2500 requests per second? Do you need to be
concerned with performance above and beyond that? Will it be easier to
scale out to more boxes?


And have you tried solrj without http?


Right now I am using these PHP classes to send and receive my requests:

- Apache_Solr_Service.php
- Responce.php

It has the advantage that I don't need to write extra JSP or Java code...

Greets -Ralf-


Re: Performance Lucene / Solr

2008-10-30 Thread Kraus, Ralf | pixelhouse GmbH

Mark Miller wrote:

Kraus, Ralf | pixelhouse GmbH wrote:

Mark Miller wrote:

Right now I am stress testing the performance by sending 2500 search
requests via the JSON protocol from my PHPUnit test case.
All search requests are different, so caching doesn't help me.
Right now our old Lucene JSPs are about 4 times faster than my SOLR
solution :-(


Well, with Lucene it is an API call in the same JVM in the same web
application. With Solr, you are making HTTP calls across the network,
serializing requests and de-serializing responses. So the comparison
is not exactly apples to apples.

Look at what Solr offers -- replication, caching, plugins etc. Will you
really need to go over 2500 requests per second? Do you need to be
concerned with performance above and beyond that? Will it be easier to
scale out to more boxes?


And have you tried solrj without http?


Right now I am using these PHP classes to send and receive my requests:

- Apache_Solr_Service.php
- Responce.php

It has the advantage that I don't need to write extra JSP or Java
code...


Greets -Ralf-

I think it will have the disadvantage of being a lot slower though...

How were you handling things with Lucene? You must have used Java
then? If you even want to get close to that performance I think you
need to use non-HTTP embedded Solr.


Okay okay :-) I am writing a new JSP handler for my requests as we speak
:-) I really hope performance will be better than with {wt=javabin}


Greets -Ralf-


Re: Performance Lucene / Solr

2008-10-30 Thread Kraus, Ralf | pixelhouse GmbH

Grant Ingersoll wrote:

Have you gone through
http://wiki.apache.org/solr/SolrPerformanceFactors ?


Can you explain a little more about your testcase, maybe even share
code?  I only know a little PHP, but maybe someone else who is better
versed might spot something.

I just wrote my JSP script to use solrj instead;
performance is much, much better now!

Greets -Ralf-


Override Similarity

2008-10-28 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

is there a chance to override the Similarity in my search?
In fact I want all results to return 1 (from the idf method).

Greets -Ralf-


Sorting TEXT Field problems :-(

2008-10-28 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

Query:
{wt=json&rows=30&json.nl=map&start=0&sort=RezeptName+asc}

Result:

Doppeldecker
Eiersalat
Curry - Eiersalat
Eiersalat

Why does "Curry - Eiersalat" come after Doppeldecker???
RezeptName is a normal text field defined as:

   <fieldType name="text" class="solr.TextField" positionIncrementGap="100">

     <analyzer type="index">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>

     <analyzer type="query">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>

   </fieldType>



Greets -Ralf-


Re: Sorting TEXT Field problems :-(

2008-10-28 Thread Kraus, Ralf | pixelhouse GmbH

Thomas Traeger wrote:

Kraus, Ralf | pixelhouse GmbH wrote:

Hello,

Query:
{wt=json&rows=30&json.nl=map&start=0&sort=RezeptName+asc}

Result:

Doppeldecker
Eiersalat
Curry - Eiersalat
Eiersalat

Why does "Curry - Eiersalat" come after Doppeldecker???
RezeptName is a normal text field defined as:

   <fieldType name="text" class="solr.TextField" positionIncrementGap="100">

     <analyzer type="index">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>

     <analyzer type="query">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       <filter class="solr.SnowballPorterFilterFactory" language="German" />
     </analyzer>

   </fieldType>


Greets -Ralf-


Hi,

normally you would define at least one special field for sorting: 
http://wiki.apache.org/solr/CommonQueryParameters#head-9f40612b42721ed9e1979a4a80d68f4f8524e9b4 



you have to use a single valued, indexed but untokenized field (or use 
a tokenizer that produces only one token)


You might also look at field alphaOnlySort in the example schema.

Tom


Runs perfectly now! Thx

Greets -Ralf-
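
What a single-token sort field buys you can be shown with a plain sort key. The function below is a stand-in for the idea behind a field like alphaOnlySort (whole value as one token, trimmed, lowercased, non-letters removed); the exact filter chain of that example field is an assumption here, and this pattern would also strip German umlauts.

```python
import re


def alpha_only_sort_key(value):
    """One key per value: trim, lowercase, then drop everything except a-z."""
    return re.sub(r"[^a-z]", "", value.strip().lower())


names = ["Doppeldecker", "Eiersalat", "Curry - Eiersalat", "Eiersalat"]
print(sorted(names, key=alpha_only_sort_key))
# ['Curry - Eiersalat', 'Doppeldecker', 'Eiersalat', 'Eiersalat']
```

Sorting on the tokenized text field instead has no single value per document to compare, which is why the order in the question looked arbitrary.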



Re: Need help with DictionaryCompoundWordTokenFilterFactory

2008-10-14 Thread Kraus, Ralf | pixelhouse GmbH

Steven A Rowe wrote:

Oops, variable-name != attribute-name.

Thanks Hoss.

Steve

So is it dictFile or dictionary???

Greets -Ralf-





Re: Need help with DictionaryCompoundWordTokenFilterFactory

2008-10-14 Thread Kraus, Ralf | pixelhouse GmbH

Chris Hostetter wrote:

: :    dictFile=de_DR.xml
:
: according to the code the param name is dictionary not dictFile.


PS: the dictionary file shouldn't be an XML file, it should look just
like a stopwords file (one word per line)



-Hoss

  

thx !

It finally runs perfect !

Greets -Ralf-


Re: Need help with DictionaryCompoundWordTokenFilterFactory

2008-10-13 Thread Kraus, Ralf | pixelhouse GmbH

Thx a lot !

I downloaded a dictionary called de_DR.xml and put it into my conf
directory...

Then I changed my schema.xml to:

<filter class="solr.DictionaryCompoundWordTokenFilterFactory"
        dictFile="./conf/de_DR.xml"
        minWordSize="5"
        minSubwordSize="2"
        maxSubwordSize="15"
        onlyLongestMatch="true" />

but Solr can't find the dictionary file :-(

SCHWERWIEGEND: Could not start SOLR. Check solr/home property
java.lang.RuntimeException: Error opening null
    at org.apache.solr.core.SolrResourceLoader.openResource(SolrResourceLoader.java:191)
    at org.apache.solr.core.SolrResourceLoader.getLines(SolrResourceLoader.java:237)
    at org.apache.solr.core.SolrResourceLoader.getLines(SolrResourceLoader.java:213)
    at org.apache.solr.analysis.DictionaryCompoundWordTokenFilterFactory.inform(DictionaryCompoundWordTokenFilterFactory.java:49)
    at org.apache.solr.core.SolrResourceLoader.inform(SolrResourceLoader.java:322)

Any hints ?

Greets -Ralf-


Re: Multi-language solr1.3 what would you reckon?

2008-10-13 Thread Kraus, Ralf | pixelhouse GmbH

Hannes Carl Meyer wrote:

Hi,

is it really necessary to put it all into one index? You could also use the
Solr MultiCore/MultipleIndexes feature and separate by language.


Is there a good webpage with info about the multi-index feature?
I know http://wiki.apache.org/solr/MultipleIndexes but there is not
enough info there :-(



Greets -Ralf-



Need help with DictionaryCompoundWordTokenFilterFactory

2008-10-10 Thread Kraus, Ralf | pixelhouse GmbH

Hi,

I am trying to solve the typical German "Donaudampfschiff" problem by
using the DictionaryCompoundWordTokenFilter...
Can anyone show me how to configure my schema.xml to use the
DictionaryCompoundWordTokenFilterFactory???


Greets -Ralf-


Re: Need help with more than just one index

2008-10-09 Thread Kraus, Ralf | pixelhouse GmbH

Hannes Carl Meyer wrote:

Hi Ralf,

since Solr 1.3 it is possible to run multiple cores (indexes) inside a
single deployment, please check:

http://wiki.apache.org/solr/MultipleIndexes

It is not only about separating indexes but also about having different
configurations, index and query analyzers etc.
  

Thx a lot Hannes !

Greets -Ralf-
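
With several cores in one deployment, switching context comes down to choosing the core in the request path. A minimal sketch, assuming two cores whose names ("books", "magazines") and base URL are made up for illustration:

```python
from urllib.parse import urlencode

# Hypothetical deployment root and core names.
SOLR_ROOT = "http://localhost:8983/solr"
CORE_FOR_CONTEXT = {"books": "books", "magazines": "magazines"}


def select_url(context, text):
    """Route a search to the core that holds the index for this context."""
    core = CORE_FOR_CONTEXT[context]
    return f"{SOLR_ROOT}/{core}/select?" + urlencode({"q": text})


print(select_url("books", "solr"))
print(select_url("magazines", "solr"))
```

Each core keeps its own schema.xml and solrconfig.xml, so the two contexts can be analyzed differently.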


Need help with more than just one index

2008-10-09 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

I am wondering if there is a chance to use Solr with more than just one
index? Is there a chance I could switch to another index if
I want to search in another context?

For example:

searching for books: use index1 (schema1.xml)
searching for magazines: use index2 (schema2.xml)

Please help me...

--
Greets -Ralf-



Re: feeding documents through API

2008-10-09 Thread Kraus, Ralf | pixelhouse GmbH

Cam Bazz wrote:

Hello,

I have been looking at the API documentation but I don't know where to
look in order to feed documents through the API without using XML files.

Any ideas?



Look at the SolrIndexWriter class...

http://lucene.apache.org/solr/api/org/apache/solr/update/SolrIndexWriter.html

--
Greets -Ralf-



Problems with WordDelimiterFilterFactory

2008-10-08 Thread Kraus, Ralf | pixelhouse GmbH

Hello,

I am playing around with WordDelimiterFilterFactory and have run into some
problems...


When I search for RedElectronicLed I get these entries: "Red -
Electronic - Led" = Perfect!


But I don't get the "RedElectronicLed" entries in my DB :-(

Is there a chance I will get both? "Red - Electronic - Led" and
"redelectronicled"?



--
Greetings
Ralf Kraus
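
What "getting both" would mean at the token level can be sketched as follows. This is a simplified stand-in written for this note, not the WordDelimiterFilter itself: it splits on case changes and, when asked, also emits the rejoined word, which is the kind of output options such as generateWordParts and catenateWords are meant to produce.

```python
import re


def split_on_case_change(token):
    """Split a token into parts at lower-to-upper case changes and digit runs."""
    return re.findall(r"[A-Z][a-z]*|[a-z]+|\d+", token)


def word_delimiter(token, catenate_words=True):
    """Return lowercased word parts, plus the catenated form if requested."""
    parts = split_on_case_change(token)
    tokens = [part.lower() for part in parts]
    if catenate_words and len(parts) > 1:
        tokens.append("".join(parts).lower())
    return tokens


print(word_delimiter("RedElectronicLed"))
# ['red', 'electronic', 'led', 'redelectronicled']
```

With the catenated form in the index alongside the parts, a document containing "RedElectronicLed" and one containing "Red - Electronic - Led" can both be reached.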