Re: Git repo

2012-02-20 Thread Igor MILOVANOVIC
http://git.apache.org/ On Sun, Feb 19, 2012 at 7:50 PM, Mark Diggory mdigg...@atmire.com wrote: Is there a git repo location that mirrors apache svn repos for solr? Cheers, Mark -- [image: @mire Inc.] *Mark Diggory *(Schedule a Meeting

Re: Development inside or outside of Solr?

2012-02-20 Thread bing
I have looked into the TikaCLI with -language option, and learned that Tika can output only the language metadata. It cannot help me to solve my problem though, as my main concern is whether to change Solr or not. Thank you all the same. -- View this message in context:

Solr logging

2012-02-20 Thread ola nowak
Hi, I want to set my Solr to use log4j and to write log messages into separate file instead of writing all on standard output. How can I do it? Which jars should I add? Where should I put log4j.xml file? Regards, Alex

Re: Solr logging

2012-02-20 Thread darul
I get similar questions in the past :) http://lucene.472066.n3.nabble.com/Jetty-logging-td3476715.html#a3483146 wish it will help you. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760173.html Sent from the Solr - User mailing list archive at

Re: Solr logging

2012-02-20 Thread ola nowak
Thanks a lot. I've added (and deleted) those libraries and now I don't get this messages to stdout :) I see that log4j is running and it can't find its config file. I wish I could add this to the solr.war. Is this possible? I want to avoid setting paramemeters in glassfish. Regards, Alex On

processing of merged tokens

2012-02-20 Thread Carlos Gonzalez-Cadenas
Hello all, For our search system we'd like to be able to process merged tokens, i.e. when a user enters a query like hotelsin barcelona, we'd like to know that the user means hotels in barcelona. At some point in the past we implemented this kind of functionality with shingles (using

Re: Solr logging

2012-02-20 Thread darul
Yes, you can update your .war archive by adding/removing expected jars. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760285.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr logging

2012-02-20 Thread ola nowak
I've already done that. What I'm more interested is if I can add log4j.xml to war and where to put to make it works On Mon, Feb 20, 2012 at 10:49 AM, darul daru...@gmail.com wrote: Yes, you can update your .war archive by adding/removing expected jars. -- View this message in context:

Re: Solr logging

2012-02-20 Thread darul
Hmm, I did not try to achieve this but interested if you find a way... After I believe than having log4j config file outside war archive is a better solution, if you may need to update its content for example. -- View this message in context:

Re: Solr logging

2012-02-20 Thread ola nowak
Yep. I suppose it is. But I have several applications installed on glassfish and I want each one of them to write into separate file. And Your solution with this jvm option was redirecting all messages from all apps to one file. Does anyone knows how to accomplish that? On Mon, Feb 20, 2012 at

Re: Solr logging

2012-02-20 Thread darul
This case explained here: http://stackoverflow.com/questions/762918/how-to-configure-multiple-log4j-for-different-wars-in-a-single-ear http://techcrawler.wordpress.com/ -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-logging-tp3760171p3760352.html Sent from the Solr

Re: Payload and exact search - 2

2012-02-20 Thread leonardo2
Ok, it works!! Thanks you very much. Leonardo -- View this message in context: http://lucene.472066.n3.nabble.com/Payload-and-exact-search-2-tp3750355p3760477.html Sent from the Solr - User mailing list archive at Nabble.com.

solr and tika

2012-02-20 Thread alessio crisantemi
Hi all, In a new installation of sOlr (1.4) I configured Tika for indexing rich documents. So, I commit my files and I can find it after indexing with an http query * http://localhost:8983/solr/select?q=attr_content:parola*; (for search the word 'parola') and I find the committed text. but if I

Re: Development inside or outside of Solr?

2012-02-20 Thread François Schiettecatte
You could take a look at this: http://www.let.rug.nl/vannoord/TextCat/ Will probably require some work to integrate/implement through François On Feb 20, 2012, at 3:37 AM, bing wrote: I have looked into the TikaCLI with -language option, and learned that Tika can output only the

Re: Solr logging

2012-02-20 Thread François Schiettecatte
Ola Here is what I have for this: ## # # Log4J configuration for SOLR # # http://wiki.apache.org/solr/SolrLogging # # # 1) Download LOG4J: # http://logging.apache.org/log4j/1.2/ #

Problem with SolrCloud + Zookeeper + DataImportHandler

2012-02-20 Thread Agnieszka Kukałowicz
Hi All, I've recently downloaded latest solr trunk to configure solrcloud with zookeeper using standard configuration from wiki: http://wiki.apache.org/solr/SolrCloud. The problem occurred when I tried to configure DataImportHandler in solrconfig.xml: requestHandler name=/dataimport

Re: custom scoring

2012-02-20 Thread Carlos Gonzalez-Cadenas
Hello all: We've done some tests with Em's approach of putting a BooleanQuery in front of our user query, that means: BooleanQuery must (DismaxQuery) should (FunctionQuery) The FunctionQuery obtains the SOLR IR score by means of a QueryValueSource, then does the SQRT of this value, and

Re: custom scoring

2012-02-20 Thread Em
Carlos, nice to hear that the approach helped you! Could you show us how your query-request looks like after reworking? Regards, Em Am 20.02.2012 13:30, schrieb Carlos Gonzalez-Cadenas: Hello all: We've done some tests with Em's approach of putting a BooleanQuery in front of our user

Re: Development inside or outside of Solr?

2012-02-20 Thread Erick Erickson
Either is possible. For the first, you would write a custom update processor that handled the dual Tika call... For the second, consider writing a SolrJ program that just does it all on the client. Just download Tika from the apache project (or tease out all the jars from the Solr distro) and

Re: custom scoring

2012-02-20 Thread Carlos Gonzalez-Cadenas
Yeah Em, it helped a lot :) Here it is (for the user query hoteles): *+(stopword_shortened_phrase:hoteles | stopword_phrase:hoteles | wildcard_stopword_shortened_phrase:hoteles | wildcard_stopword_phrase:hoteles) * *product(pow(query((stopword_shortened_phrase:hoteles | stopword_phrase:hoteles

How to check for inactive cores in a solr multicore setup?

2012-02-20 Thread Nasima Banu
Hello, I am trying to figure out a way to detect inactive cores in a multicore setup. How is that possible? I queried the STATUS of a core through the CoreAdminHandler. Could anyone please tell me what the 'current' field means?? Eg : http://localhost:8080/solr/admin/cores?action=STATUScore=2

RE: customizing standard tokenizer

2012-02-20 Thread Torsten Krah
Thx, will use the custom tokenizer. Its less error prone than the workarounds mentioned. smime.p7s Description: S/MIME cryptographic signature

Re: DataImportHandler running out of memory

2012-02-20 Thread v_shan
DIH still running out of memory for me, with Full Import on a database of size 1.5 GB. Solr version: 3_5_0 Note that I have already added batchSize=-1 but getting same error. Sharing my DIH config below. dataConfig dataSource

Re: custom scoring

2012-02-20 Thread Em
Could you please provide me the original request (the HTTP-request)? I am a little bit confused to what query_score refers. As far as I can see it isn't a magic-value. Kind regards, Em Am 20.02.2012 14:05, schrieb Carlos Gonzalez-Cadenas: Yeah Em, it helped a lot :) Here it is (for the user

Re: custom scoring

2012-02-20 Thread Carlos Gonzalez-Cadenas
Hi Em: The HTTP request is not gonna help you a lot because we use a custom QParser (that builds the query that I've pasted before). In any case, here it is: http://localhost:8080/solr/core0/select?shards=…(shards

postCommit confusion?

2012-02-20 Thread Esad Mumdzic
in a solr master slave replication, if I register postCommit listener on a slave, which index reader should I get if I do: @Override public final void postCommit() { final RefCountedSolrIndexSearcher refC = core

Is Sphinx better suited to me, or should I look at Solr?

2012-02-20 Thread Spadez
I am creating what is effectively a search engine. Content is collected via spiders at then is inserted into my database and becomes searchable and filterable. I invision there being around 90K records to be searched at any one time. The content is blog posts and forum posts so we are basically

Re: custom scoring

2012-02-20 Thread Em
Hi Carlos, query_score is a field that is indexed and stored with every document. Thanks for clarifying that, now the whole query-string makes more sense to me. Did you check whether query() - without product() and pow() - is also much slower than a normal query? I guess, if the

How to index a facetfield by searching words matching from another Textfield

2012-02-20 Thread Xavier
Hi everyone, I'm a new Solr User but i used to work on Endeca. There is a modul called TextTagger with Endeca that is auto indexing values in a facetfield (multivalued) when he find words (from a given wordslist) into an other TextField from that document. I didn't see any subjects or any ways

Re: Is Sphinx better suited to me, or should I look at Solr?

2012-02-20 Thread Em
Hi James, I can not speak for Sphinx, since I never used it. However, from reading your requirements there is nothing that fears Solr. Although Sphinx is written in C++, running Solr on top of a HotSpot JVM gives you high performance. Furthermore the HotSpot JVM is optimizing your code at

Re: How to index a facetfield by searching words matching from another Textfield

2012-02-20 Thread Em
Hi Xavier, sounds like a job for KeepWordFilter! From the javadocs: A TokenFilter that only keeps tokens with text contained in the required words. This filter behaves like the inverse of StopFilter. However, you have to provide the wordslist as a .txt-file. By using copyFields and the

lucene operators interfearing in edismax

2012-02-20 Thread jmlucjav
Hi, I am using edismax with end user entered strings. One search was not finding what appeared to be the best match. The search was: Sage Creek Organics - Enchanted If I remove the -, the doc I want is found as best score. Turns out (I think) the - is the culprit as the best match has

Re: lucene operators interfearing in edismax

2012-02-20 Thread Yonik Seeley
This should be fixed in trunk by LUCENE-2566 QueryParser: Unary operators +,-,! will not be treated as operators if they are followed by whitespace. -Yonik lucidimagination.com On Mon, Feb 20, 2012 at 2:09 PM, jmlucjav jmluc...@gmail.com wrote: Hi, I am using edismax with end user entered

Exception importing multi-valued UUID field

2012-02-20 Thread Greg Pelly
Hi, I exported a csv file from SOLR and made some changes, I then tried to reimport the file and got the exception below. It seems UUID field type can't import multi-values, I removed all of the multi-values and it imported without an issue. Cheers org.apache.solr.common.SolrException: Error

Re: Exception importing multi-valued UUID field

2012-02-20 Thread Greg Pelly
I also tried it with the comma escaped, so: '845b9db2-2a25-44e3-8eb4-3bf17cd16738\,c5477d5d-e77c-45e9-ab61-f7ca05499b37' So that's in the same format as it was exported, Excel must have removed the slash. But I still get the error with the slash. On Tue, Feb 21, 2012 at 11:26 AM, Greg Pelly

Re: Is Sphinx better suited to me, or should I look at Solr?

2012-02-20 Thread Damien Camilleri
I gave up on sphinx and went to solr. I feel it is more mature. For example, sphinx didn't have an auto start init script and they tried to hit me up for consultancy fees cos I asked a simple question. I use php and use solarium php client. Nice oop interface. Solr has a great community. My

Re: Exception importing multi-valued UUID field

2012-02-20 Thread Yonik Seeley
On Mon, Feb 20, 2012 at 7:26 PM, Greg Pelly gfpe...@gmail.com wrote: I exported a csv file from SOLR and made some changes, I then tried to reimport the file and got the exception below. It seems UUID field type can't import multi-values, I removed all of the multi-values and it imported

Re: Exception importing multi-valued UUID field

2012-02-20 Thread Erick Erickson
I don't think escaping is your problem, you probably want to take that bit out. Try adding f.youruuidfieldname.split=true when importing. You might also have to specify something like f.houruuidfieldname.separator=, but probably not, I suspect it's the default. See the split heading at: