Korean Tokenizer in solr

2014-07-10 Thread Poornima Jay
Hi, Anyone tried to implement korean language in solr 3.6.1. I define the field as below in my schema file but the fieldtype is not working. fieldType name=text_kr class=solr.TextField positionIncrementGap=1000       analyzer type=index         tokenizer class=solr.KoreanTokenizerFactory/      

Re: Korean Tokenizer in solr

2014-07-10 Thread Alexandre Rafalovitch
Double check your xml file that you don't - for example - define your fieldType outside of fields section. Or maybe you have exception earlier about some component in the type definition. This is not about Korean language, it seems. Something more fundamentally about XML config. Regards,

Integrate NLP to solr

2014-07-10 Thread Kamal Kishore Aggarwal
Dear Team, I am working on solr 4.8.1 version. I want to integrate solr with NLP in order to improve the search relevancy. I am following the tutorials: https://wiki.apache.org/solr/OpenNLP https://issues.apache.org/jira/browse/LUCENE-2899 Its says: - pull the latest trunk or 4.0 branch

Re: Korean Tokenizer in solr

2014-07-10 Thread Poornima Jay
I have defined the fieldtype inside the fields section.  When i checked the error log i found the below error Caused by: java.lang.ClassNotFoundException: solr.KoreanTokenizerFactory SEVERE: org.apache.solr.common.SolrException: analyzer without class or tokenizer filter list Do i need to

Re: Korean Tokenizer in solr

2014-07-10 Thread Alexandre Rafalovitch
I don't think Solr ships with Korean Tokenizer, does it? If you are using a 3rd party one, you need to give full class name, not just solr.Korean... And you need the library added in the lib statement in solrconfig.xml (at least in Solr 4). Regards, Alex. Personal website:

Re: Korean Tokenizer in solr

2014-07-10 Thread Poornima Jay
Till now I was thinking solr will support KoreanTokenizer. I haven't used any other 3rd party one.  Actually the issue i am facing is I need to integrate English, Chinese, Japanese and Korean language search in a single site. Based on the user's selected language to search the fields will be

Re: Korean Tokenizer in solr

2014-07-10 Thread Alexandre Rafalovitch
I would suggest you read through all 12 (?) articles in this series: http://discovery-grindstone.blogspot.com/2013/10/cjk-with-solr-for-libraries-part-1.html . It will probably lay out most of the issues for you. And if you are starting, I would really suggest using the latest Solr (4.9). A lot

Dataimport does not seem to commit in SOLR Cloud

2014-07-10 Thread dominic.martino
Hi all, Hope I am in the right place. I currently have a setup which is a Solr Cloud which has 3 nodes, I then have a zookeeper assemble with 3 nodes as well. I am doing a dataimport which fills the index with 3.8 million documents, if I look in the 'Cloud' section of the Solr admin

Re: Complement of {!join}

2014-07-10 Thread Erik Hatcher
Yeah, using multiple _query_’s has always been possible. _query_ is just a special field name that gets interpreted specially. Same with _val_ for function queries. It’s a feature that’s been leveraged and “documented” (by way of presentation materials and blogs, at least) such as: -

Re: Query in metadata sent to Solr

2014-07-10 Thread Ameya Aware
Hi I am using same solrconfig.xml which comes in default solr package. Do i need to make change anywhere? Thanks, Ameya On Wed, Jul 9, 2014 at 5:59 PM, Ahmet Arslan iori...@yahoo.com.invalid wrote: Hi, Field name sent with literal is Modified. In your screenshot, it is last_modified . Do

Re: Facets on Nested documents

2014-07-10 Thread Walter Liguori
Thank you very much Mikhail. Walter Ing. Walter Liguori 2014-07-09 21:43 GMT+02:00 Mikhail Khludnev mkhlud...@griddynamics.com: Colleagues, So far you can either vote or contribute to https://issues.apache.org/jira/browse/SOLR-5743 Walter, Usually, index-time tricks loose relationships

Reference numbers for major page fauls per seconds, index size, query throughput

2014-07-10 Thread Harald Kirsch
Hi everyone, currently I am taking some performance measurements on a Solr installation and I am trying to figure out if what I see mostly fits expectations: The data is as follows: - solr 4.8.1 - 8 millon documents - mostly office documents with real text content, stored - index size on

Re: run multiple queries at the same time

2014-07-10 Thread Jack Krupansky
How is running multiple queries at the same time different from running them in parallel? So, why not just run them in parallel. I mean, Solr can accept simultaneous query requests just fine, so what's the issue here? IOW, put the logic in your application layer and use SOlr as a streamlined

[Schema API] EmbeddedSolrServer Test

2014-07-10 Thread Alessandro Benedetti
Hi guys, I'm struggling testing schemaAPI REST endpoints thourgh EmbeddedSolrServer. Out of the box the Embedded Solr Server is not able to recognize the schema request handler. So I was trying to follow an approach like this : public static void init() throws Exception { final

Re: Integrate NLP to solr

2014-07-10 Thread Ahmet Arslan
Hi Kamal, Uncommitted code may not work.  Have you considered using https://cwiki.apache.org/confluence/display/solr/UIMA+Integration Ahmet On Thursday, July 10, 2014 10:33 AM, Kamal Kishore Aggarwal kkroyal@gmail.com wrote: Dear Team, I am working on solr 4.8.1 version. I want to

No server hosting shard on local SolrCloud installation

2014-07-10 Thread Ewa Kowalczuk
Hi! I'm trying to setup SolrCloud on single node (the reason I'm using Cloud version is I want to use aliasing collections feature). I want to use external Zookeeper (located on the same node) and Tomcat. I followed the instructions from: http://wiki.apache.org/solr/SolrCloudTomcat However,

Re: Query in metadata sent to Solr

2014-07-10 Thread Ahmet Arslan
Hi, Okey then remove this line from solrconfig.xml  str name=uprefixignored_/str and explicitly define a field named Modified in schema.xml with stored=true.  With this you should be able to see Modified field. Please see definition of uprefix parameter :

Solr 4.x and master-slave schema

2014-07-10 Thread Jorge Luis Betancourt Gonzalez
Hi all: We have a small installation of Solr 3.6 in our hands, right now we have 3 physical servers (1 master and 2 slaves) the ingestion process it’s done in the master which replicates by solr internal mechanism into the slaves, which handles all the queries. We are trying to update to Solr

Re: Query in metadata sent to Solr

2014-07-10 Thread Ameya Aware
Hi, Please find below solr log: INFO - 2014-07-09 15:30:56.243; org.apache.solr.update.processor.LogUpdateProcessor; [collection1] webapp=/solr path=/update/extract params={literal.deny_token_ document=DEAD_AUTHORITYliteral.DocIcon=docxresource.name=Anarchism-

RE: Lower/UpperCase Issue

2014-07-10 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Thanks Erick Shawn, I will analyze with your input and share outcome. Thanks Ravi -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, July 10, 2014 12:17 AM To: solr-user@lucene.apache.org Subject: Re: Lower/UpperCase Issue Side note. Puttling

Re: Query in metadata sent to Solr

2014-07-10 Thread Ahmet Arslan
Hi, Okey you want to use the value that sent with literal.Modified parameter. First, try literalsOverride=true parameter.  If that does not work, use FirstFieldValueUpdateProcessorFactory to reduce multivalued modified field to a single valued one. ahmet On Thursday, July 10, 2014 9:16 PM,

Re: Query in metadata sent to Solr

2014-07-10 Thread Ameya Aware
Also, When using below dynamic field dynamicField name=* type=string multiValued=true stored=true indexed=true / You said that we will get to see all field generated solr-cell and mcf. Where exactly i could see all these fields so that i can create my schema? Thanks, Ameya On Thu, Jul 10,

Re: Query in metadata sent to Solr

2014-07-10 Thread Ahmet Arslan
Hi, With that dynamic filed approach, you can see all these fields' values in result/response page. e.g. q=*:*fl=* Ahmet On Thursday, July 10, 2014 10:18 PM, Ameya Aware ameya.aw...@gmail.com wrote: Also, When using below dynamic field dynamicField name=* type=string multiValued=true

Re: Query in metadata sent to Solr

2014-07-10 Thread Ameya Aware
Got it. Thanks man. Ameya On Thu, Jul 10, 2014 at 3:36 PM, Ahmet Arslan iori...@yahoo.com.invalid wrote: Hi, With that dynamic filed approach, you can see all these fields' values in result/response page. e.g. q=*:*fl=* Ahmet On Thursday, July 10, 2014 10:18 PM, Ameya Aware

Re: Changing default behavior of solr for overwrite the whole document on uniquekey duplication

2014-07-10 Thread Ali Nazemian
Thank you very much. Now I understand what was the idea. It is better than changing Solr. But does performance remain same in this situation? On Tue, Jul 8, 2014 at 10:43 PM, Chris Hostetter hossman_luc...@fucit.org wrote: I think you are missunderstanding what Himanshu is suggesting to you.

Re: [Schema API] EmbeddedSolrServer Test

2014-07-10 Thread Chris Hostetter
EmbeddedSolrServer is really just for dealing with embedded SolrCore objects. It doesn't have ay of the other HTTP related plumbing assocaited with it -- which is why it can't be used for replication and/or cloud features, let alone the RESTLet based endpoints. if you want to write a test

Creating summary snippets for a null query

2014-07-10 Thread Michael Keeling
Hello, I'm moving to Solr from another search engine that would always return a snippet even for any empty/null search. In the case of an empty/null search, the first N characters would be returned as the summary snippet. What would be the best way to configure Solr and index content so that at

Re: Creating summary snippets for a null query

2014-07-10 Thread Ahmet Arslan
Hi, Please see hl.alternateField and hl.maxAlternateFieldLength parameters. http://wiki.apache.org/solr/HighlightingParameters#hl.alternateField Ahmet On Friday, July 11, 2014 12:03 AM, Michael Keeling michael.keel...@gmail.com wrote: Hello, I'm moving to Solr from another search engine

Re: SOLR-6143 Bad facet counts from CollapsingQParserPlugin

2014-07-10 Thread shamik
Are there any plans to release this feature anytime soon ? I think this is pretty important as a lot of search use case are dependent on the facet count being returned by the search result. This issue renders renders the CollapsingQParserPlugin pretty much unusable. I'm now reverting back to the

Re: Complement of {!join}

2014-07-10 Thread Alexandre Rafalovitch
Yes, slide 11 of first presentation and slide 35 of the second one do show that the _query or {!} can be used multiple types in one top query. The other slides/links do not, they only show it once. So, I think a lot of people may be missing that power. I suspect it's a bit like using the bbox

Re: run multiple queries at the same time

2014-07-10 Thread Lee Chunki
Hi Jack Krupansky, Thank you for your answer. Yes, I meant that in parallel. so, do you mean I should build sold component to run them in parallel ? I am not good at JAVA, is there any why to do that with built in solr component ? Thanks, Chunki. On Jul 10, 2014, at 11:53 PM, Jack

Re: run multiple queries at the same time

2014-07-10 Thread Alexandre Rafalovitch
There is no built-in component to do this kind of split. But there is a SynonymFilter, which is shown in the example schema and, I think, mentioned in the tutorial. Are you sure your needs are not covered by that already? Regards, Alex. Personal website: http://www.outerthoughts.com/ Current

Does solrj support partial update for solr cloud?

2014-07-10 Thread Shuai Zhang
For now,I used solr 4.7.1, when I test the partial update operation, I found it worked fine in HttpSolrServer, But when I used solr cloud CloudSolrServer , it cannot be supported!!! The document will be updated totally instead of partial update!!! The code I used in my program

Re: Does solrj support partial update for solr cloud?

2014-07-10 Thread shamik
Yes it does and pretty straight forward. Refer to following url : http://heliosearch.org/solr/atomic-updates/ http://www.mumuio.com/solrj-4-0-0-alpha-atomic-updates/ -- View this message in context: