Re: Source code of post in example package of Solr

2012-01-30 Thread Ahmet Arslan
> I am using the following jar to index files in xml format,
> and I want to
> look into the source code. Where can I find it? Thanks.
> 
> \apache-solr-3.5.0\example\exampledocs>java -jar post.jar
> *.xml

Hi Bing, it is 
http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/java/org/apache/solr/util/SimplePostTool.java


Re: Something like "featured results" in solr response?

2012-01-30 Thread Pranav Prakash
Wow, this looks interesting.

*Pranav Prakash*

"temet nosce"



On Mon, Jan 30, 2012 at 21:16, Erick Erickson wrote:

> There's the tricky line:
> "If the file exists in the /conf/ directory it will be loaded once at
> start-up. If it exists in the data directory, it will be reloaded for
> each IndexReader."
>
> on the page: http://wiki.apache.org/solr/QueryElevationComponent
>
> Which basically means that if your config file is in the right directory,
> it'll be reloaded whenever the index changes, i.e. when a replication
> happens in a master/slave setup or when a commit happens on
> a single machine used for both indexing  and searching.
>
> Best
> Erick
>
> On Mon, Jan 30, 2012 at 8:31 AM, Pranav Prakash  wrote:
> > Thanks a lot :-) This is exactly what I had read back then. However,
> going
> > through it now, it seems that everytime a document needs to be elevated,
> it
> > has to be in the config file. Which means that Solr should be restarted.
> > This does not make a lot of sense for a production environment, where
> Solr
> > restarts are as infrequent as config changes.
> >
> > What could be a sound way to implement this?
> >
> > *Pranav Prakash*
> >
> > "temet nosce"
> >
> > Twitter  | Blog <
> http://blog.myblive.com> |
> > Google 
> >
> >
> > 2012/1/30 Rafał Kuć 
> >
> >> Hello!
> >>
> >> Please look at http://wiki.apache.org/solr/QueryElevationComponent.
> >>
> >> --
> >> Regards,
> >>  Rafał Kuć
> >>  Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> >>
> >> > Hi,
> >>
> >> > I believe, there is a feature in Solr, which allows to return a set of
> >> > "featured" documents for a query. I did read it couple of months back,
> >> and
> >> > now when I have decided to work on it, I somehow can't find it's
> >> reference.
> >>
> >> > Here is the description - For a search keyword, apart from the results
> >> > generated by Solr (which is based on relevancy, score), there is
> another
> >> > set of documents which just comes up. It is very much similar to the
> >> > "sponsored results" feature of Google.
> >>
> >> > Can you guys point me to the appropriate resources for the same?
> >>
> >>
> >> > *Pranav Prakash*
> >>
> >> > "temet nosce"
> >>
> >> > Twitter  | Blog <
> >> http://blog.myblive.com> |
> >> > Google 
> >>
> >>
> >>
> >>
> >>
>


Re: solr custom component

2012-01-30 Thread Peter Markey
Hi Erick,

I tried looking for sample code to sort on date but was unable to find
any. I am using version 3.4.
Any idea where I can find some?

Thanks a ton
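
The idea in Erick's reply (order by the date field, newest first, and keep only the first hit) can be illustrated without any Solr classes. What follows is only a minimal sketch in plain Java over an in-memory list; `LatestDocSketch`, `Doc`, `latest` and the sample timestamps are made up for illustration and are not part of the Solr API. In the snippet quoted below, the third argument to getDocList is passed as null; presumably that is where a sort would be supplied, but that is an assumption that should be checked against the 3.4 sources.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class LatestDocSketch {
    // Hypothetical stand-in for an indexed document: an id plus a timestamp in epoch millis.
    static class Doc {
        final int id;
        final long timestamp;

        Doc(int id, long timestamp) {
            this.id = id;
            this.timestamp = timestamp;
        }
    }

    // Returns the doc with the largest timestamp that is <= now, or null if none qualifies.
    static Doc latest(List<Doc> docs, long now) {
        List<Doc> candidates = new ArrayList<>();
        for (Doc d : docs) {
            if (d.timestamp <= now) {   // plays the role of the range query up to NOW
                candidates.add(d);
            }
        }
        // Sort newest first; the range filter alone gives no such ordering.
        candidates.sort(Comparator.comparingLong((Doc d) -> d.timestamp).reversed());
        return candidates.isEmpty() ? null : candidates.get(0);
    }

    public static void main(String[] args) {
        List<Doc> docs = new ArrayList<>();
        docs.add(new Doc(1, 1000L));
        docs.add(new Doc(2, 3000L));
        docs.add(new Doc(3, 2000L));
        docs.add(new Doc(4, 9000L)); // later than now = 5000, so it is filtered out
        System.out.println(latest(docs, 5000L).id); // prints 2
    }
}
```

The point of the sketch is that the range restriction and the ordering are two separate steps: the filter keeps everything up to "now", and only the descending sort guarantees that the single row kept is the closest one.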

On Fri, Jan 27, 2012 at 8:13 AM, Erick Erickson wrote:

> Why not just sort on date and take the first doc returned in the list?
>
> Best
> Erick
>
> On Thu, Jan 26, 2012 at 10:33 AM, Peter Markey 
> wrote:
> > Hello,
> >
> > I am building a custom component in Solr and I am trying to construct a
> > query to get the latest (based on a date field) DocID using
> > SolrIndexSearcher. Below is a short snippet of my code:
> >
> > SolrIndexSearcher searcher =
> > final SchemaField sf = searcher.getSchema().getField(dateField);
> > // dateField is one of the fields that contains the timestamp of the record
> >
> > final IndexSchema schema = searcher.getSchema();
> >
> > // NOW is the current Date
> > Query rangeQ = ((DateField)(sf.getType())).getRangeQuery(null, sf, null, NOW, false, true);
> >
> > // base is a set of doc filters to limit the search
> > DocList dateDocs = searcher.getDocList(rangeQ, base, null, 0, 1);
> >
> > Though I get some docs that satisfy the query, my goal is to get the doc
> > whose dateField is closest to the current time. Are there any other
> > queries I can employ for this?
> >
> >
> > Thanks a lot for any suggestions.
>


Source code of post in example package of Solr

2012-01-30 Thread bing
Hi, all, 

I am using the following jar to index files in xml format, and I want to
look into the source code. Where can I find it? Thanks.

\apache-solr-3.5.0\example\exampledocs>java -jar post.jar *.xml

Best Regards, 
Bing

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Source-code-of-post-in-example-package-of-Solr-tp3702100p3702100.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: can solr automatically search for different punctuation of a word

2012-01-30 Thread alxsss

 Hi Chantal,

In the readme file at  solr/contrib/analysis-extras/README.txt it says to add 
the ICU library (in lib/)

Do I need also add ... and where?

Thanks.
Alex.

 

 

-Original Message-
From: Chantal Ackermann 
To: solr-user 
Sent: Fri, Jan 13, 2012 1:52 am
Subject: Re: can solr automatically search for different punctuation of a word


Hi Alex,

for me, ICUFoldingFilterFactory works very well. It does lowercasing and
removes diacritics (this is what umlauts and accents on letters are
called - punctuation means commas, periods etc.). It will work for any
language, not only German. And it will also handle apostrophes as in
"C'est bien".

ICU requires additional libraries in the classpath. For a built-in Solr
solution have a look at ASCIIFoldingFilterFactory.

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ASCIIFoldingFilterFactory
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ICUFoldingFilterFactory

Example configuration:

And dependencies (example for Maven) in addition to solr-core:

<dependency>
  <groupId>org.apache.lucene</groupId>
  <artifactId>lucene-icu</artifactId>
  <version>${solr.version}</version>
  <scope>runtime</scope>
</dependency>
<dependency>
  <groupId>org.apache.solr</groupId>
  <artifactId>solr-analysis-extras</artifactId>
  <version>${solr.version}</version>
  <scope>runtime</scope>
</dependency>

Cheers,

Chantal
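
To make the folding idea above concrete, here is a rough sketch in plain Java of what "lowercase and remove diacritics" does to a term such as Über. It uses only java.text.Normalizer from the JDK; it is not the code behind ICUFoldingFilterFactory or ASCIIFoldingFilterFactory, which handle many more cases, and the class and method names (`FoldingSketch`, `fold`) are invented for this illustration.

```java
import java.text.Normalizer;
import java.util.Locale;

class FoldingSketch {
    // Lowercases the input and strips combining marks (accents, umlaut dots)
    // after canonical decomposition. A simplification of real folding filters.
    static String fold(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        String withoutMarks = decomposed.replaceAll("\\p{M}+", "");
        return withoutMarks.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        // \u00DC is the letter U with diaeresis, written as an escape to avoid encoding issues.
        System.out.println(fold("\u00DCber")); // prints uber
        System.out.println(fold("Uber"));      // prints uber
    }
}
```

If the same folding is applied at index time and at query time, both spellings end up as the same term, which is why a search for Uber can also find Über without a synonym file.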



On Fri, 2012-01-13 at 00:09 +0100, alx...@aim.com wrote:
> Hello,
>
> I would like to know if solr has a functionality to automatically search for
> a different punctuation of a word.
> For example, if a user searches for the word Uber, and the stemmer is German,
> then solr looks for both Uber and Über, like in synonyms.
>
> Is it possible to give a file with a list of possible substitutions of
> letters to solr and have it search for all possible punctuations?
>
> Thanks.
> Alex.




 


Re: language specific fields of "text"

2012-01-30 Thread bing
Hi, Paul, 

I understand your point about "text_en" missing from the document. It is
missing: "text" exists, but "text_en" does not.
But this raises a question: isn't adding language-specific suffixes to an
existing field "text" supposed to happen dynamically?

I am new here. As far as I know, for some field "title", people can create
"title_en" and "title_fr" to apply different analyzers in the same schema.
Even so, I do not see that happening. So I wonder whether I am overlooking
some obvious point.

"Bing" is very common in Chinese names, as several Chinese characters share
the same pronunciation.

Thanks for the reply.

Best Regards, 
Bing

--
View this message in context: 
http://lucene.472066.n3.nabble.com/language-specific-fields-of-text-tp3698985p3702053.html


Re: Multilingual search in multicore solr

2012-01-30 Thread bing
Hi, Erick Erickson, 

Your suggestions are sound.

For (1), if I use SolrJ as the client to access Solr, then the Java coding
becomes the most challenging part. Technically, I want to achieve the same
effect with highlighting, faceted search, language detection, etc. Do you
know of any example source code that I can refer to?

For (2), I agree with you on the difficulty of detecting the language from just
a few words. So, alternatively, I can suggest a set of results and let
users decide.
You also mentioned score. Say I do not have many cores, so I send every
query to all the cores and get back a set of scores. Is it safe to
conclude that the highest score indicates the most relevant result?

Thanks.

Best Regards, 
Ni Bing

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multilingual-search-in-multicore-solr-tp3698969p3702041.html


Indexing leave behind write.lock file.

2012-01-30 Thread Koorosh Vakhshoori
Hi,
I am using SolrJ to reindex a core in a multicore setup. The general
flow of my program is as follows (pseudo code):

String solrHome = "/opt/solr/home";
File solrXml = new File( solrHome, "solr.xml" );
CoreContainer container = new CoreContainer();
container.load( solrHome, solrXml );
SolrServer solr = new EmbeddedSolrServer( container, "core1" );
solr.deleteByQuery( "*:*" );
SolrInputDocument doc1 = new SolrInputDocument();
doc1.addField( "id", "id1", 1.0f );
doc1.addField( "name", "doc1", 1.0f );
Collection<SolrInputDocument> docs = new ArrayList<SolrInputDocument>();
docs.add( doc1 );
solr.add( docs );
solr.commit();
SolrCore curCore = container.getCore( "core1" );
curCore.close();

I thought for sure that by calling close() I would also be releasing all
associated resources, including the lock on the core, that is,
I would be getting rid of the write.lock file.

I am using Solr 4.0 code from the development trunk, which is about a month old.

Any suggestion here appreciated.
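
One possible explanation, offered only as a guess: if the core handed out by the container is reference-counted, then the close() in the pseudo code releases just the reference that getCore() took, and the container still holds its own. The toy model below shows that behaviour in plain Java. `RefCountSketch` and `CountedResource` are hypothetical names, nothing here is Solr code, and whether this is what actually keeps write.lock around would need to be confirmed against the trunk sources.

```java
class RefCountSketch {
    // Hypothetical resource that keeps a lock until its last reference is closed.
    static class CountedResource {
        private int refCount = 1;        // the owner (think: a container) holds the first reference
        private boolean lockHeld = true;

        // Hands out another reference to the same resource.
        CountedResource open() {
            refCount++;
            return this;
        }

        // Drops one reference; the lock is released only when none remain.
        void close() {
            if (refCount > 0 && --refCount == 0) {
                lockHeld = false;
            }
        }

        boolean isLockHeld() {
            return lockHeld;
        }
    }

    public static void main(String[] args) {
        CountedResource core = new CountedResource();
        CountedResource handle = core.open();  // like asking the container for the core
        handle.close();                        // releases only the reference just taken
        System.out.println(core.isLockHeld()); // prints true
        core.close();                          // the owner lets go, e.g. when it shuts down
        System.out.println(core.isLockHeld()); // prints false
    }
}
```

Under that assumption, the lock would only go away once the owner of the first reference releases it as well, for example by shutting the container down at the end of the program.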

Regards,

Koorosh


product(popularity,score) gives error undefined field score

2012-01-30 Thread abhayd
Hi,

I'm trying to add some weight for popularity to the score returned by a Solr
query:
http://localhost:10101/solr/syx/select?q={!boost%20b=product(popularity,score)}SIM&rows=100&fl=score,id&debug=true
 

I get the error "undefined field score".

Any idea how to do this?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/product-popularity-score-gives-error-undefined-field-score-tp3701734p3701734.html


SolrCloud war?

2012-01-30 Thread Darren Govoni

Hi,
  Is there any issue with running the new SolrCloud deployed as a war 
in another app server?

Has anyone tried this yet?

thanks.


Re: SOLVED: Strange things happen when I query with many facet.prefixes and fq filters

2012-01-30 Thread Yuhao
Good question.  I checked the output sent to Jetty.  In the case where it 
returns a blank page, nothing at all is sent to Jetty.  This raised my 
suspicion that Solr never got a chance to process the query.  Sure enough, it 
led me to the finding that Jetty by default cannot take more than 4 KB of 
header.  After I increased that limit, everything works.
Problem solved.
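
For anyone hitting the same symptom, a quick way to see whether a generated GET URL is likely to run into such a limit is to measure the request line before sending it. The sketch below is plain Java and makes two assumptions: the 4 KB figure is simply the default reported above, and the request line is assumed to count toward that limit together with the other headers (which are ignored here, so the estimate is a lower bound). The class, constant and method names are illustrative only.

```java
import java.nio.charset.StandardCharsets;

class RequestSizeSketch {
    // 4 KB, taken from the message above; adjust to whatever the container is configured with.
    static final int ASSUMED_HEADER_LIMIT_BYTES = 4 * 1024;

    // Size in bytes of the request line "GET <pathAndQuery> HTTP/1.1\r\n".
    static int requestLineBytes(String pathAndQuery) {
        String line = "GET " + pathAndQuery + " HTTP/1.1\r\n";
        return line.getBytes(StandardCharsets.US_ASCII).length;
    }

    // True if the request line alone already exceeds the assumed limit.
    static boolean likelyTooLong(String pathAndQuery) {
        return requestLineBytes(pathAndQuery) > ASSUMED_HEADER_LIMIT_BYTES;
    }

    public static void main(String[] args) {
        // Build an artificially long query with many fq filters, similar in spirit to the URL below.
        StringBuilder sb = new StringBuilder("/solr/browse?q=");
        for (int i = 0; i < 300; i++) {
            sb.append("&fq=field_").append(i).append("%3Avalue");
        }
        System.out.println(likelyTooLong(sb.toString()));        // prints true
        System.out.println(likelyTooLong("/solr/select?q=*:*")); // prints false
    }
}
```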





 From: Erick Erickson 
To: solr-user@lucene.apache.org; Yuhao  
Sent: Sunday, January 29, 2012 1:05 PM
Subject: Re: Strange things happen when I query with many facet.prefixes and fq 
filters
 
The very first question I have is "what do your Solr logs show"? I suspect
you'll see something interesting there. Otherwise, there's no way really to
say what's going on here without reproducing your setup...

Best
Erick

On Fri, Jan 27, 2012 at 6:48 PM, Yuhao  wrote:
> Hi,
>
> I'm having issues when running the following query, which is produced by 
> expanding several hierarchical facets (implemented the facet.prefix way).  I 
> realize it's pretty massive, but I'd like to figure out what exactly is 
> causing the problem.  Is it too many facet.prefix clauses, too many fq 
> filters, the combo of both, or what.  Anyway, here is the URL I start out
>  with:
>
> http://40.163.5.153:920/solr/browse?&fq=Gene_Ontology_Associations%3A%220%2Fbiological_process%28GO%3A0008150%29%22&fq=Gene_Ontology_Associations%3A%221%2Fbiological_process%28GO%3A0008150%29%3Bmetabolic+process%28GO%3A0008152%29%22&fq=Gene_Ontology_Associations%3A%222%2Fbiological_process%28GO%3A0008150%29%3Bmetabolic+process%28GO%3A0008152%29%3Bsteroid+metabolic+process%28GO%3A0008202%29%22&fq=Gene_Ontology_Associations%3A%223%2Fbiological_process%28GO%3A0008150%29%3Bmetabolic+process%28GO%3A0008152%29%3Bsteroid+metabolic+process%28GO%3A0008202%29%3Bcholesterol+metabolic+process%28GO%3A0008203%29%22&fq=Mouse_Phenotype_Associations%3A%220%2Fmammalian+phenotype%28MP%3A001%29%22&fq=Mouse_Phenotype_Associations%3A%221%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%22&fq=Mouse_Phenotype_Associations%3A%222%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fme
> tabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%22&fq=Mouse_Phenotype_Associations%3A%223%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%3Babnormal+lipid+homeostasis%28MP%3A0002118%29%22&fq=Mouse_Phenotype_Associations%3A%224%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%3Babnormal+lipid+homeostasis%28MP%3A0002118%29%3Babnormal+cholesterol+homeostasis%28MP%3A0005278%29%22&fq=Mouse_Phenotype_Associations%3A%225%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%3Babnormal+lipid+homeostasis%28MP%3A0002118%29%3Babnormal+cholesterol+homeostasis%28MP%3A0005278%29%3Babnormal+cholesterol+level%28MP%3A0003947%29%22&fq=Mouse_Phenotype_Associations%3A%226%2Fm
> ammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%3Babnormal+lipid+homeostasis%28MP%3A0002118%29%3Babnormal+cholesterol+homeostasis%28MP%3A0005278%29%3Babnormal+cholesterol+level%28MP%3A0003947%29%3Bdecreased+cholesterol+level%28MP%3A0003983%29%22&fq=Mouse_Phenotype_Associations%3A%227%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%3Babnormal+lipid+homeostasis%28MP%3A0002118%29%3Babnormal+cholesterol+homeostasis%28MP%3A0005278%29%3Babnormal+cholesterol+level%28MP%3A0003947%29%3Bdecreased+cholesterol+level%28MP%3A0003983%29%3Bdecreased+liver+cholesterol+level%28MP%3A0010026%29%22&fq=BKL_Diagnostic_Marker_Associations%3A%220%2FCardiovascular+Diseases%28MESH%3AD002318%29%22&fq=BKL_Molecular_Mechanism_Associations%3A%220%2FCardiovascular+Diseases%28MESH%3AD002318%29%22&fq=
> BKL_Diagnostic_Marker_Associations%3A%221%2FCardiovascular+Diseases%28MESH%3AD002318%29%3BArteriosclerosis%28MESH%3AD001161%29%22&q=&fq=BKL_Diagnostic_Marker_Associations:%222%2FCardiovascular+Diseases%28MESH%3AD002318%29%3BArteriosclerosis%28MESH%3AD001161%29%3BAtherosclerosis%28MESH%3AD050197%29%22&f.Gene_Ontology_Associations.facet.prefix=4%2Fbiological_process%28GO%3A0008150%29%3Bmetabolic+process%28GO%3A0008152%29%3Bsteroid+metabolic+process%28GO%3A0008202%29%3Bcholesterol+metabolic+process%28GO%3A0008203%29&f.Mouse_Phenotype_Associations.facet.prefix=8%2Fmammalian+phenotype%28MP%3A001%29%3Bhomeostasis%2Fmetabolism+phenotype%28MP%3A0005376%29%3Babnormal+homeostasis%28MP%3A0001764%29%3Babnormal+lipid+homeostasis%28MP%3A0002118%29%3Babnormal+cholesterol+homeostasis%28MP%3A0005278%29%3Babnormal+cholesterol+level%28MP%3A0003947%29%3Bdecreased+cholesterol+level%28MP%3A0

Out of Memory

2012-01-30 Thread Milan Dobrota
Hi,

I have a Solr instance with 6M item index. It normally uses around 3G of
memory. I have suddenly started getting out of memory errors and increasing
the Xmx parameter to over 4G didn't fix the problem. It was just buying us
time. Inspecting the heap, I figured that 90% of memory is occupied by
FieldCache. Is this normal? We do very little sorting and no faceting.

Is FieldCache ever supposed to get cleared? Can this be done through HTTP?

Do we need more memory? If so, I don't understand why the minimal set of
changes we introduced (one multivalued field) would cause the memory to
drastically increase.

The communication with the Solr instance is done via HTTP.

Java version:
java version "1.6.0_17"
Java(TM) SE Runtime Environment (build 1.6.0_17-b04)
Java HotSpot(TM) 64-Bit Server VM (build 14.3-b01, mixed mode)

Milan


RE: Sort by geoDist() syntax error on 3.5

2012-01-30 Thread Steven A Rowe
No, it's not.

If I follow the link below your email to nabble.com, I can read your message, 
including your suggested correct syntax - I'll quote it below for you, and 
since I don't use nabble, everyone who reads this list will be able to see it:


> Correct syntax may be :
>
> select?q=%20&start=0&rows=8&qt=event&fl=*,geodata,%20score&fq={!geofilt}&pt=48.867,2.333&sfield=geodata&d=20&sort=geodist()%20asc


If you depend on people to follow a link to nabble.com to read your message, 
I'm afraid you won't get a lot of readers - people expect to be able to read 
through their own mail readers.

Steve

> -Original Message-
> From: darul [mailto:daru...@gmail.com]
> Sent: Monday, January 30, 2012 11:52 AM
> To: solr-user@lucene.apache.org
> Subject: RE: Sort by geoDist() syntax error on 3.5
> 
> Is it readable now ?
> 
> Thanks Steve.
> 
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Sort-by-
> geoDist-syntax-error-on-3-5-tp360p3700539.html
> Sent from the Solr - User mailing list archive at Nabble.com.


RE: Sort by geoDist() syntax error on 3.5

2012-01-30 Thread darul
Is it readable now ?

Thanks Steve.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-by-geoDist-syntax-error-on-3-5-tp360p3700539.html


Index-Analyzer on Master with StopFilterFactory and Query-Analyzer on Slave with StopFilterFactory

2012-01-30 Thread Daniel Brügge
Hi,

I am using a 'text_general' fieldType (class = solr.TextField) in my
schema, and I have a master/slave setup
where I index on the master and read from the slaves. In the text_general
field type I am using two analyzers, one for
indexing and one for querying, both with stopword filters.

I am wondering whether it would make sense to have a different schema on
the master than on the slaves: just the
index analyzer in the master's schema and the query analyzer in the slaves'
schema?

What do you think?

Thanks & best regards

Daniel


RE: Sort by geoDist() syntax error on 3.5

2012-01-30 Thread Steven A Rowe
Hi darul,

Nobody got your suggested "Correct syntax": Nabble.com strips markup, e.g. XML, 
from emails that you send through them.

I have complained to them about this problem through their support channel 
repeatedly, and you can see the result: they have done nothing to fix the 
problem.

My suggestion: find a different provider who won't trash your messages.

Nabble, if you're listening: get it together!

Steve

> -Original Message-
> From: darul [mailto:daru...@gmail.com]
> Sent: Monday, January 30, 2012 8:50 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Sort by geoDist() syntax error on 3.5
> 
> Well, I guess I have found the problem in the following thread "geodist()
> sort does not work if sfield parameter is enclosed in LocalParams"
> 
> http://www.lucidimagination.com/search/link?url=https://issues.apache.org/
> jira/browse/SOLR-2254
> 
> Correct syntax may be :
> 
> 
> 
> 
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Sort-by-
> geoDist-syntax-error-on-3-5-tp360p3700020.html
> Sent from the Solr - User mailing list archive at Nabble.com.


Re: Something like "featured results" in solr response?

2012-01-30 Thread Erick Erickson
There's the tricky line:
"If the file exists in the /conf/ directory it will be loaded once at
start-up. If it exists in the data directory, it will be reloaded for
each IndexReader."

on the page: http://wiki.apache.org/solr/QueryElevationComponent

Which basically means that if your config file is in the right directory,
it'll be reloaded whenever the index changes, i.e. when a replication
happens in a master/slave setup or when a commit happens on
a single machine used for both indexing  and searching.

Best
Erick

On Mon, Jan 30, 2012 at 8:31 AM, Pranav Prakash  wrote:
> Thanks a lot :-) This is exactly what I had read back then. However, going
> through it now, it seems that everytime a document needs to be elevated, it
> has to be in the config file. Which means that Solr should be restarted.
> This does not make a lot of sense for a production environment, where Solr
> restarts are as infrequent as config changes.
>
> What could be a sound way to implement this?
>
> *Pranav Prakash*
>
> "temet nosce"
>
> Twitter  | Blog  |
> Google 
>
>
> 2012/1/30 Rafał Kuć 
>
>> Hello!
>>
>> Please look at http://wiki.apache.org/solr/QueryElevationComponent.
>>
>> --
>> Regards,
>>  Rafał Kuć
>>  Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
>>
>> > Hi,
>>
>> > I believe, there is a feature in Solr, which allows to return a set of
>> > "featured" documents for a query. I did read it couple of months back,
>> and
>> > now when I have decided to work on it, I somehow can't find it's
>> reference.
>>
>> > Here is the description - For a search keyword, apart from the results
>> > generated by Solr (which is based on relevancy, score), there is another
>> > set of documents which just comes up. It is very much similar to the
>> > "sponsored results" feature of Google.
>>
>> > Can you guys point me to the appropriate resources for the same?
>>
>>
>> > *Pranav Prakash*
>>
>> > "temet nosce"
>>
>> > Twitter  | Blog <
>> http://blog.myblive.com> |
>> > Google 
>>
>>
>>
>>
>>


Re: Multilingual search in multicore solr

2012-01-30 Thread Erick Erickson
for <1>. Not that I know of. What you can do, and relatively simply
at that, is create a SolrJ program that uses Tika to parse the files
on the *client*. At that point you can do anything you'd like, including
detect language, route the document to the right core, etc. This will
also give you more control over how meta-data parsed by Tika is
mapped to your documents.

about <2>. This is hard, mostly because the queries have very
little text to analyze. Consider not worrying about it, just send the
query to all the languages in the hope that anything language
specific is scored higher.

You can do things like detect the language the browser defaults
to or ask the user to provide a "preferred language", but trying
to determine the language based on a short phrase is notoriously
hard.

Best
Erick

On Sun, Jan 29, 2012 at 10:50 PM, bing  wrote:
> Hi, all,
>
> I am going to multilingual search in multicore solr. Specifically, the
> design of the solr server is like: I have several cores corresponding to
> different languages, where each core has its configuration files and data.
>
> I have following questions:
>
> 1. While indexing a document, I use ExtractingRequestHandler in Tika0.10
> (embed in Solr3.5.0) and I can get a field "language_s" after indexing. Is
> it possible to get the info of the "language_s" before indexing happens, so
> that I can put the document in the corresponding core?
>
> 2. In searching with a query, is it possible that I can use language
> detection function to determine the language code of the query, so that I
> direct the query to the corresponding core?
>
> Thanks for your suggestions.
>
> Note:  In this thread I would like to stick on multicore solr and want to
> see whether the problems can be solved. Meanwhile, I am aware that
> multilingual search does not necessarily need multicore solr, which I have
> learned in previous thread.
> http://lucene.472066.n3.nabble.com/Tika0-10-language-identifier-in-Solr3-5-0-tt3671712.html#none
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Multilingual-search-in-multicore-solr-tp3698969p3698969.html
> Sent from the Solr - User mailing list archive at Nabble.com.


Re: "sage 200" not matching "... sage 200."

2012-01-30 Thread Ahmet Arslan
> The trailing full-stop above is not
> being matched when searching for "sage 200" for the below
> field type...
> 
> Do I need the WordDelimiterFilterFactory for this to work as
> expected? I don't see any mention of periods being discussed
> in the docs.
> 
> 
> <fieldType ... positionIncrementGap="100">
>     <analyzer ...>
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>         <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>     </analyzer>
>     <analyzer ...>
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>         <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>     </analyzer>
> </fieldType>
> 

The whitespace tokenizer leaves periods attached to the tokens. Either use StandardTokenizer or include 
WordDelimiterFilter. 

Analysis page visualizes created tokens, it is useful when 
testing/understanding tokenizer/filter behavior.
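
To see why the trailing period gets in the way, here is a small imitation of the analysis chain in plain Java: split on whitespace, then lowercase. It is only a sketch and does not use the Solr tokenizer or filter classes (the synonym step is left out), and `WhitespaceTokenSketch` and `tokens` are names made up for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

class WhitespaceTokenSketch {
    // Splits on runs of whitespace and lowercases; punctuation stays attached to the tokens.
    static List<String> tokens(String text) {
        String normalized = text.trim().toLowerCase(Locale.ROOT);
        if (normalized.isEmpty()) {
            return new ArrayList<>();
        }
        return Arrays.asList(normalized.split("\\s+"));
    }

    public static void main(String[] args) {
        System.out.println(tokens("We use Sage 200."));                 // prints [we, use, sage, 200.]
        System.out.println(tokens("sage 200"));                         // prints [sage, 200]
        System.out.println(tokens("We use Sage 200.").contains("200")); // prints false
    }
}
```

The indexed token is "200." while the query token is "200", so the two never compare equal unless something in the chain removes the period.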


"sage 200" not matching "... sage 200."

2012-01-30 Thread Robert Brown
The trailing full-stop above is not being matched when searching for 
"sage 200" for the below field type...


Do I need the WordDelimiterFilterFactory for this to work as expected? 
I don't see any mention of periods being discussed in the docs.



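<fieldType ... positionIncrementGap="100">
    <analyzer ...>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
    <analyzer ...>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
</fieldType>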


Thanks,
Rob


--

IntelCompute
Web Design & Local Online Marketing

http://www.intelcompute.com



Re: Sort by geoDist() syntax error on 3.5

2012-01-30 Thread darul
Well, I guess I have found the problem in the following thread "geodist()
sort does not work if sfield parameter is enclosed in LocalParams"

http://www.lucidimagination.com/search/link?url=https://issues.apache.org/jira/browse/SOLR-2254

Correct syntax may be :




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-by-geoDist-syntax-error-on-3-5-tp360p3700020.html


Re: Something like "featured results" in solr response?

2012-01-30 Thread Ahmet Arslan
> it seems that everytime a document needs to
> be elevated, it
> has to be in the config file. Which means that Solr should
> be restarted.
> This does not make a lot of sense for a production
> environment, where Solr
> restarts are as infrequent as config changes.
> 
> What could be a sound way to implement this?

There are open issues about this:

https://issues.apache.org/jira/browse/SOLR-2465
https://issues.apache.org/jira/browse/SOLR-1307


Sort by geoDist() syntax error on 3.5

2012-01-30 Thread darul
Hello,

I have looked through the threads but have not found the correct
syntax for sorting by distance.

Here is the query with no sort:


With sort activated, it throws an exception :

I have tried:


No success.

Schema:


Thanks in advance, I really need it as you may imagine.

Jul

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sort-by-geoDist-syntax-error-on-3-5-tp360p360.html


Re: Something like "featured results" in solr response?

2012-01-30 Thread Pranav Prakash
Thanks a lot :-) This is exactly what I had read back then. However, going
through it now, it seems that every time a document needs to be elevated, it
has to be in the config file, which means that Solr has to be restarted.
This does not make a lot of sense for a production environment, where Solr
restarts are as infrequent as config changes.

What could be a sound way to implement this?

*Pranav Prakash*

"temet nosce"



2012/1/30 Rafał Kuć 

> Hello!
>
> Please look at http://wiki.apache.org/solr/QueryElevationComponent.
>
> --
> Regards,
>  Rafał Kuć
>  Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
>
> > Hi,
>
> > I believe, there is a feature in Solr, which allows to return a set of
> > "featured" documents for a query. I did read it couple of months back,
> and
> > now when I have decided to work on it, I somehow can't find it's
> reference.
>
> > Here is the description - For a search keyword, apart from the results
> > generated by Solr (which is based on relevancy, score), there is another
> > set of documents which just comes up. It is very much similar to the
> > "sponsored results" feature of Google.
>
> > Can you guys point me to the appropriate resources for the same?
>
>
> > *Pranav Prakash*
>
> > "temet nosce"
>
> > Twitter  | Blog <
> http://blog.myblive.com> |
> > Google 
>
>
>
>
>


Re: Something like "featured results" in solr response?

2012-01-30 Thread Rafał Kuć
Hello!

Please look at http://wiki.apache.org/solr/QueryElevationComponent.

-- 
Regards,
 Rafał Kuć
 Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch

> Hi,

> I believe, there is a feature in Solr, which allows to return a set of
> "featured" documents for a query. I did read it couple of months back, and
> now when I have decided to work on it, I somehow can't find it's reference.

> Here is the description - For a search keyword, apart from the results
> generated by Solr (which is based on relevancy, score), there is another
> set of documents which just comes up. It is very much similar to the
> "sponsored results" feature of Google.

> Can you guys point me to the appropriate resources for the same?


> *Pranav Prakash*

> "temet nosce"

> Twitter  | Blog  |
> Google 






Something like "featured results" in solr response?

2012-01-30 Thread Pranav Prakash
Hi,

I believe there is a feature in Solr that can return a set of
"featured" documents for a query. I read about it a couple of months back, and
now that I have decided to work on it, I somehow can't find a reference to it.

Here is the description - For a search keyword, apart from the results
generated by Solr (which is based on relevancy, score), there is another
set of documents which just comes up. It is very much similar to the
"sponsored results" feature of Google.

Can you guys point me to the appropriate resources for the same?


*Pranav Prakash*

"temet nosce"



Re: Which patch 236 to choose for collapse - Solr 3.5

2012-01-30 Thread O. Klein
You need to remove the 
class="org.apache.solr.handler.component.CollapseComponent" from
solrconfig.xml as it no longer exists.

With group.format=simple you get output looking more like the output you are
used to.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3699595.html


stop distributed facet query when a condition is met

2012-01-30 Thread Dmitry Kan
Hello!

I probably have an unusual question. I'm trying to implement a modification
to FacetComponent's distributed search. This modification would act as
follows:

1. First query the most recent shard. If facet.limit has been satisfied,
return what is found.
2. If facet.limit is not yet satisfied, continue searching with the
remaining shard set.

What would be the proven way to achieve this?
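
The two steps above can be sketched outside of Solr as follows. This is a toy model in plain Java, not a patch to FacetComponent: each shard is represented as a map of facet value to count, "facet.limit satisfied" is assumed to mean that at least that many distinct facet values have been collected, and `ShardFacetSketch`, `facet` and `mergeInto` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ShardFacetSketch {
    // Queries the newest shard first and only falls back to the remaining
    // shards when fewer than facetLimit distinct values were found.
    static Map<String, Integer> facet(List<Map<String, Integer>> shardsNewestFirst, int facetLimit) {
        Map<String, Integer> merged = new HashMap<>();
        if (shardsNewestFirst.isEmpty()) {
            return merged;
        }
        mergeInto(merged, shardsNewestFirst.get(0));
        if (merged.size() >= facetLimit) {
            return merged;               // step 1: the newest shard was enough
        }
        for (Map<String, Integer> shard : shardsNewestFirst.subList(1, shardsNewestFirst.size())) {
            mergeInto(merged, shard);    // step 2: add counts from the remaining shards
        }
        return merged;
    }

    // Adds the counts of one shard to the running totals.
    static void mergeInto(Map<String, Integer> target, Map<String, Integer> counts) {
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            target.merge(e.getKey(), e.getValue(), Integer::sum);
        }
    }

    public static void main(String[] args) {
        Map<String, Integer> newest = new HashMap<>();
        newest.put("a", 3);
        newest.put("b", 1);
        Map<String, Integer> older = new HashMap<>();
        older.put("a", 2);
        older.put("c", 5);
        List<Map<String, Integer>> shards = new ArrayList<>();
        shards.add(newest);
        shards.add(older);

        System.out.println(facet(shards, 2).get("a")); // prints 3 (older shard skipped)
        System.out.println(facet(shards, 3).get("a")); // prints 5 (older shard merged in)
    }
}
```

One consequence worth noting: when the search stops early, the counts reflect only the newest shard, so they are partial rather than totals across the whole index.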

-- 
Regards,

Dmitry Kan


Query for exact part of sentence

2012-01-30 Thread Arkadi Colson

Hi

I'm using the PECL PHP class to query SOLR and was wondering how to 
query for an exact part of a sentence.


There are 2 data items indexed in SOLR:
1327497476: 123 456 789
1327497521: 1234 5678 9011

However when running the query, both data items are returned as you can 
see below. Any idea why?


Thanks!

SolrObject Object
(
[responseHeader] =>  SolrObject Object
(
[status] =>  0
[QTime] =>  5016
[params] =>  SolrObject Object
(
[debugQuery] =>  true
[shards] =>  
solr01:8983/solr,solr02:8983/solr,solr03:8983/solr
[fl] =>  
id,smsc_module,smsc_ssid,smsc_description,smsc_content,smsc_courseid,smsc_date_created,smsc_date_edited,score,metadata_stream_size,metadata_stream_source_info,metadata_stream_name,metadata_stream_content_type,last_modified,author,title,subject
[sort] =>  smsc_date_created asc
[indent] =>  on
[start] =>  0
[q] =>  (smsc_content:\"123 456\" || smsc_description:\"123 456\")&&  
(smsc_module:Intradesk)&&  (smsc_date_created:[2011-12-25T10:29:51Z TO NOW])&&  (smsc_ssid:38)
[distrib] =>  true
[wt] =>  xml
[version] =>  2.2
[rows] =>  55
)

)

[response] =>  SolrObject Object
(
[numFound] =>  2
[start] =>  0
[docs] =>  Array
(
[0] =>  SolrObject Object
(
[smsc_module] =>  Intradesk
[smsc_ssid] =>  38
[id] =>  1327497476
[smsc_courseid] =>  0
[smsc_date_created] =>  2011-12-25T10:29:51Z
[smsc_date_edited] =>  2011-12-25T10:29:51Z
[score] =>  10.028017
)

[1] =>  SolrObject Object
(
[smsc_module] =>  Intradesk
[smsc_ssid] =>  38
[id] =>  1327497521
[smsc_courseid] =>  0
[smsc_date_created] =>  2011-12-25T10:29:51Z
[smsc_date_edited] =>  2011-12-25T10:29:51Z
[score] =>  5.541335
)

)

)
[debug] =>  SolrObject Object
(
[rawquerystring] =>  (smsc_content:\"123 456\" || smsc_description:\"123 456\")&&  
(smsc_module:Intradesk)&&  (smsc_date_created:[2011-12-25T10:29:51Z TO NOW])&&  (smsc_ssid:38)
[querystring] =>  (smsc_content:\"123 456\" || smsc_description:\"123 456\")&&  
(smsc_module:Intradesk)&&  (smsc_date_created:[2011-12-25T10:29:51Z TO NOW])&&  (smsc_ssid:38)
[parsedquery] =>  +(smsc_content:123 smsc_content:456 
smsc_description:123 smsc_content:456) +smsc_module:intradesk 
+smsc_date_created:[2011-12-25T10:29:51Z TO 2012-01-25T13:33:21.098Z] +smsc_ssid:38
[parsedquery_toString] =>  +(smsc_content:123 smsc_content:456 
smsc_description:123 smsc_content:456) +smsc_module:intradesk 
+smsc_date_created:[2011-12-25T10:29:51 TO 2012-01-25T13:33:21.098] 
+smsc_ssid:`#8;#0;#0;#0;&
[QParser] =>  LuceneQParser
[timing] =>  SolrObject Object