Re: Solr, cURL, and Java runtime

2011-04-02 Thread Chris Hostetter
: 
: I'm experiencing something weird while trying to post updates to Solr docs
: via cURL from exec in Java runtime.  I can't figure out if this is something
: strange with Solr's update mechanism, cURL, or Java runtime oddities, so

There are so many frightening things in that sentence i don't even know 
where to begin.

please don't do what you just described.  it's making me cry.

seriously -- that's a scary, overly complicated, brittle way to try to 
talk to Solr from a Java app.  I could not bring myself to read any of 
your email past that line because of how much it disturbed me.

Please consider using SolrJ.

If you really have some reason why you don't want to use SolrJ, then at 
least use some Java-based HTTP library to talk to Solr.  The 
SimplePostTool code in Solr's code base shows a really dirt simple example 
of using pure Java to POST data to Solr.

Trying to use Runtime.exec with curl, with XML passed to curl on the 
command line, is just asking for a world of shell-escaping, non-portable 
pain ... as you've already uncovered.
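
For illustration only, here is a minimal sketch of the "pure Java" route 
using nothing but java.net.HttpURLConnection.  It is not the SimplePostTool 
code itself; the update URL, the class name, and the field names "id" and 
"text" are assumptions made up for the example.  The point is that the XML 
is escaped once, in Java, and never passes through a shell.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

public class SolrXmlPost {

    /** Escapes the five XML special characters so a value is safe to embed. */
    static String escapeXml(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': sb.append("&amp;"); break;
                case '<': sb.append("&lt;"); break;
                case '>': sb.append("&gt;"); break;
                case '"': sb.append("&quot;"); break;
                case '\'': sb.append("&apos;"); break;
                default: sb.append(c);
            }
        }
        return sb.toString();
    }

    /** Builds a one-document add message; the field names are illustrative. */
    static String buildAddXml(String id, String text) {
        return "<add><doc>"
             + "<field name=\"id\">" + escapeXml(id) + "</field>"
             + "<field name=\"text\">" + escapeXml(text) + "</field>"
             + "</doc></add>";
    }

    /** POSTs the XML body to the given update URL and returns the HTTP status. */
    static int post(String updateUrl, String xml) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) new URL(updateUrl).openConnection();
        try {
            conn.setRequestMethod("POST");
            conn.setDoOutput(true);
            conn.setRequestProperty("Content-Type", "text/xml; charset=UTF-8");
            try (OutputStream out = conn.getOutputStream()) {
                out.write(xml.getBytes(StandardCharsets.UTF_8));
            }
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        String xml = buildAddXml("doc-1", "Tom & Jerry's <best> \"quotes\"");
        // Assumed URL of a local Solr instance; adjust host, port and path.
        int status = post("http://localhost:8983/solr/update", xml);
        System.out.println("HTTP " + status);
    }
}
```

A commit message can be sent through the same post method; SolrJ remains 
the less error-prone choice.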


-Hoss


Re: Spellchecking Escaped Queries

2011-04-02 Thread Chris Hostetter

: I'm having an issue performing a spellcheck on some information and
: search of the archive isn't helping.

For this type of question, there's not much feedback anyone can offer w/o 
knowing exactly what analyzers you have configured for the various 
fieldtypes (both the field you index/search and the fieldtype used for 
spellchecking).

it's also fairly critical to know how you have the spellcheck component 
configured.

off the cuff: i'd guess that maybe WordDelimiterFilter is being used in a 
wonky way given your use case -- but like i said: would need to see the 
configs to make a guess.


-Hoss


Re: Question about http://wiki.apache.org/solr/Deduplication

2011-04-02 Thread Chris Hostetter

: Is it possible in solr to have multivalued "id"? Or I need to make my
: own "mv_ID" for this? Any ideas how to achieve this efficiently?

This isn't something the SignatureUpdateProcessor is going to be able to 
help you with -- it does the deduplication by changing the low level 
"update" (implemented as a delete then add) so that the key used to delete 
the older documents is based on the signature field instead of the id 
field.

in order to do what you are describing, you would need to query the index 
for matching signatures, then add the resulting ids to your document 
before doing that "update"

You could possibly do this in a custom UpdateProcessor, but you'd have to 
do something tricky to ensure you didn't overlook docs that had been added 
but not yet committed when checking for dups.

I don't have a good suggestion for how to do this internally in Solr -- it 
seems like the type of bulk processing logic that would be better suited 
for an external process before you ever start indexing (much like link 
analysis for back references).
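
As a rough sketch of what such an external pre-pass might look like (plain 
Java, no Solr involved): group the ids of all source documents by a 
signature of their content, so that each group can later be indexed as one 
document carrying a multivalued id list such as the "mv_ID" field from the 
question.  The class name, the use of MD5 over the raw text, and the 
in-memory map are all assumptions for the example, not what 
SignatureUpdateProcessor does.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SignatureGrouper {

    /** Hex MD5 of the text; a stand-in for whatever signature you really use. */
    static String signature(String text) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            byte[] digest = md.digest(text.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    /** Maps each signature to the ids of all documents that share it. */
    static Map<String, List<String>> groupIdsBySignature(Map<String, String> textById) {
        Map<String, List<String>> idsBySignature = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : textById.entrySet()) {
            idsBySignature
                .computeIfAbsent(signature(e.getValue()), k -> new ArrayList<>())
                .add(e.getKey());
        }
        return idsBySignature;
    }

    public static void main(String[] args) {
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("a", "same body");
        docs.put("b", "same body");
        docs.put("c", "different body");
        // Each printed group would become one indexed document whose
        // multivalued id field lists every id in the group.
        for (List<String> ids : groupIdsBySignature(docs).values()) {
            System.out.println(ids);
        }
    }
}
```

Because the grouping happens before anything is sent to Solr, the 
"added but not yet committed" problem described above does not arise.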

-Hoss


Multiple Words in String

2011-04-02 Thread Chris Fauerbach
Good afternoon everyone!
I am stumped, and I would love some help.  I'm new to Solr/Lucene,
but I have thrown myself into it, so I think I have a solid
understanding.  Using the analysis tool in the admin interface, I see
these words stemmed and processed as I assume they would be, so I'm
stuck.

In my index, I have two documents, each with a text field, and here
are example values

1) microsoft.com
2) micro soft

I want to do a search using microsoft or "micro soft" and find both.
I'm using the dismax interface, the fields are properly listed in the
config, and I can find both records, but never at the same time.
Here's my schema.xml for my text field, any thoughts on what I can do
to find these together?

[schema.xml fieldType definition not preserved in the archive]


admin/index.jsp double submit on IE

2011-04-02 Thread Jeffrey Chang
Hi,

I noticed /admin/index.jsp could issue a double submit on IE causing Jetty
to error out.

Fixed by modifying the JavaScript submit call in index.jsp to return false:

... queryForm.submit(); return false; ...

Not sure if I should log a defect for this or not.

- Jeff


Using EmbeddedSolrServer with static documents

2011-04-02 Thread michael.i
Hi,
I am new to solr so please excuse me if my question sounds basic.

I would like to use the EmbeddedSolrServer.
It happens that all examples I've found on the web use documents that have
been generated dynamically such as:


SolrServer solrServer = new EmbeddedSolrServer(container, "core");
SolrInputDocument doc = new SolrInputDocument();
doc.addField("docText", "This is a sample file");
solrServer.add(doc);
solrServer.commit();


I would like to be able to load a document that is stored on the filesystem.
Ideally, I would have liked to do something such as:
SolrInputDocument doc = new SolrInputDocument("path/myDoc.txt");
solrServer.add(doc);
solrServer.commit();

It does not seem possible to do such a thing. Am I missing something? Are
there some best practices with regards to referring to a document on the
filesystem?

Thanx!
Michael.
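
One possible approach, sketched here under assumptions rather than as an 
established best practice: read the file into a String with the standard 
library and pass that String to addField, exactly as in the dynamic 
example above.  The sketch assumes the file is plain UTF-8 text; the class 
and method names are made up, and the SolrJ calls are left as comments so 
the snippet compiles without SolrJ on the classpath.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class FileFieldLoader {

    /** Reads the whole file as UTF-8 text. */
    static String readText(Path path) throws IOException {
        return new String(Files.readAllBytes(path), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Path taken from the question; replace with a real file.
        String text = readText(Paths.get("path/myDoc.txt"));
        // With SolrJ available, the text would then be indexed the same
        // way as the hard-coded string in the original example:
        //   SolrInputDocument doc = new SolrInputDocument();
        //   doc.addField("docText", text);
        //   solrServer.add(doc);
        //   solrServer.commit();
        System.out.println(text.length() + " characters read");
    }
}
```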




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Using-EmbeddedSolrServer-with-static-documents-tp2767614p2767614.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Multiple terms in query

2011-04-02 Thread Ahmet Arslan
> When I search multiple terms in a Solr query, I get all the results
> containing either of the terms, but the results which contain both
> terms are outranked by the others. For example:
> 
> Panasonic Camera
> 
> Most of the top results are those which contain "Panasonic", and they
> are accessories rather than cameras. But when I search
> Panasonic Camera^16 I get what I want.
> 
> What's the correct method of doing this? That is, when multiple terms
> are searched, how do I make sure records containing all the terms are
> shown first in Solr?

Try Jan's trick using function queries (map and query):

http://search-lucene.com/m/nK6t9j1fuc2/

http://wiki.apache.org/solr/FunctionQuery


Re: Multiple terms in query

2011-04-02 Thread lboutros
You could turn on the debug mode; there is a part which explains the scoring
of the query. It is a bit tricky but that could help.

Could you paste your query (full URL) and the field definition in your
schema, please?
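
A small sketch of building such a request from Java, assuming the usual 
debugQuery=on request parameter and a local Solr at 
http://localhost:8983/solr (the base URL, class name and method name are 
assumptions for the example):

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

public class DebugQueryUrl {

    /** Builds a select URL for the query with debugQuery=on appended. */
    static String build(String baseUrl, String query) {
        try {
            return baseUrl + "/select?q=" + URLEncoder.encode(query, "UTF-8")
                 + "&debugQuery=on";
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always supported by the JVM, so this should not happen.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Assumed base URL of a local Solr instance.
        System.out.println(build("http://localhost:8983/solr", "Panasonic Camera"));
    }
}
```

Opening the resulting URL in a browser shows the score explanation for 
each hit in the debug section of the response.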

Ludovic.

-
Jouve
France.