Re: Solr HTTP Replication Question

2013-01-24 Thread Amit Nithian
Okay, so after some debugging I found the problem. The replication
piece will download the index from the master server and move the files to
the index directory, but during the commit phase these "older" generation
files are deleted and the index is essentially left intact.

I noticed that a full copy is needed if the index is "stale" (meaning that
files in common between the master and slave have different sizes), but I
think a full copy should also be needed if the slave's generation is higher
than the master's. In short, to me it's not sufficient to
simply say a full copy is needed if the slave's index version is >=
the master's index version. I'll create a patch and file a bug along with a
more thorough writeup of how I got into this state.
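
In pseudo-code, the check I have in mind is roughly this (a sketch only; the
names are illustrative, not the actual replication code):

static boolean isFullCopyNeeded(long masterVersion, long masterGeneration,
                                long slaveVersion, long slaveGeneration) {
    // current check: slave's index version is the same as or newer than the master's
    boolean versionSameOrNewer = slaveVersion >= masterVersion;
    // proposed additional check: slave's generation is ahead of the master's
    boolean generationAhead = slaveGeneration > masterGeneration;
    return versionSameOrNewer || generationAhead;
}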

Thanks!
Amit



On Thu, Jan 24, 2013 at 2:33 PM, Amit Nithian  wrote:

> Does Solr's replication look at the generation difference between master
> and slave when determining whether or not to replicate?
>
> To be more clear:
> What happens if a slave's generation is higher than the master yet the
> slave's index version is less than the master's index version?
>
> I looked at the source and didn't seem to see any reason why the
> generation matters other than fetching the file list from the master for a
> given generation. It's too wordy to explain how this happened so I'll go
> into details on that if anyone cares.
>
> Thanks!
> Amit
>


RE: SOLR 4 getting stuck during restart

2013-01-24 Thread vijeshnair
Thanks James for the heads up, and apologies for a delayed response. Here are the
full details about this issue. Mine is an e-com app, so the index contains
the product catalog comprising roughly 13 million products. At this point I
thought of using the index-based dictionary as the best option for the "Did
You Mean" functionality. I am not sure if everyone is facing this issue, but
here is what I am observing as far as the dictionary is concerned. 

Index based dictionary

- I was building the dictionary using the following url, once I completed
the full indexing (see the sketch after this list). For the time being I have
intentionally kept the buildOnCommit and buildOnOptimize options set to false,
as I didn't want them to slow down the full indexing.

http://localhost:8090/solr/select?rows=0&spellcheck=true&spellcheck.build=true&spellcheck.dictionary=jarowinkler
 

- Once I created the dictionary, when I tried to restart my Tomcat I hit
the issue I described before (I waited for around 20 minutes and the
restart didn't happen).
- When I removed the dictionary from the "data" folder, the server restart
started working.
- I have tried spellcheck.collation=false as you suggested, but it
didn't help.
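
For reference, a dictionary like the "jarowinkler" one above comes from a
spellchecker definition along these lines (a sketch; the field name and index
directory are assumptions, the parameter names are the standard ones):

<searchComponent name="spellcheck" class="solr.SpellCheckComponent">
  <lst name="spellchecker">
    <str name="name">jarowinkler</str>
    <str name="field">spell</str>
    <str name="classname">solr.IndexBasedSpellChecker</str>
    <str name="distanceMeasure">org.apache.lucene.search.spell.JaroWinklerDistance</str>
    <str name="spellcheckIndexDir">./spellchecker</str>
    <str name="buildOnCommit">false</str>
    <str name="buildOnOptimize">false</str>
  </lst>
</searchComponent>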

Direct Spell Checker

I have experimented with the new "DirectSolrSpellChecker", which does not
create a separate dictionary folder but rather builds the spellchecker from the
main index itself. The results were exactly the same as before: I was getting
stuck during restarts. I think the traditional spellchecker would be
better in this case, as you can remove the dictionary, restart, and move it back
as and when required. Since DirectSolrSpellChecker doesn't
create a separate dictionary folder, I am not sure what to remove from the
index so that the server can restart.

James, could you please validate this? It would be a great help
if you could point out any mistakes I am making here. If you think what I am
doing makes sense, I will go ahead and log this bug in JIRA.

Thanks
Vijesh K Nair



--
View this message in context: 
http://lucene.472066.n3.nabble.com/SOLR-4-getting-stuck-during-restart-tp4034734p4036163.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Get tokenized words in Solr Response

2013-01-24 Thread Romita Saha
Hi Mikhail,

Thanks for your guidance. I found the required information in 
debugQuery=on.

Thanks and regards,
Romita 


From:   Mikhail Khludnev 
To: solr-user , 
Date:   01/24/2013 03:19 PM
Subject:Re: Get tokenized words in Solr Response



Romita,

IIRC you've already asked this, and I replied that everything you need
is in the debugQuery=on output. That format is a little bit verbose, and I
suppose you may have some difficulty finding the necessary info
there. Please provide the debugQuery=on output and I can try to highlight the
necessary info for you.
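
For example, a request like this (core name and query are hypothetical) returns
a "debug" section whose "parsedquery" entry lists the analyzed terms:

http://localhost:8983/solr/collection1/select?q=Search+this+document+named+XYZ-123&debugQuery=on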


On Thu, Jan 24, 2013 at 6:11 AM, Romita Saha
wrote:

> Hi,
>
> I want the tokenized keywords to be displayed in the Solr response. For
> example, my Solr search could be "Search this document named XYZ-123", and
> the tokenizer in schema.xml tokenizes the query as follows:
> "search document xyz 123". I want to get these tokenized words in the
> Solr response. Is it possible?
>
> Thanks and regards,
> Romita




-- 
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics


 



Re: JSON query syntax

2013-01-24 Thread Yonik Seeley
On Thu, Jan 24, 2013 at 8:55 PM, Otis Gospodnetic
 wrote:
> Yes, this is JSON, so right
> there it may be better, but for instance I see "v" here which to a regular
> human may not be as nice as "value" if that is what "v" stands for.

One goal was to reuse the parsers/parameter names.  A completely
disjoint set would certainly lead to confusion.
Concise *common* abbreviations are fine I think - for example we
quickly get used to (and prefer) f(x) over function(variable1)

We could add some aliases though.

-Yonik
http://lucidworks.com


Re: JSON query syntax

2013-01-24 Thread Otis Gospodnetic
Nice, Yonik!
Here is one suggestion. OK, I'm begging you - please don't make
it as hard on the eyes as Local Params. :)  I thought it was just me who
could never get along with Local Params, but I've learned that a number of
people find Local Params very hard to grok.  Yes, this is JSON, so right
there it may be better, but for instance I see "v" here which to a regular
human may not be as nice as "value" if that is what "v" stands for.
Looking at examples from the JIRA issue

{'frange':{'v':'mul(foo_i,2)', 'l':20, 'u':24}}


v is value?

mul is multiply?

what's "l"? left? No, low(er)?

what's "u"? Aha, upper?


I'd rather use a few extra characters and be clear, easily memorable, and
user friendly.  People love ES's JSON API and I have never ever heard
anyone say it's too verbose.

Thanks,
Otis





On Thu, Jan 24, 2013 at 8:44 PM, Yonik Seeley  wrote:

> Although "lucene" syntax tends to be quite concise, nice looking, and
> easy to build by hand (the web browser is a major debugging tool for
> me), some people prefer to use a more "structured" query language
> that's easier to build up programmatically.  XML fits the bill, but
> people tend to prefer JSON these days.
>
> Hence my first quick prototype:
> https://issues.apache.org/jira/browse/SOLR-4351
>
> I'm pretty happy so far with how easily it's fit in with our QParser
> framework, which should generally allow parsers to not care about the
> underlying syntax of queries they need to deal with.
> For example: the "join" qparser uses the query specified by "v", but
> doesn't care if it's in lucene syntax, or if it was part of the JSON.
>
> {'join':{'from':'qqq_s', 'to':'www_s', 'v':'id:10'}}
> {'join':{'from':'qqq_s', 'to':'www_s', 'v':{'term':{'id':'10'}}}}
>
> Note: replace the single quotes with double quotes before trying it
> out - these are just test strings that have the replacement done in
> the test code so that they are easier to read.
>
> There's a fair bit left to do of course... like how to deal with
> "boost", "cache", "cost", parameter dereferencing, etc.
> Feedback welcome... and hopefully this will be good to go for 4.2
>
> -Yonik
> http://lucidworks.com
>


JSON query syntax

2013-01-24 Thread Yonik Seeley
Although "lucene" syntax tends to be quite concise, nice looking, and
easy to build by hand (the web browser is a major debugging tool for
me), some people prefer to use a more "structured" query language
that's easier to build up programmatically.  XML fits the bill, but
people tend to prefer JSON these days.

Hence my first quick prototype: https://issues.apache.org/jira/browse/SOLR-4351

I'm pretty happy so far with how easily it's fit in with our QParser
framework, which should generally allow parsers to not care about the
underlying syntax of queries they need to deal with.
For example: the "join" qparser uses the query specified by "v", but
doesn't care if it's in lucene syntax, or if it was part of the JSON.

{'join':{'from':'qqq_s', 'to':'www_s', 'v':'id:10'}}
{'join':{'from':'qqq_s', 'to':'www_s', 'v':{'term':{'id':'10'}}}}

Note: replace the single quotes with double quotes before trying it
out - these are just test strings that have the replacement done in
the test code so that they are easier to read.
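
For instance, with the quotes replaced, the second example reads:

{"join":{"from":"qqq_s", "to":"www_s", "v":{"term":{"id":"10"}}}}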

There's a fair bit left to do of course... like how to deal with
"boost", "cache", "cost", parameter dereferencing, etc.
Feedback welcome... and hopefully this will be good to go for 4.2

-Yonik
http://lucidworks.com


RE: solr parsed query dropping special chars

2013-01-24 Thread Tegelberg, Allan
Thanks for the education, Chris.
I pasted the chars into the Index and Query fields on the Analysis panel.

The Index and Query analyzers are almost the same.
On both, the non-Greek characters drop out after WordDelimiterFilter.
The Index analyzer shows a grey background on the words that seem to make it
through all the filters.

WhitespaceTokenizerFactory <-  ∠ ψ Σ • ≤ ≠ • ≥ μ ω φ θ ¢ β √ Ω ° ± Δ #  
SynonymFilterFactory (query only) <- ditto
StopFilterFactory<- ditto
WordDelimiterFilterFactory  <- ψ Σ μ ω φ θ β Ω Δ  now only greeks
LowerCaseFilterFactory  <- ψ σ μ ω φ θ β ω δ  lower case Greeks only
SnowballPorterFilterFactory <- ψ σ μ ω φ θ β ω δ

so I'm thinking I need to change the WordDelimiterFilter properties
{catenateWords=0, catenateNumbers=0, splitOnCaseChange=1, catenateAll=0,
generateNumberParts=1, generateWordParts=1, splitOnNumerics=0}

or copy these strings into a different field name/type without the word delimiter
filter, so that I wouldn't affect the way existing text is being searched.
Sound right?
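
For the second option, a sketch of such a field type (the names here are mine,
not from this thread; preserveOriginal="1" should keep the unsplit token, so
symbols like ∠ survive analysis):

<fieldType name="text_symbols" class="solr.TextField">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- keep the original token alongside any generated parts -->
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="1" generateNumberParts="1"
            splitOnCaseChange="1" splitOnNumerics="0"
            catenateWords="0" catenateNumbers="0" catenateAll="0"
            preserveOriginal="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>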

Allan Tegelberg





-Original Message-
From: Chris Hostetter [mailto:hossman_luc...@fucit.org] 
Sent: Thursday, January 24, 2013 3:46 PM
To: solr-user@lucene.apache.org
Subject: Re: solr parsed query dropping special chars

: When I search for these characters in the admin query, I can only find the 
Greeks.
: debug shows the parsed query only has greek chars like omega, delta, sigma
: but does not contain others like degree, angle, cent, bullet, less_equal…

this is most likely because of the analyzer you are using for your text field, 
an assumption which can be verified using the Analysis tool in the admin UI to 
see how the various pieces of your query analyzer deal with the input.

My guess is you are using a tokenizer which ignores punctuation.

Don't forget to check your index analyzer as well -- you may not even be 
indexing these punctuation symbols either...

: the response dumps the document and  shows me the chars exist in the 
document..
: angle (∠)

...that's the stored value, the *indexed* text may not contain those terms.


-Hoss


Re: Search strategy - improving search quality for short search terms such as "doll"

2013-01-24 Thread Chris Hostetter

: My next target is searches on simple terms such as "doll" which, in google,
: would return documents about, well, "toy dolls", because that's the most
: common usage of the simple term "doll". But in my index it predominantly
: returns documents about CDs with the song "Doll Face", and "My baby doll" in
: them.

if you have good metadata about your documents, then you might get 
satisfying results using something like the edismax parser with appropriate 
weights on various fields -- you could for example say that "matching 
on the product_title field is important, but matching on a category_name 
is much more important" and thus use something like...

q=doll&qf=product_title^5+category_name^50

..but that only helps you if you have category_name values that match the 
words people are searching for like "Doll"

This type of approach doesn't help you in the case where you might have the 
inverse problem: document (category_name="doll", product_name="My baby") 
showing up first when a user searches for "my baby doll" but the user is 
really trying to find the document (category_name=cd, product_name="my 
baby doll")

it really all depends on your user base and the type of queries you 
expect.

An interesting solution to this problem that I've seen is to pre-process 
the query using a Bayesian classifier to suggest which categories to boost 
on.

Here's a blog on this where the classifier was trained based on the 
keywords & categories of the documents...

http://engineering.wayfair.com/better-lucenesolr-searches-with-a-boost-from-an-external-naive-bayes-classifier/

...but you could also train the classifier using query logs and data about 
what documents users ultimately clicked on (to help you learn that for 
your userbase, people who search for "baby" are typically looking for CDs 
not dolls -- or vice versa)
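
For example, the classifier's suggestion could be applied as a standard
(e)dismax boost query (the category value here is hypothetical):

q=baby+doll&defType=edismax&qf=product_title^5+category_name^50&bq=category_name:cd^20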


: 
:  
: 
: I'm not directly asking how to solve this as much as I'm asking what
: direction I should be looking in to learn what I need to know to tackle the
: general issue myself.
: 
:  
: 
: Left on my own I would start looking at categorizing the CD's into a facet
: called "music", reasonably doable in my dataset. Then I need to reduce the
: boost-value of the entire facet/category of music unless certain pre-defined
: query terms exist, such as [music, cd, song, listen, dvd, , etc.]. 
: 
:  
: 
: I don't yet know how to do all of this, but after a couple more good books I
: should be "dangerous".
: 
:  
: 
: So the question to this list:
: 
:  
: 
: -  Am I on the right track here?  If not, can you point me in a
: direction to go?
: 
:  
: 
:  
: 
: 

-Hoss


Re: Solr load balancer

2013-01-24 Thread Chris Hostetter

: For example perhaps a load balancer that sends multiple queries 
: concurrently to all/some replicas and only keeps the first response 
: might be effective. Or maybe a load balancer which takes account of the 

I know of other distributed query systems that use this approach when 
query speed is more important to people than load, and people who use them 
seem to think it works well.

given that it synthetically multiplies the load of each end user request, 
it's probably not something we'd want to turn on by default, but a 
configurable option certainly seems like it might be handy.
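
As a sketch of the idea using SolrJ (replica URLs are hypothetical, and this is
not how the current LBHttpSolrServer behaves -- it fails over sequentially
rather than racing replicas):

import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

import org.apache.solr.client.solrj.SolrQuery;
import org.apache.solr.client.solrj.impl.HttpSolrServer;
import org.apache.solr.client.solrj.response.QueryResponse;

public class FirstResponseWins {
  // Send the same query to every replica concurrently; keep the first answer.
  public static QueryResponse query(List<String> replicaUrls, final SolrQuery q)
      throws Exception {
    ExecutorService pool = Executors.newFixedThreadPool(replicaUrls.size());
    try {
      List<Callable<QueryResponse>> tasks = new ArrayList<Callable<QueryResponse>>();
      for (final String url : replicaUrls) {
        tasks.add(new Callable<QueryResponse>() {
          public QueryResponse call() throws Exception {
            return new HttpSolrServer(url).query(q);
          }
        });
      }
      // invokeAny blocks until one task completes successfully and cancels
      // the rest - i.e. only the first response is kept.
      return pool.invokeAny(tasks);
    } finally {
      pool.shutdownNow();
    }
  }
}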


-Hoss


Re: solr parsed query dropping special chars

2013-01-24 Thread Chris Hostetter
: When I search for these characters in the admin query, I can only find the 
Greeks.
: debug shows the parsed query only has greek chars like omega, delta, sigma
: but does not contain others like degree, angle, cent, bullet, less_equal…

this is most likely because of the analyzer you are using for your text 
field, an assumption which can be verified using the Analysis tool in the 
admin UI to see how the various pieces of your query analyzer deal with the 
input.

My guess is you are using a tokenizer which ignores punctuation.

Don't forget to check your index analyzer as well -- you may not even be 
indexing these punctuation symbols either...

: the response dumps the document and  shows me the chars exist in the 
document..
: angle (∠)

...that's the stored value, the *indexed* text may not contain those 
terms.


-Hoss

Re: Submit schema definition using curl via SOLR

2013-01-24 Thread Mark Miller

On Jan 24, 2013, at 5:22 PM, Fadi Mohsen  wrote:
> 
> The reasons we would like to avoid ZooKeeper are:
> * due to lack of knowledge.
> * the amount of work/scripting for developers per module and release
> documentation.
> * the extra steps of patching ZK nodes for QA and operations.
> 
> ZkCLI is a nice tool, but then instead of interacting with one service over
> HTTP, the application needs:
> * extra jar files

We should address this, I think - it really shouldn't require any more than the 
SolrJ jars. Currently it also requires the core jars. Still not as minimal as 
just curl posting, I know.

Testing and reporting on the issue I posted, as well as discussion around 
expanding it, will likely help push those features forward.

- Mark



RE: Sorting on Score Problem

2013-01-24 Thread Kuai, Ben
Hi Hoss

Thanks for the reply. 

Unfortunately we have other customized similarity classes, and I don't know how 
to disable them and still make the query work. 

I will attach more information once I work out how to simplify the issue.

Thanks
Ben

From: Chris Hostetter [hossman_luc...@fucit.org]
Sent: Thursday, January 24, 2013 12:34 PM
To: solr-user@lucene.apache.org
Subject: Re: Sorting on Score Problem

: We met a weird problem in our project when sorting by score in Solr 4.0:
: the document with the biggest score is not at the top. The debug explanation
: from solr looks like this,

that's weird ... can you post the full debugQuery output of an example
query showing the problem, using "echoParams=all" & "fl=id,score" (or
whatever unique key field you have)

also: can you elaborate on whether you are using a single node setup or a
distributed (ie: SolrCloud) query?

: Then we thought it could be a float rounding problem, so we implemented
: our own similarity class to increase queryNorm by 10,000; it changes
: the score scale but the rank is still wrong.

when you post the detailed request above, please don't use your custom
similarity (just the out of the box solr code) so there's one less
variable in the equation.


-Hoss


AW: Does solr 4.1 support field compression?

2013-01-24 Thread André Widhani
These are the figures I got after indexing 4 and half million documents with 
both Solr 3.6.1 and 4.1.0 (and optimizing the index at the end).

  $ du -h --max-depth=1
  67G   ./solr410
  80G   ./solr361

Main contributor to the reduced space consumption is (as expected I guess) the 
.fdt file:

  $ ls -lh solr361/*/*/*.fdt
  29G solr361/core-tex68bohyrh23qs192adaq-index361/index/_bab.fdt

  $ ls -lh solr410/*/*/*.fdt
  18G solr410/core-tex68bohyz1teef3xsjdaw-index410/index/_23uy.fdt

Depends of course on your individual ratio of stored versus indexed-only fields.

André


Von: Shawn Heisey [s...@elyograg.org]
Gesendet: Donnerstag, 24. Januar 2013 16:58
An: solr-user@lucene.apache.org
Betreff: Re: Does solr 4.1 support field compression?

On 1/24/2013 8:42 AM, Ken Prows wrote:
> I didn't see any mention of field compression in the release notes for
> Solr 4.1. Did the ability to automatically compress fields end up
> getting added to this release?

The concept of compressed fields (an option in schema.xml) that existed
in the 1.x versions of Solr (based on Lucene 2.9) was removed in Lucene
3.0.  Because Lucene and Solr development were combined, the Solr
version after 1.4.1 is 3.1.0; there is no 1.5 or 2.x version of Solr.

Solr/Lucene 4.1 compresses all stored field data by default.  I don't
think there's a way to turn it off at the moment, which is causing
performance problems for a small subset of Solr users.  When it comes
out, Solr 4.2 will also have compressed term vectors.

The release note contains this text:

Stored fields are compressed. (See
http://blog.jpountz.net/post/33247161884/efficient-compressed-stored-fields-with-lucene)

It looks like the solr CHANGES.txt file fails to specifically mention
LUCENE-4226  which
implemented compressed stored fields.

Thanks,
Shawn



Re: Submit schema definition using curl via SOLR

2013-01-24 Thread Fadi Mohsen
Thanks Per, would the first approach involve restarting Solr?

Thanks Mark, that's great. I'll try to check out and apply the patches from the
ticket to understand further.
The reasons we would like to avoid ZooKeeper are:
 * due to lack of knowledge.
 * the amount of work/scripting for developers per module and release
documentation.
 * the extra steps of patching ZK nodes for QA and operations.

ZkCLI is a nice tool, but then instead of interacting with one service over
HTTP, the application needs:
 * extra jar files
 * to know the ZK hostname/IP and port (different in each
dev/qa/systest/accept/production environment), which is, per module, one
configuration step too many.


On Thu, Jan 24, 2013 at 7:18 PM, Mark Miller  wrote:

>
> On Jan 24, 2013, at 10:02 AM, Fadi Mohsen  wrote:
>
> > Hi, We would like to use Solr to index statistics from any Java module in
> > our production environment.
> >
> > Applications have to be able to create collections and index data on demand, so
> my
> > initial thought is to use different HTTP methods to accomplish a
> collection
> > in cluster and then right away start HTTP POST documents, but the issue
> > here is the schema.xml.
> > Is it possible to HTTP POST the schema via Solr to Zookeeper?
>
> I've done some work towards this at
> https://issues.apache.org/jira/browse/SOLR-4193
>
> >
> > Or do I have to know about other service host/IP than SOLR, such as
> > ZooKeeper (wanted to understand whether there is a way to avoid knowing
> > about zookeeper in production.)?
>
> I wouldn't try to avoid it - it's probably simpler to deal with than you
> think.
>
> It's also pretty easy to use
> http://wiki.apache.org/solr/SolrCloud#Command_Line_Util to upload a new
> schema.xml - then just Collections API reload command. Two lines in a
> script.
>
> - Mark
>
>



indexVersion returns multiple results when called

2013-01-24 Thread davidq
Hi,

We have 5 core masters and 5 core slaves. The main core houses about 85,000
documents, so it is small, although the content of each document is quite large.
The second core holds the same number of docs but far less - and different -
data.

We reindex all cores every morning and the replication poll is 5 minutes.
The main core takes 15 minutes to reindex (optimize). At some point, an
incomplete index is picked up by the slave and our web site disappears until
the optimize takes place. I know we could increase the poll to 30 minutes
but that would be no guarantee.

We thought we'd solve it by writing a script to get the indexversion, kick off
reindexing and periodically check the current indexversion against the first
- if the same, sleep for 2 minutes and then check again. Once they're
different, do a fetchIndex from the slave.
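
In outline, the script looks like this (a sketch; hosts and core name are
placeholders, indexversion and fetchindex are the replication handler commands):

BEFORE=$(curl -s 'http://master:8983/solr/main/replication?command=indexversion')
# ... kick off reindexing on the master ...
AFTER=$BEFORE
while [ "$AFTER" = "$BEFORE" ]; do
  sleep 120  # sleep for 2 minutes, then check again
  AFTER=$(curl -s 'http://master:8983/solr/main/replication?command=indexversion')
done
# the version changed, so tell the slave to pull the new index
curl -s 'http://slave:8983/solr/main/replication?command=fetchindex'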

Works on all the cores except the main one. We get a different indexversion
after two minutes, the slave gets populated with an almost empty index and
the site is out!

All the other cores exhibit the same indexversion. What have we
misunderstood or got wrong?

Regards,

David Q




--
View this message in context: 
http://lucene.472066.n3.nabble.com/indexVersion-returns-multiple-results-when-called-tp4036046.html
Sent from the Solr - User mailing list archive at Nabble.com.


PK uniqueness aware Solr index merging?

2013-01-24 Thread Gregg Donovan
We have a Hadoop process that produces a set of Solr indexes from a cluster
of HBase documents. After the job runs, we pull the indexes from HDFS and
merge them together locally. The issue we're running into is that
sometimes we'll have duplicate occurrences of a primary key across indexes
that we'll want merged out. For example, a set of directories with:

./dir00/
doc_id=0
PK=1

./dir01/
doc_id=0
PK=1

should merge into a Solr index containing a single document rather than one
with two Lucene documents each containing PK=1.

The Lucene-level merge code -- i.e. oal.index.SegmentMerger.merge()--
doesn't know about the Solr schema, so it will merge these two directories
into two duplicate documents. It doesn't appear that either Solr's
oas.handler.admin.CoreAdminHandler.handleMergeAction(SolrQueryRequest,
SolrQueryResponse) handles this either, as it ends up passing the list of
merge directories to oal.index.IndexWriter.addIndexes(IndexReader...) via
oas.update.DirectUpdateHandler2.mergeIndexes(MergeIndexesCommand).

So, if I want to merge multiple Solr directories in a way that respects
primary key uniqueness, is there any more efficient manner than re-adding
all of the documents in each directory to a new Solr index to avoid PK
duplicates?

Thanks.

--Gregg

Gregg Donovan
Senior Software Engineer, Etsy.com
gr...@etsy.com


Re: Solr SQL Express Integrated Security - Unable to execute query

2013-01-24 Thread O. Olson
Michael Della Bitta-2 wrote
> On Thu, Jan 24, 2013 at 11:34 AM, O. Olson <

> olson_ord@

> > wrote:
>>
>> Caused by: com.microsoft.sqlserver.jdbc.SQLServerException: The server
>> SQLEXPRESS is not configured to listen with TCP/IP.
> 
> 
> That's probably your problem...
> 
> 
> Michael Della Bitta
> 
> 
> Appinions
> 18 East 41st Street, 2nd Floor
> New York, NY 10017-6271
> 
> www.appinions.com
> 
> Where Influence Isn’t a Game


Good call Michael. I did have to enable TCP
(http://msdn.microsoft.com/en-us/library/hh231672.aspx for others who have
the same problem), but I still did not get this to work. 

I then tested my driver, JDBC URL & SQL query in a plain old Java class.
This showed me that it was almost impossible to get integrated
authentication to work in Java. I finally went with specifying the username
and password literally. (I hope this is useful to others):


import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;

public class SqlExpressTest {
    public static void main(String[] args) throws Exception {
        String url =
            "jdbc:sqlserver://localhost\\SQLEXPRESS;database=Amazon;user=solrusr;password=solrusr;";
        String driver = "com.microsoft.sqlserver.jdbc.SQLServerDriver";
        Connection connection = null;
        try {
            System.out.println("Loading driver...");
            Class.forName(driver);
            System.out.println("Driver loaded! Attempting Connection ...");
            connection = DriverManager.getConnection(url);
            System.out.println("Connection succeeded!");
            // print every row so we can see the query really works
            ResultSet rs = connection.createStatement().executeQuery(
                "SELECT ProdID, Descr FROM Table_Temp");
            try {
                while (rs.next()) {
                    System.out.println(rs.getString(1) + "  " + rs.getString(2));
                }
            } finally {
                rs.close();
            }
            // Success.
        } catch (SQLException e) {
            // swallowed in this quick test
        } finally {
            if (connection != null) try { connection.close(); } catch (SQLException ignore) {}
        }
    }
}

Hence, I modified my db-data-config.xml to use the same connection URL, roughly:

<dataConfig>
  <dataSource driver="com.microsoft.sqlserver.jdbc.SQLServerDriver"
              url="jdbc:sqlserver://localhost\SQLEXPRESS;database=Amazon;user=solrusr;password=solrusr;"/>
  <document>
    <entity name="Product" query="SELECT ProdID, Descr FROM Table_Temp">
      ...
    </entity>
  </document>
</dataConfig>

This worked for me.

Thanks again Michael & Shawn.
O. O.










--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-SQL-Express-Integrated-Security-Unable-to-execute-query-tp4035758p4036056.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Deletion from database

2013-01-24 Thread Dyer, James
This post on stackoverflow has a good run-down on your options:
http://stackoverflow.com/questions/1555610/solr-dih-how-to-handle-deleted-documents/1557604#1557604

If you're using DIH, you can get more information from: 
http://wiki.apache.org/solr/DataImportHandler

The easiest thing, if using a delta import, is to add "deletedPkQuery" on your 
entity, like this:
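
For example (table and column names here are just placeholders):

<entity name="item" pk="ID"
        query="SELECT * FROM item"
        deltaQuery="SELECT ID FROM item WHERE last_modified > '${dataimporter.last_index_time}'"
        deletedPkQuery="SELECT ID FROM item WHERE deleted = 'true'">
  ...
</entity>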


Another approach is to have a second top-level entity that uses the special 
command:
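
For example, using the special $deleteDocById variable (again with placeholder
names; how you quote the alias depends on your database):

<entity name="deleted_items"
        query="SELECT ID AS '$deleteDocById' FROM item WHERE deleted = 'true'"/>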


This second approach works if you use DIH but do delta updates using the 
approach described here: 
http://wiki.apache.org/solr/DataImportHandlerFaq#fullimportdelta

James Dyer
E-Commerce Systems
Ingram Content Group
(615) 213-4311


-Original Message-
From: hassancrowdc [mailto:hassancrowdc...@gmail.com] 
Sent: Thursday, January 24, 2013 12:19 PM
To: solr-user@lucene.apache.org
Subject: Re: Deletion from database

OK, how can I issue a delete for each item deleted since the last successful
update? Do I write something like a delete query with the delta import query in
dataconfig? If so, what will I add in dataconfig for deletion? 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Deletion-from-database-tp4036018p4036026.html
Sent from the Solr - User mailing list archive at Nabble.com.




Re: Deletion from database

2013-01-24 Thread hassancrowdc
OK, how can I issue a delete for each item deleted since the last successful
update? Do I write something like a delete query with the delta import query in
dataconfig? If so, what will I add in dataconfig for deletion? 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Deletion-from-database-tp4036018p4036026.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Submit schema definition using curl via SOLR

2013-01-24 Thread Mark Miller

On Jan 24, 2013, at 10:02 AM, Fadi Mohsen  wrote:

> Hi, We would like to use Solr to index statistics from any Java module in
> our production environment.
> 
> Applications have to be able to create collections and index data on demand, so my
> initial thought is to use different HTTP methods to accomplish a collection
> in cluster and then right away start HTTP POST documents, but the issue
> here is the schema.xml.
> Is it possible to HTTP POST the schema via Solr to Zookeeper?

I've done some work towards this at 
https://issues.apache.org/jira/browse/SOLR-4193

> 
> Or do I have to know about other service host/IP than SOLR, such as
> ZooKeeper (wanted to understand whether there is a way to avoid knowing
> about zookeeper in production.)?

I wouldn't try to avoid it - it's probably simpler to deal with than you think.

It's also pretty easy to use 
http://wiki.apache.org/solr/SolrCloud#Command_Line_Util to upload a new 
schema.xml - then just Collections API reload command. Two lines in a script.
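
Something like (paths, host and names are placeholders):

sh example/cloud-scripts/zkcli.sh -zkhost localhost:2181 -cmd upconfig -confdir ./conf -confname myconf
curl 'http://localhost:8983/solr/admin/collections?action=RELOAD&name=collection1'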

- Mark



Re: Solr SQL Express Integrated Security - Unable to execute query

2013-01-24 Thread Michael Della Bitta
On Thu, Jan 24, 2013 at 11:34 AM, O. Olson  wrote:
>
> Caused by: com.microsoft.sqlserver.jdbc.SQLServerException: The server
> SQLEXPRESS is not configured to listen with TCP/IP.


That's probably your problem...


Michael Della Bitta


Appinions
18 East 41st Street, 2nd Floor
New York, NY 10017-6271

www.appinions.com

Where Influence Isn’t a Game


Re: Solr 4.1.0 shardHandlerFactory Null Pointer Exception when setting up embedded solrj solr server for unit testing

2013-01-24 Thread Mark Miller
This is my fault - I discovered this myself a few days ago. I've been meaning 
to file a jira ticket and have not gotten around to it yet.

You can also work around it like this:

CoreContainer container = new CoreContainer(loader) {
  // workaround since we don't call container#load
  {initShardHandler(null);}
};

- Mark

On Jan 24, 2013, at 9:22 AM, Ted Merchant  wrote:

> We recently updated from Solr 4.0.0 to Solr 4.1.0.  Because of the change we 
> were forced to upgrade a custom query parser.  While the code change itself 
> was minimal, we found that our unit tests stopped working because of a 
> NullPointerException on line 181 of handler.component.SearchHandler:
> ShardHandler shardHandler1 = shardHandlerFactory.getShardHandler();
> We determined that the cause of this exception was that shardHandlerFactory 
> was never initialized in the solr container.  The reason for this seems to be 
> that the shard handler is setup in core.CoreContainer::initShardHandler which 
> is called from core.CoreContainer::load. 
> When setting up the core container we were using the  public 
> CoreContainer(SolrResourceLoader loader) constructor.  This constructor never 
> calls the load method, so initShardHandler is never called and the 
> shardHandler is never initialized. 
> In Solr 4.0.0 the shardHandler was initialized on the calling of 
> getShardHandlerFactory.  This code was modified and moved by revision 
> 1422728: SOLR-4204: Make SolrCloud tests more friendly to FreeBSD blackhole 2 
> environments.
>  
> We fixed our issue by using the public CoreContainer(String dir, File 
> configFile) constructor which calls the load method.
> I just wanted to make sure that people were aware of this issue and to 
> determine if it really is an issue or if having the shardHandler be null was 
> expected behavior unless someone called the load(String dir, File configFile 
> ) method.
>  
> Thank you,
>  
> Ted
>  
>  
>  
> Stack trace of error:
> org.apache.solr.client.solrj.SolrServerException: 
> org.apache.solr.client.solrj.SolrServerException: 
> java.lang.NullPointerException
> at 
> org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:223)
> at 
> org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:90)
> at 
> org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:301)
> at 
> com.cision.search.solr.ProximityQParserTest.testInit(ProximityQParserTest.java:72)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown 
> Source)
> at java.lang.reflect.Method.invoke(Unknown Source)
> at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
> at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
> at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
> at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
> at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:76)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
> at 
> org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
> at 
> org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)
> at 
> org.junit.runners.ParentRunner.access$000(ParentRunner.java:42)
> at 
> org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:184)
> at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:236)
> at 
> org.eclipse.jdt.internal.junit4.runner.JUnit4TestReference.run(JUnit4TestReference.java:50)
> at 
> org.eclipse.jdt.internal.junit.runner.TestExecution.run(TestExecution.java:38)
> at 
> org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:467)
> at 
> org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:683)
> at 
> org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.run(RemoteTestRunner.java:390)
> at 
> org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.main(RemoteTestRunner.java:197)
> Caused by: org.apache.solr.client.solrj.SolrServerException: 
> java.lang.NullPointerException
>   

Re: zookeeper config

2013-01-24 Thread Mark Miller

On Jan 24, 2013, at 7:05 AM, Shawn Heisey  wrote:

> My experience has been that you put the chroot at the very end, not on every 
> host entry

Yup - this came up on the mailing list not too long ago and it's currently 
correctly documented on the SolrCloud wiki.
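
That is, the chroot goes once at the very end (hosts are placeholders):

-DzkHost=zk1:2181,zk2:2181,zk3:2181/solr

rather than zk1:2181/solr,zk2:2181/solr,zk3:2181/solr.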

- Mark

Re: Deletion from database

2013-01-24 Thread Walter Underwood
The general solution is to add a "deleted" column to your database, or even a 
"deleted date" column.

When you update Solr from the DB, issue a delete for each item deleted since 
the last successful update.
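
For example, the deletes can be posted to the update handler (URL and IDs are 
placeholders):

curl 'http://localhost:8983/solr/update?commit=true' -H 'Content-Type: text/xml' --data-binary '<delete><id>123</id><id>456</id></delete>'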

You can delete those rows after the Solr update or to be extra safe, delete 
them a few days later.

For this to work, you must not re-use IDs.

wunder

On Jan 24, 2013, at 10:05 AM, hassancrowdc wrote:

> Hi,
> I am trying to figure out: if I delete anything from my
> database, how does that item get deleted from my indexed data?
> Is there any way I can make a new core with the same config as the existing core,
> do a full index, swap the data with the existing core and delete the new core?
> So every time I delete anything from the database, it creates a new core, indexes
> the data, swaps it and then deletes the new core (that was made)?
> 
> 
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Deletion-from-database-tp4036018.html
> Sent from the Solr - User mailing list archive at Nabble.com.







Deletion from database

2013-01-24 Thread hassancrowdc
Hi,
I am trying to figure out: if I delete anything from my
database, how does that item get deleted from my indexed data?
Is there any way I can make a new core with the same config as the existing core,
do a full index, swap the data with the existing core and delete the new core?
So every time I delete anything from the database, it creates a new core, indexes
the data, swaps it and then deletes the new core (that was made)?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Deletion-from-database-tp4036018.html
Sent from the Solr - User mailing list archive at Nabble.com.


Mahout - Solr vs Mahout Lucene Question

2013-01-24 Thread vybe3142
Hi,
I hate to double post, but I'm not sure in which domain the answer to my
question lies, so here's the link to my question on the Mahout list.

Basically, I'm getting different clustering results depending on whether I
index data with Solr or Lucene. Please post any responses against the
original question.

Thanks

http://lucene.472066.n3.nabble.com/Clustering-using-Solr-Index-vs-Lucene-Index-Different-Results-td4036013.html



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Mahout-Solr-vs-Mahout-Lucene-Question-tp4036014.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr SQL Express Integrated Security - Unable to execute query

2013-01-24 Thread O. Olson
Shawn Heisey-4 wrote
>> There will be a lot more detail to this error.  This detail may have a 
>> clue about what happened.  Can you include the entire stacktrace?
>> 
>> Thanks,
>>Shawn

Thank you Shawn. The following is the entire stacktrace. I hope this helps:


INFO: Creating a connection for entity Product with URL:
jdbc:sqlserver://localhost;instanceName=SQLEXPRESS;databaseName=Amazon;integratedSecurity=true;
Jan 23, 2013 3:26:05 PM org.apache.solr.core.SolrCore execute
INFO: [db] webapp=/solr path=/dataimport params={command=status} status=0
QTime=1 
Jan 23, 2013 3:26:31 PM org.apache.solr.common.SolrException log
SEVERE: Exception while processing: Product document :
SolrInputDocument[]:org.apache.solr.handler.dataimport.DataImportHandlerException:
Unable to execute query: SELECT [ProdID],[Descr] FROM
[Amazon].[dbo].[Table_Temp] Processing Document # 1
at
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:71)
at
org.apache.solr.handler.dataimport.JdbcDataSource$ResultSetIterator.<init>(JdbcDataSource.java:252)
at
org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:209)
at
org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:38)
at
org.apache.solr.handler.dataimport.SqlEntityProcessor.initQuery(SqlEntityProcessor.java:59)
at
org.apache.solr.handler.dataimport.SqlEntityProcessor.nextRow(SqlEntityProcessor.java:73)
at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:243)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:472)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:411)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:326)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:234)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:382)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:448)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:429)
Caused by: com.microsoft.sqlserver.jdbc.SQLServerException: The server
SQLEXPRESS is not configured to listen with TCP/IP.
at
com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDriverError(SQLServerException.java:171)
at
com.microsoft.sqlserver.jdbc.SQLServerConnection.getInstancePort(SQLServerConnection.java:3188)
at
com.microsoft.sqlserver.jdbc.SQLServerConnection.primaryPermissionCheck(SQLServerConnection.java:937)
at
com.microsoft.sqlserver.jdbc.SQLServerConnection.login(SQLServerConnection.java:800)
at
com.microsoft.sqlserver.jdbc.SQLServerConnection.connect(SQLServerConnection.java:700)
at
com.microsoft.sqlserver.jdbc.SQLServerDriver.connect(SQLServerDriver.java:842)
at
org.apache.solr.handler.dataimport.JdbcDataSource$1.call(JdbcDataSource.java:160)
at
org.apache.solr.handler.dataimport.JdbcDataSource$1.call(JdbcDataSource.java:127)
at
org.apache.solr.handler.dataimport.JdbcDataSource.getConnection(JdbcDataSource.java:362)
at
org.apache.solr.handler.dataimport.JdbcDataSource.access$200(JdbcDataSource.java:38)
at
org.apache.solr.handler.dataimport.JdbcDataSource$ResultSetIterator.<init>(JdbcDataSource.java:239)
... 12 more

Jan 23, 2013 3:26:31 PM org.apache.solr.update.processor.LogUpdateProcessor
finish
INFO: [db] webapp=/solr path=/dataimport params={command=full-import}
status=0 QTime=13 {deleteByQuery=*:*} 0 13
Jan 23, 2013 3:26:31 PM org.apache.solr.common.SolrException log
SEVERE: Full Import failed:java.lang.RuntimeException:
java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to
execute query: SELECT [ProdID],[Descr] FROM [Amazon].[dbo].[Table_Temp]
Processing Document # 1
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:273)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:382)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:448)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:429)
Caused by: java.lang.RuntimeException:
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to
execute query: SELECT [ProdID],[Descr] FROM [Amazon].[dbo].[Table_Temp]
Processing Document # 1
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:413)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:326)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:234)
... 3 more
Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException:
Unable to execute query: SELECT [ProdID],[Descr] FROM
[Amazon].[dbo].[Table_Temp] Pr

Re: Starting instances with multiple collections

2013-01-24 Thread Per Steffensen
Each node needs a -Dsolr.solr.home pointing to a solr.xml, but the 
configuration-subfolder does not need to be there. It only needs to be 
there for the node you start with -Dbootstrap_confdir (to have it load 
the config into ZK). The next time you start this Solr you do not need 
to provide -Dbootstrap_confdir, since config is already loaded into ZK 
(well unless you run your ZK embedded in the Solr - in this case I 
believe all ZK state is removed when you close the Solr, but that is 
also just for "playing")
In general, IMHO, using a Solr node to load a configuration during 
startup is only for "playing". You ought to load configs into ZK as a 
separate operation from starting Solrs (and creating collections for 
that matter). Also see recent mail-list dialog "Submit schema definition 
using curl via SOLR"


Regards, Per Steffensen

On 1/23/13 11:12 PM, Walter Underwood wrote:

I can get one Solr 4.1 instance up with the config bootstrapped into Zookeeper. 
In zk I see two configs, two collections, and I can run the DIH on the first 
node.

I can get the other two nodes to start and sync if I give them a 
-Dsolr.solr.home pointing to a directory with a solr.xml and subdirectories 
with configuration for each collection. If I don't do that, they look for 
solr/solr.xml, then fail. But what is the point of putting configs in Zookeeper 
if each host needs a copy anyway?

The wiki does not have an example of how to start a cluster with multiple 
collections.

Am I missing something here?

wunder
--
Walter Underwood
wun...@wunderwood.org








Re: Submit schema definition using curl via SOLR

2013-01-24 Thread Per Steffensen

On 1/24/13 4:51 PM, Per Steffensen wrote:


2) or you can have a Solr node (server) load a "Solr config" into ZK 
during startup by adding collection.configName and bootstrap_confdir 
VM params - something like this
java -DzkHost=<zk connect string> -Dcollection.configName=<config name> -Dbootstrap_confdir=<conf dir> -jar start.jar

Well, <config name> instead of edr_sms_conf, of course



Re: Does solr 4.1 support field compression?

2013-01-24 Thread Shawn Heisey

On 1/24/2013 8:42 AM, Ken Prows wrote:

I didn't see any mention of field compression in the release notes for
Solr 4.1. Did the ability to automatically compress fields end up
getting added to this release?


The concept of compressed fields (an option in schema.xml) that existed 
in the 1.x versions of Solr (based on Lucene 2.9) was removed in Lucene 
3.0.  Because Lucene and Solr development were combined, the Solr 
version after 1.4.1 is 3.1.0; there is no 1.5 or 2.x version of Solr.


Solr/Lucene 4.1 compresses all stored field data by default.  I don't 
think there's a way to turn it off at the moment, which is causing 
performance problems for a small subset of Solr users.  When it comes 
out, Solr 4.2 will also have compressed term vectors.


The release note contains this text:

Stored fields are compressed. (See 
http://blog.jpountz.net/post/33247161884/efficient-compressed-stored-fields-with-lucene)


It looks like the solr CHANGES.txt file fails to specifically mention 
LUCENE-4226  which 
implemented compressed stored fields.


Thanks,
Shawn



Re: Does solr 4.1 support field compression?

2013-01-24 Thread Ken Prows
Doh! I went straight to the release notes. Thanks, this is the
feature I was waiting for :)

Ken

On Thu, Jan 24, 2013 at 10:49 AM, André Widhani
 wrote:
> This is what it listed under the "Highlights" on the Apache page announcing 
> the Solr 4.1 release:
>
>   "The default codec incorporates an efficient compressed stored fields 
> implementation that compresses chunks of documents together with LZ4. (see 
> http://blog.jpountz.net/post/33247161884/efficient-compressed-stored-fields-with-lucene)"
>
> André
>
> 
> Von: Rafał Kuć [r@solr.pl]
> Gesendet: Donnerstag, 24. Januar 2013 16:45
> An: solr-user@lucene.apache.org
> Betreff: Re: Does solr 4.1 support field compression?
>
> Hello!
>
> It should be turned on by default, because the stored fields
> compression is the behavior of the default Lucene 4.1 codec.
>
> --
> Regards,
>  Rafał Kuć
>  Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch
>
>> Hi everyone,
>
>> I didn't see any mention of field compression in the release notes for
>> Solr 4.1. Did the ability to automatically compress fields end up
>> getting added to this release?
>
>> Thanks!,
>> Ken
>


Re: Submit schema definition using curl via SOLR

2013-01-24 Thread Per Steffensen
Basically uploading a "Solr config" (including schema.xml, 
solrconfig.xml etc.) is an operation different from creating 
collections. When creating a collection (e.g. using the Collection API) 
you reference the (already existing) "Solr config" it needs to use. 
Collections can share "Solr config"s. I know of at least two ways to 
load a "Solr config" into ZK using Solr-tools.


1) You can use ZkCLI tool (of course ZK needs to be started) - something 
like this

mkdir -p "${SOLR_INSTALL}/example/webapps/temp"
cp "${SOLR_INSTALL}/example/webapps/solr.war" 
"${SOLR_INSTALL}/example/webapps/temp"

cd "${SOLR_INSTALL}/example/webapps/temp"
jar -xf "solr.war"
java -classpath "${SOLR_INSTALL}/example/webapps/temp/WEB-INF/lib/*" \
org.apache.solr.cloud.ZkCLI -cmd upconfig -confdir <conf dir> \
-confname <config name> --zkhost <zk connect string>


rm -rf "${SOLR_INSTALL}/example/webapps/temp"
Believe there is also a zkcli.sh tool

2) or you can have a Solr node (server) load a "Solr config" into ZK 
during startup by adding collection.configName and bootstrap_confdir VM 
params - something like this
java -DzkHost=<zk connect string> -Dcollection.configName=edr_sms_conf -Dbootstrap_confdir=<conf dir> -jar start.jar


I prefer 1) for several reasons.

Regards, Per Steffensen

On 1/24/13 4:02 PM, Fadi Mohsen wrote:

Hi, We would like to use Solr to index statistics from any Java module in
our production environment.

Applications have to be able to create collections and index data on demand, so my
initial thought is to use different HTTP methods to accomplish a collection
in cluster and then right away start HTTP POST documents, but the issue
here is the schema.xml.
Is it possible to HTTP POST the schema via Solr to Zookeeper?

Or do I have to know about other service host/IP than SOLR, such as
ZooKeeper (wanted to understand whether there is a way to avoid knowing
about zookeeper in production.)?

This must be a duplicate of another question, excuse me in advance.

Regards
Fadi





AW: Does solr 4.1 support field compression?

2013-01-24 Thread André Widhani
This is what it listed under the "Highlights" on the Apache page announcing the 
Solr 4.1 release:

  "The default codec incorporates an efficient compressed stored fields 
implementation that compresses chunks of documents together with LZ4. (see 
http://blog.jpountz.net/post/33247161884/efficient-compressed-stored-fields-with-lucene)"

André


Von: Rafał Kuć [r@solr.pl]
Gesendet: Donnerstag, 24. Januar 2013 16:45
An: solr-user@lucene.apache.org
Betreff: Re: Does solr 4.1 support field compression?

Hello!

It should be turned on by default, because the stored fields
compression is the behavior of the default Lucene 4.1 codec.

--
Regards,
 Rafał Kuć
 Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch

> Hi everyone,

> I didn't see any mention of field compression in the release notes for
> Solr 4.1. Did the ability to automatically compress fields end up
> getting added to this release?

> Thanks!,
> Ken



Re: Does solr 4.1 support field compression?

2013-01-24 Thread Rafał Kuć
Hello!

It should be turned on by default, because the stored fields
compression is the behavior of the default Lucene 4.1 codec.

-- 
Regards,
 Rafał Kuć
 Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch

> Hi everyone,

> I didn't see any mention of field compression in the release notes for
> Solr 4.1. Did the ability to automatically compress fields end up
> getting added to this release?

> Thanks!,
> Ken



Does solr 4.1 support field compression?

2013-01-24 Thread Ken Prows
Hi everyone,

I didn't see any mention of field compression in the release notes for
Solr 4.1. Did the ability to automatically compress fields end up
getting added to this release?

Thanks!,
Ken


Re: AW: AW: auto completion search with solr using NGrams in SOLR

2013-01-24 Thread Naresh
Hi,
You can fetch all the stored fields by passing them as part of the *fl*
parameter. Go through
http://wiki.apache.org/solr/CommonQueryParameters#fl
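
For the product title and image fields mentioned below, that would be something
like &fl=title,image (or &fl=*,score to return every stored field plus the score).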


On Thu, Jan 24, 2013 at 8:56 PM, AnnaVak  wrote:

> Thanks for your solution, it works for me too. I'm new to Solr, but how can I
> additionally fetch other fields, not only the field that was used for
> searching? For example I have product title and image fields and I want to
> get the title but also the image related to this title. How can I do this?
>
> Thanks in advance
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/auto-completion-search-with-solr-using-NGrams-in-SOLR-tp3998559p4035931.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Regards
Naresh


Submit schema definition using curl via SOLR

2013-01-24 Thread Fadi Mohsen
Hi, We would like to use Solr to index statistics from any Java module in
our production environment.

Applications have to be able to create collections and index data on demand, so my
initial thought is to use different HTTP methods to accomplish a collection
in cluster and then right away start HTTP POST documents, but the issue
here is the schema.xml.
Is it possible to HTTP POST the schema via Solr to Zookeeper?

Or do I have to know about other service host/IP than SOLR, such as
ZooKeeper (wanted to understand whether there is a way to avoid knowing
about zookeeper in production.)?

This must be a duplicate of another question, excuse me in advance.

Regards
Fadi


Re: Problem with migration from solr 3.5 with SOLR-2155 usage to solr 4.0

2013-01-24 Thread Viacheslav Davidovich
Hi David,

thank you for your answer.

After updating to this field type and changing the Solr query I get the required 
behavior.

Also, could you update the wiki page to add the Maven artifact coordinates after 
the words "it needs to be in WEB-INF/lib in Solr's war file, basically", like 
this?


<dependency>
  <groupId>com.vividsolutions</groupId>
  <artifactId>jts</artifactId>
  <version>1.13</version>
</dependency>

I think this may help users who use Maven.

WBR Viacheslav.

On 23.01.2013, at 19:24, Smiley, David W. wrote:

> Viacheslav,
> 
> 
> SOLR-2155 is only compatible with Solr 3.  However the technology it is
> based on lives on in Lucene/Solr 4 in the
> "SpatialRecursivePrefixTreeFieldType" field type.  In the example schema
> it's registered under the name "location_rpt".  For more information on
> how to use this field type, see: SpatialRecursivePrefixTreeFieldType
> 
> ~ David Smiley
> 
> On 1/23/13 11:11 AM, "Viacheslav Davidovich"
>  wrote:
> 
>> Hi, 
>> 
>> With Solr 3.5 I use the SOLR-2155 plugin to filter the documents by distance
>> as described in
>> http://wiki.apache.org/solr/SpatialSearch#Advanced_Spatial_Search and
>> this solution perfectly filters the multiValued data defined in schema.xml
>> like
>>
>> <fieldType name="geohash" class="solr2155.solr.schema.GeoHashField" length="12" />
>>
>> <field name="location_data" type="geohash" indexed="true" stored="true" multiValued="true"/>
>> 
>> the query looks like this with Solr 3.5:
>> q=*:*&fq={!geofilt}&sfield=location_data&pt=45.15,-93.85&d=50&sort=geodist() asc
>> 
>> As the SOLR-2155 plugin is not compatible with Solr 4.0, I tried to change the
>> field definition to:
>>
>> <fieldType name="location" class="solr.LatLonType" subFieldSuffix="_coordinate" />
>>
>> <field name="location_data" type="location" indexed="true" stored="true" multiValued="true"/>
>>
>> <dynamicField name="*_coordinate" type="tdouble" indexed="true" stored="false" />
>> 
>> But in this case, after executing geofilt on location_data, the correct
>> values are returned only if the field has one value; if more than one value is
>> stored in the index, the required documents are returned only when all the
>> location points are matched.
>> 
>> Does anybody have experience or any ideas on how to get the same behavior in
>> Solr 4.0 as in Solr 3.5 with the SOLR-2155 plugin?
>> 
>> Is this possible at all, or do I need to refactor the document structure and
>> field definition to store only one location value per document?
>> 
>> WBR Viacheslav.
>> 
> 
> 



Solr 4.1.0 shardHandlerFactory Null Pointer Exception when setting up embedded solrj solr server for unit testing

2013-01-24 Thread Ted Merchant
We recently updated from Solr 4.0.0 to Solr 4.1.0.  Because of the change we 
were forced to upgrade a custom query parser.  While the code change itself was 
minimal, we found that our unit tests stopped working because of a 
NullPointerException on line 181 of handler.component.SearchHandler:
ShardHandler shardHandler1 = shardHandlerFactory.getShardHandler();
We determined that the cause of this exception was that shardHandlerFactory was 
never initialized in the solr container.  The reason for this seems to be that 
the shard handler is set up in core.CoreContainer::initShardHandler which is 
called from core.CoreContainer::load.
When setting up the core container we were using the  public 
CoreContainer(SolrResourceLoader loader) constructor.  This constructor never 
calls the load method, so initShardHandler is never called and the shardHandler 
is never initialized.

In Solr 4.0.0 the shardHandler was initialized on the calling of 
getShardHandlerFactory.  This code was modified and moved by revision 1422728: 
SOLR-4204: Make SolrCloud tests more friendly to FreeBSD blackhole 2 
environments.

We fixed our issue by using the public CoreContainer(String dir, File 
configFile) constructor, which calls the load method.
I just wanted to make sure that people were aware of this issue, and to 
determine whether it really is an issue or whether having the shardHandler be 
null is expected behavior unless someone calls the load(String dir, File 
configFile) method.
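
For anyone hitting the same NPE in embedded tests, a minimal sketch of the
workaround described above (the solr.xml path and core name are illustrative):

import java.io.File;

import org.apache.solr.client.solrj.SolrServer;
import org.apache.solr.client.solrj.embedded.EmbeddedSolrServer;
import org.apache.solr.core.CoreContainer;

public class TestServerFactory {
    public static SolrServer create() {
        // This constructor calls load(), which initializes the
        // shardHandlerFactory before any request is handled.
        CoreContainer container =
                new CoreContainer("solr", new File("solr/solr.xml"));
        return new EmbeddedSolrServer(container, "collection1");
    }
}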

Thank you,

Ted



Stack trace of error:
org.apache.solr.client.solrj.SolrServerException: 
org.apache.solr.client.solrj.SolrServerException: java.lang.NullPointerException
at 
org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:223)
at 
org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:90)
at 
org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:301)
at 
com.cision.search.solr.ProximityQParserTest.testInit(ProximityQParserTest.java:72)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown 
Source)
at java.lang.reflect.Method.invoke(Unknown Source)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
at 
org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:76)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
at 
org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
at 
org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)
at 
org.junit.runners.ParentRunner.access$000(ParentRunner.java:42)
at 
org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:184)
at 
org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
at org.junit.runners.ParentRunner.run(ParentRunner.java:236)
at 
org.eclipse.jdt.internal.junit4.runner.JUnit4TestReference.run(JUnit4TestReference.java:50)
at 
org.eclipse.jdt.internal.junit.runner.TestExecution.run(TestExecution.java:38)
at 
org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:467)
at 
org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:683)
at 
org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.run(RemoteTestRunner.java:390)
at 
org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.main(RemoteTestRunner.java:197)
Caused by: org.apache.solr.client.solrj.SolrServerException: 
java.lang.NullPointerException
at 
org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:155)
... 27 more
Caused by: java.lang.NullPointerException
at 
org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:181)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:1816)
at 
org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:15

Re: AW: AW: auto completion search with solr using NGrams in SOLR

2013-01-24 Thread AnnaVak
Thanks for your solution, it works for me too. I'm new to Solr, though: how
can I additionally fetch other fields, not only the field that was used for
searching? For example, I have product title and image fields, and I want to
get the title but also the image related to that title. How can I do this?

Thanks in advance 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/auto-completion-search-with-solr-using-NGrams-in-SOLR-tp3998559p4035931.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solr autocomplete feature

2013-01-24 Thread Ilayaraja . P
Hi

 I want to change the autocomplete implementation for our search. Currently I 
have a suggest field whose definition in schema.xml is as below:

   
It works as follows:
"shoes" will match "casual shoes", "sports shoes", "shoes", etc.


Whereas I want it to match only values that start with the user query.
I.e. if the user types "shoes", I want to suggest terms that start with 
"shoes", i.e. that have the query string as a prefix in the "suggest" field 
in the index.

Please let me know how to do this.

Regards,
Ilay
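
One common way to get prefix-only matching (a sketch, not from this thread;
the type name and gram sizes are illustrative) is to analyze the whole value
with KeywordTokenizer and emit only leading-edge n-grams at index time:

<fieldType name="suggest_prefix" class="solr.TextField">
  <analyzer type="index">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EdgeNGramFilterFactory" minGramSize="1"
            maxGramSize="25" side="front"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

Because grams are generated only from the front of the whole (untokenized)
value, "shoes" would then match "shoes" but not "casual shoes".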



Re: zookeeper config

2013-01-24 Thread Shawn Heisey

On 1/24/2013 12:58 AM, Per Steffensen wrote:
This is supported. You just need to adjust your ZK connection-string: 
"host:port/solr,host:port/solr,...,host:port/solr"
My experience has been that you put the chroot at the very end, not on 
every host entry.  For a standalone zookeeper ensemble with three nodes:


"server1:2181,server2:2181,server3:2181/mysolr1"

This is used for the zkHost parameter both on Solr startup and with the 
CloudSolrServer object from SolrJ.  The string is used without 
modification in constructing the actual ZooKeeper object down in the 
SolrCloud internals.  Here's the documentation for that object:


http://zookeeper.apache.org/doc/r3.4.5/api/org/apache/zookeeper/ZooKeeper.html#ZooKeeper%28java.lang.String,%20int,%20org.apache.zookeeper.Watcher%29
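
A minimal sketch of passing that string (hosts and chroot are illustrative;
zkHost and CloudSolrServer are the standard parameter and SolrJ class):

java -DzkHost=zk1:2181,zk2:2181,zk3:2181/mysolr1 -jar start.jar

CloudSolrServer server =
    new CloudSolrServer("zk1:2181,zk2:2181,zk3:2181/mysolr1");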

Thanks,
Shawn



Re: solr running with multi cores

2013-01-24 Thread Otis Gospodnetic
Hi,

Please search the mailing list archives - this has been discussed a few
times in the last few months.

Otis
Solr & ElasticSearch Support
http://sematext.com/
On Jan 24, 2013 6:33 AM, "real_junlin"  wrote:

> Hi,
> Our company wants to use Solr to index our reports' data, so we are going
> to get to know Solr.
>  Solr supports multiple cores; in our system the number of cores will grow
> dynamically, and I am afraid that with more cores the performance will
> decrease dramatically. Our system will have over one hundred cores.
>
>
> What I want to know is:
> How many cores does Solr support, and up to what level does Solr run
> well?
> How does Solr allocate system resources (memory, disk space, CPU...)
> across the multiple cores?
> Is there a performance experiment on Solr running with many cores?
>
>
>
>
> Thanks
>  junlin.


solr running with multi cores

2013-01-24 Thread real_junlin
Hi,
Our company wants to use Solr to index our reports' data, so we are going to 
get to know Solr.
 Solr supports multiple cores; in our system the number of cores will grow 
dynamically, and I am afraid that with more cores the performance will 
decrease dramatically. Our system will have over one hundred cores.


What I want to know is:
How many cores does Solr support, and up to what level does Solr run 
well?
How does Solr allocate system resources (memory, disk space, CPU...) across 
the multiple cores?
Is there a performance experiment on Solr running with many cores?




Thanks
 junlin.

Re: zookeeper config

2013-01-24 Thread J Mohamed Zahoor
Cool. Thanks.


On 24-Jan-2013, at 1:28 PM, Per Steffensen  wrote:

> This is supported. You just need to adjust your ZK connection-string: 
> "host:port/solr,host:port/solr,...,host:port/solr"
> 
> Regards, Per Steffensen
> 
> On 1/24/13 7:57 AM, J Mohamed Zahoor wrote:
>> Hi
>> 
>> I am using Solr 4.0.
>> I see the Solr data in zookeeper is placed on the root znode itself.
>> This becomes a pain if the zookeeper instance is used for multiple projects 
>> like HBase and the like.
>> 
>> I am thinking of raising a Jira for putting them under a znode /solr or 
>> something like that?
>> 
>> ./Zahoor
>> 
>> 
> 



Re: setting up master and slave in same machine with diff ip's and same port

2013-01-24 Thread Upayavira
You could configure your servlet container (Jetty/Tomcat) to have
specific webapps/contexts listen on specific IP/port combinations; that
would get you some of the way. But what you are asking is more about
networking and servlet container configuration than about Solr.

Upayavira
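
A minimal sketch of one way to do that on the DEV box (addresses, device
name, and paths are illustrative; it assumes the stock example jetty.xml,
which reads the jetty.host/jetty.port system properties):

# add a second address so two IPs exist on the one DEV machine
sudo ip addr add 10.1.1.2/24 dev eth0

# run one Solr instance bound to each address, same port
# (each instance also needs its own solr home, e.g. -Dsolr.solr.home=...)
java -Djetty.host=10.1.1.1 -Djetty.port=8983 -jar start.jar   # master
java -Djetty.host=10.1.1.2 -Djetty.port=8983 -jar start.jar   # slave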

On Wed, Jan 23, 2013, at 10:48 PM, epnRui wrote:
> Hi everyone 
> 
> It's my first post here, so I hope I'm doing it in the right place. 
> 
> I'm a software developer and I'm setting up a DEV environment in Ubuntu
> with the same configuration as in PROD. (Apparently this IT department
> doesn't know the difference between a developer and a sys admin.) 
> 
> In PROD we have a Solr master and a Solr slave, on two different IPs. Let's
> say: 
> Master 192.10.1.1 
> Slave 192.10.1.2 
> 
> In DEV I have only one server: 
> 10.1.1.1 
> 
> All of them are Ubuntu servers. 
> 
> Can I put master and slave on 10.1.1.1 (DEV), without touching any
> configuration in Solr (no IP change, no port change), and still make it
> work? 
> 
> Basically, what I'm looking for is the Ubuntu server configuration I'd
> have to apply to make this work. 
> 
> Thanks a lot
> 
> 
> 
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/setting-up-master-and-slave-in-same-machine-with-diff-ip-s-and-same-port-tp4035795.html
> Sent from the Solr - User mailing list archive at Nabble.com.


Re: Confused by queries

2013-01-24 Thread Anders Melchiorsen

Hello.

That is indeed an excellent article, thanks for pointing me at it. With
a title like that, it is no wonder that I was unable to google it on my
own.

It is probably the exception in this rule that has been confusing me:

If a BooleanQuery contains no MUST BooleanClauses, then a
document is only considered a match against the BooleanQuery
if one or more of the SHOULD BooleanClauses is a match.

So "+group:id +keyword:text" and "(+group:id) +keyword:text" mean
completely different things.

I have mostly been using the reference at
http://lucene.apache.org/core/3_6_0/queryparsersyntax.html and it does
not mention this distinction. Quite the contrary, actually, as it says
that grouping can be used to eliminate confusion, thereby suggesting that
the usual rules of Boolean algebra apply.


Thanks again,
Anders.


On 23.01.2013 02:20, Erick Erickson wrote:

Solr/Lucene does not implement strict boolean logic. Here's an
excellent blog discussing this:

http://searchhub.org/dev/2011/12/28/why-not-and-or-and-not/

Best
Erick

On Tue, Jan 22, 2013 at 7:25 PM, Otis Gospodnetic
 wrote:

Well, depends on what you indexed.

Otis
Solr & ElasticSearch Support
http://sematext.com/
On Jan 22, 2013 5:48 PM, "Anders Melchiorsen" wrote:


Thanks, though I am still confused.

How about this one:

manu:apple => 1 hit
+name:video => 2 hits

manu:apple +name:video => 2 hits

Solr ignores the manu:apple part completely?


Cheers,
Anders.


Den 22/01/13 23.16, Jack Krupansky skrev:


The first query:

   name:ipod OR -name:ipod => 0 hits

The "OR" and "-" are actually at the same level of the 
BooleanQuery, so

the "-" overrides the OR so it's equivalent to:

   name:ipod -name:ipod => 0 hits

For the second query:

   (name:ipod) OR (-name:ipod) => 3 hits

Pure negative queries are supported only at the top level, so the
"(-name:ipod)" matches nothing, so the query is equivalent to:

   (name:ipod) => 3 hits

You can simply insert a "*:*" to assure that it is not a pure negative
query inside the parentheses:

   (name:ipod) OR (*:* -name:ipod)

-- Jack Krupansky

-Original Message- From: Anders Melchiorsen
Sent: Tuesday, January 22, 2013 4:59 PM
To: solr-user@lucene.apache.org
Subject: Confused by queries

Hello!

With the example server of Solr 4.0.0 (with *.xml indexed), I get these
results:

*:* => 32 hits
name:ipod => 3 hits
-name:ipod => 29 hits

That is all fine, but for these next queries, I would expect to get 32
hits (i.e. everything), or at least the same number of hits for both
queries:

name:ipod OR -name:ipod => 0 hits
(name:ipod) OR (-name:ipod) => 3 hits

As my expectations are not met, I must be missing something?


Thanks,
Anders.








RE: problem in qf parameter - no results

2013-01-24 Thread Markus Jelsma
Hi,

I think it's your mm parameter, and that the terms are not matched in the 
'setctor' field.
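
One quick way to check (illustrative URL; mm and debugQuery are standard
parameters) is to relax mm and turn on debugging to see which clauses match:

localhost:8983/solr/select/?defType=edismax&qf=title^1 author^0.75
publisher^0.25 sector^0.25&q=bibbia di gerusalemme&mm=1&debugQuery=true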

Cheers, 
 
-Original message-
> From:Gastone Penzo 
> Sent: Thu 24-Jan-2013 10:11
> To: solr-user@lucene.apache.org
> Subject: problem in qf parameter - no results
> 
> Hi,
> I have a problem with the qf parameter:
> 
> 
> 38 results
> localhost:8983/solr/select/?defType=edismax&qf=title^1 author^0.75
> publisher^0.25&q=bibbia di gerusalemme
> 
> 0 results
> localhost:8983/solr/select/?defType=edismax&qf=title^1 author^0.75
> publisher^0.25 setctor^0.25&q=bibbia di gerusalemme
> 
> 
> the only difference is the sector field, which is:
> 
>  required="false" multiValued="true"/>
> 
> why does adding the sector field to the qf parameter make Solr return 0 products?
> 
> thank you
> 
> -- 
> *Gastone Penzo*
> *
> *
> 


RE: Issues with docFreq/docCount on SolrCloud

2013-01-24 Thread Markus Jelsma
Alright, so my suggestion of overriding HttpShardHandler to route users to the 
same replica instead of shuffling the replica URLs is doable? What about the 
comment in HttpShardHandler then?

  //
  // Shuffle the list instead of use round-robin by default.
  // This prevents accidental synchronization where multiple shards could
  // get in sync and query the same replica at the same time.
  //
  if (urls.size() > 1)
    Collections.shuffle(urls, httpShardHandlerFactory.r);
  shardToURLs.put(shard, urls);

Instead of shuffling, I would then hash the user to the correct replica if 
possible.
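
A minimal sketch of that idea (hypothetical, not actual Solr code; userKey
stands in for whatever stable per-user value the request would carry):

import java.util.Collections;
import java.util.List;

public class ReplicaAffinity {
    // Order the replica URLs deterministically from a per-user key instead
    // of shuffling, so follow-up requests prefer the same replica (and see
    // consistent docFreq stats); the other replicas remain as failover.
    public static List<String> orderForUser(List<String> urls, String userKey) {
        if (urls.size() > 1 && userKey != null) {
            int idx = (userKey.hashCode() & 0x7fffffff) % urls.size();
            Collections.rotate(urls, -idx); // replica at idx moves to front
        }
        return urls;
    }
}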

Thanks,
Markus
 
-Original message-
> From:Mark Miller 
> Sent: Thu 24-Jan-2013 00:33
> To: solr-user@lucene.apache.org
> Subject: Re: Issues with docFreq/docCount on SolrCloud
> 
> 
> On Jan 23, 2013, at 6:21 PM, Yonik Seeley  wrote:
> 
> > A solr request could request a token that when resubmitted with a
> > follow-up request would result in hitting the same replicas if
> > possible.
> 
> Yeah, this would be good. It's also useful for not catching "eventual 
> consistency" effects between queries.
> 
> - Mark


Re: Hi

2013-01-24 Thread Dmitry Kan
(start-off-topic): Alexandre, nice ideas. The last one in the *) list is a bit
far-fetched, but still good. I would add one more: how to have exact matches
and inexact matches in the same analyzed field. (end-off-topic)

On Wed, Jan 23, 2013 at 2:40 PM, Alexandre Rafalovitch
wrote:

> We need a "Make your own adventure"  (TM) Solr troubleshooting guide. :-)
>
> *) You are staring at the Solr installation full of twisty little passages
> and nuances. Would you like to:
>*) Build your first index?
>*) Make your first query?
>*) Spread your documents in the cloud?
>*) Build your own UpdateProcessor to integrate reverse Geocoding web
> service into your NLP disambiguation UIMA module to drive your More Like
> This suggestions?
>
> Well, maybe somebody with more imagination can figure out a better way to
> phrase it. Then we make a mobile app for doing this and retire
> millionaires. :-) Though that last one could make for an awesome Solr demo.
> :-)
>
> Seriously though.
>
> Thendral,
> You do need to say at least how far you got before you emailed us. Have you
> gone through the tutorial and understood it, but your own custom schema is
> giving you trouble? Have you tried indexing a Solr Update XML document
> containing the data you believe you have?
>
> You need to be able to take a large problem and split it in half, and see
> which half works and which one does not. It is a bit hard to tell from your
> description.
>
> Regards,
>Alex.
>
>
> Personal blog: http://blog.outerthoughts.com/
> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
> - Time is the quality of nature that keeps events from happening all at
> once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)
>
>
> On Wed, Jan 23, 2013 at 7:00 AM, Upayavira  wrote:
>
> > You are going to have to give more information than this. If you get bad
> > request, look in the logs for the Solr server and you will probably find
> > an exception there that tells you what was wrong with your document.
> >
> > Upayavira
> >
> > On Wed, Jan 23, 2013, at 08:58 AM, Thendral Thiruvengadam wrote:
> > > Hi,
> > >
> > > We are trying to use Solr for indexing our application data.
> > >
> > > When we try to add a new object into solr, we are getting Bad Request.
> > >
> > > Please help us with this.
> > >
> > > Thanks,
> > > Thendral
> > >
> > > 
> > >
> > > http://www.mindtree.com/email/disclaimer.html
> >
>