Yes, a sophisticated app with lots of complex jobs will have to be quite smart 
about how it decides to commit. The goal for LCF would be simply to supply 
enough job status so that such a sophisticated app could decide that the job 
status warrants a commit. As I suggested, the simplest case would be to see 
that all non-continuous jobs (at least those that the app cares about) have 
completed.

The app end might or might not be Solr itself. It could indeed be a plug-in for 
Solr, or just some other app process that has the specified context handler.

And, yes, the "commit at end of job" option is not terribly useful for complex, 
overlapping job arrangements. It's primary use case is for initial evaluation 
of LCF. But it might be sufficient for some simpler apps. Not all Solr apps are 
horribly complicated.

Maybe the option should technically be spec'ed as "commit at end of job, but 
only if no other jobs are active with the Solr output connector".

In some cases you might only want to commit when a specific job completes. For 
example, maybe a series of jobs are scheduled to run in sequence and the commit 
is only desired on completion of the final job in that sequence. In that case, 
the option is desired at the job level rather than for the Solr output 
connection itself. Is there any provision for job-specific output connector 
options?

-- Jack Krupansky


From: [email protected] 
Sent: Wednesday, June 02, 2010 10:19 AM
To: [email protected] 
Subject: RE: Setting up Solr


What about job deletion document cleanup, etc?  Overlapping job runs using the 
same output connection?  We've had this discussion before; the connector can 
certainly have hooks added but unless you intend to construct some kind of data 
structure on the Solr end that tries to keep track of all that, you're likely 
not going to get quite what you are looking for.

 

Karl

 

 

From: ext Jack Krupansky [mailto:[email protected]] 
Sent: Wednesday, June 02, 2010 10:15 AM
To: [email protected]
Subject: Re: Setting up Solr

 

It would be nice to have a "commit at end of job" option for the Solr output 
connector. Granted, commit policy can be a lot more complicated than that, but 
it is a simple use case that would facilitate initial evaluations of LCF with 
Solr.

 

Thinking further ahead, it would be very useful to have "job status 
notification" messages that could be sent to an app (say, a 
"/update/lcf-job-status" request handler) that would note start, end, abort, 
and periodic status of LCF jobs. Then the app could commit as it desires with 
respect to individual job completion and larger collections of jobs for 
different repositories. For example, an app might wait for all non-continuous 
jobs to complete before committing. That would be a more comprehensive 
longer-term solution for the commit problem, but the simple end-of-job commit 
option would be more user-friendly in the near-term.


-- Jack Krupansky

 

From: [email protected] 

Sent: Wednesday, June 02, 2010 9:09 AM

To: [email protected] 

Subject: RE: Setting up Solr

 

Solr has autocommit functionality built in.  Google for it and you will find 
out how to configure it.

 

Karl

 

From: ext [email protected] [mailto:[email protected]] 
Sent: Wednesday, June 02, 2010 9:08 AM
To: [email protected]
Subject: RE: Setting up Solr

 

Why can we have a job for this ? else is there any other way ?? (Windows ? in 
linux there are cron jobs )

 

Thanks & Regards,

Rohan G Patil

Cognizant  Programmer Analyst Trainee,Bangalore || Mob # +91 9535577001 

[email protected]

 

From: [email protected] [mailto:[email protected]] 
Sent: Wednesday, June 02, 2010 6:32 PM
To: [email protected]
Subject: RE: Setting up Solr

 

You can send any argument you want by configuring the output connector.  
However, the explicit commit on every post will slow down performance of your 
crawls.

 

Karl

 

From: ext [email protected] [mailto:[email protected]] 
Sent: Wednesday, June 02, 2010 9:00 AM
To: [email protected]
Subject: RE: Setting up Solr

 

Hi,

 

Yes that is where I was stuck up.. making an explicit commit.. 

 

Can I send the argument commit=true while configuring the Repo connector.

 

Thanks & Regards,

Rohan G Patil

Cognizant  Programmer Analyst Trainee,Bangalore || Mob # +91 9535577001 

[email protected]

 

From: Jack Krupansky [mailto:[email protected]] 
Sent: Wednesday, June 02, 2010 4:42 PM
To: [email protected]
Subject: Re: Setting up Solr

 

A short Solr tutorial is here:

 

http://lucene.apache.org/solr/tutorial.html

After running an LCF job that uses a Solr output connection, be sure to 
manually force a Solr "commit", for example:

 

    cd .../apache-solr-1.4.0/example/exampledocs
    java -jar post.jar


-- Jack Krupansky

 

From: [email protected] 

Sent: Wednesday, June 02, 2010 1:46 AM

To: [email protected] 

Subject: Setting up Solr

 

Hi,

 

I am stuck at setting up the Solr server to be used with LCF.

 

I am new to Solr.

 

Thanks & Regards,

Rohan G Patil

Cognizant  Programmer Analyst Trainee,Bangalore || Mob # +91 9535577001 

[email protected]

 

      This e-mail and any files transmitted with it are for the sole use of 
      the intended recipient(s) and may contain confidential and privileged 
      information.
      If you are not the intended recipient, please contact the sender by 
      reply e-mail and destroy all copies of the original message.
      Any unauthorized review, use, disclosure, dissemination, forwarding, 
      printing or copying of this email or any action taken in reliance on this 
      e-mail is strictly prohibited and may be unlawful.
     

 

      This e-mail and any files transmitted with it are for the sole use of 
      the intended recipient(s) and may contain confidential and privileged 
      information.
      If you are not the intended recipient, please contact the sender by 
      reply e-mail and destroy all copies of the original message.
      Any unauthorized review, use, disclosure, dissemination, forwarding, 
      printing or copying of this email or any action taken in reliance on this 
      e-mail is strictly prohibited and may be unlawful.
     

 

      This e-mail and any files transmitted with it are for the sole use of 
      the intended recipient(s) and may contain confidential and privileged 
      information.
      If you are not the intended recipient, please contact the sender by 
      reply e-mail and destroy all copies of the original message.
      Any unauthorized review, use, disclosure, dissemination, forwarding, 
      printing or copying of this email or any action taken in reliance on this 
      e-mail is strictly prohibited and may be unlawful.
     

 

Reply via email to