Re: [caiman-discuss] Engine's use of errsvc

Sarah Jelinek Mon, 19 Jul 2010 10:28:23 -0700


Hi Karen,


On 07/17/10 01:30 AM, Karen Tung wrote:

During the implementation of the engine, we found a problem
with using errsvc.

Background:
------------------------
-  Only 1 instance of errsvc exists in the name space
of the application and all modules and libraries it uses.
- Errors (instances of ErrorInfo) are stored in one single list called
_ERRORS in errsvc.py.
- Engine is going to use errsvc to store exception(s) raised
by checkpoints' execute() method.  Engine will use checkpoint name
as the mod_id for the ErrorInfo objects so application can
easily identify which checkpoint failed.

Problem:
-----------------
- If the application chooses to execute the same checkpoint multiple
times, and all the executions fail, multiple ErrorInfo objects with the
same mod_id will be added to the list of errors.  The application
cannot easily figure out which ErrorInfo belongs to which
invocation of that same checkpoint.
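
To make the scenario concrete, here is a minimal sketch. It does not use
the real errsvc module: _ERRORS, ErrorInfo, get_errors_by_mod_id() and
execute_checkpoint() below are simplified stand-ins, and the checkpoint
name and disk names are made up for illustration.

```python
# Simplified stand-in for errsvc: one module-level list shared by all callers.
# None of this is the real errsvc code; names and signatures are assumptions.
_ERRORS = []


class ErrorInfo(object):
    """Records one error under a module id and adds itself to _ERRORS."""

    def __init__(self, mod_id, exc):
        self.mod_id = mod_id
        self.exc = exc
        _ERRORS.append(self)


def get_errors_by_mod_id(mod_id):
    """Return every stored error whose mod_id matches."""
    return [err for err in _ERRORS if err.mod_id == mod_id]


def execute_checkpoint(name, func, *args):
    """Run one checkpoint; store any exception under the checkpoint name."""
    try:
        func(*args)
        return True
    except Exception as exc:
        ErrorInfo(name, exc)
        return False


def target_discovery(disk):
    """Hypothetical checkpoint body that always fails."""
    raise RuntimeError("cannot probe %s" % disk)


# The same checkpoint is run twice with different input, and fails twice.
execute_checkpoint("target_discovery", target_discovery, "c0t0d0")
execute_checkpoint("target_discovery", target_discovery, "c0t1d0")

# Both errors come back under the same mod_id; only their contents differ.
errors = get_errors_by_mod_id("target_discovery")
```

In this sketch the two entries share a mod_id, so the only way to tell
the invocations apart is to inspect the stored exceptions themselves.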

How likely is this scenario to actually happen? I assume that what you
are saying is that only 1 checkpoint of this type is registered but the
app runs it multiple times? Is it likely that the same code paths would
be run in the multiple invocations? Even if the mod_id is the same, the
error 'stack' would be different, wouldn't it, with multiple runs of the
same checkpoint? Wouldn't it be likely that something would be
different, such as input with the multiple invocations? I think there
has to be some differentiating data even if you run the same checkpoint
multiple times. Maybe I am not seeing a scenario where this could happen
without anything different?

Possible Solutions:
-------------------------------
1) engine.execute_checkpoint() will always call
errsvc.clear_error_list() before it executes any checkpoint.
This way, when execution completes, the errsvc
will only contain errors raised during that execution.
The problem with this approach is that
error messages in errsvc that were not stored by the engine prior
to the execution, and not dealt with by the app, will get lost.

I personally don't see any issue with this - if you've got this far and no-one
has handled the error, then it's probably not worth remembering... BUT, there
would be no harm in logging anything in the error list before you clear it, just
for completeness.

Actually, I do have a concern about this. If someone is running a test
run, and wants to ignore errors along the way but capture them in the
error service for later dumping by the app, this means we will lose this
data. Presumably they were running these checkpoints multiple times for
a reason. Logging it is ok, but it makes it harder for the user to see
the errors.

3) Move the responsibility to the application to manage the
error list.  They should either clear it or remove the ErrorInfo objects
they are not interested in.  The engine.execute_checkpoint()
will simply append to the error list, and return the list of
checkpoint names that failed.

It's everyone's responsibility to look for errors - so before Engine.execute()
is called, the Application really should have looked for errors that might have
occurred so far anyway.

I thought the whole idea was to set the stop_on_error bit so that the
execution engine could determine what to do. And, the app then takes the
error info objects and does what it needs to do. How would the app check
the errors before the Engine.execute call in between checkpoints? It
couldn't, and I would think it would be hard for the app to know which
ErrorInfo objects it is interested in.

Seems to me if the app is really running the same checkpoint multiple
times that we need a way to differentiate these invocations. Even beyond
error handling, don't we have the same issues with logging? From your
design spec for the engine we are relying on the checkpoint name, which
would be the same for every invocation in this scenario.
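
One possible way to differentiate invocations, sketched under
assumptions: the engine tags each run with a counter and uses
"name#N" as the mod_id. The format, the counter and this
execute_checkpoint() signature are all hypothetical; nothing in the
thread settles on this approach.

```python
import itertools

# Stand-in for the errsvc error list; entries are (mod_id, exception) tuples.
_ERRORS = []

# Hypothetical engine-wide counter, incremented once per checkpoint run.
_invocations = itertools.count(1)


def execute_checkpoint(name, func, *args):
    """Run one checkpoint under a mod_id that is unique per invocation.

    Returns the mod_id if the checkpoint failed, otherwise None.
    """
    mod_id = "%s#%d" % (name, next(_invocations))
    try:
        func(*args)
    except Exception as exc:
        _ERRORS.append((mod_id, exc))
        return mod_id
    return None


def transfer(source):
    """Hypothetical checkpoint body that always fails."""
    raise IOError("cannot read %s" % source)


first = execute_checkpoint("transfer", transfer, "/mnt/a")
second = execute_checkpoint("transfer", transfer, "/mnt/b")
failed_ids = [mod_id for mod_id, _ in _ERRORS]
```

The same per-invocation identifier could also be passed to the logger,
so log lines and ErrorInfo objects from one run can be matched up.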


sarah


_______________________________________________
caiman-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss
