Re: SCR interaction with ServiceTracker

Kris Pruden Wed, 16 Dec 2009 15:02:16 -0800


On Dec 16, 2009, at 12:42 PM, Felix Meschberger wrote:

Hi,

Kris Pruden schrieb:
Hi,
I'm puzzling through some strange behaviour in my applicationrelated towhat appears to be a race and/or some weird interaction betweenservicesregistered via SCR and bundles trying to monitor these services viathe
ServiceTracker API.

I don't have any good theories at the moment, so I was hoping if I
described what I'm seeing it might job somebody's memory...

Here's my situation.  I have an application made up of about a
half-dozen or so bundles, most (but not all) of which register their
services using DS/SCR. I'm using pax:exam to run automatedfunctionaltests for this application. Unfortunately, my tests are notreliable -they fail intermittently. When there's a failure, it's alwaysbecause
of an NPE on one of the SCR-injected @Reference attributes of a
service/component. It's hard to tell for sure, but what looks likeis
happening is this:
1. The SCR starts and registers a component/service
2. The functional test, which was waiting for that service to become
available, gets a reference to the service
3. The SCR stops the service, then immediately starts and registers a
new instance of the service
Unfortunately, at this point the test bundle now has a reference tothe"stopped" instance of the service, which has had all of its@Reference
fields set to null (hence the NPE).

What I can't figure out is why this is happening.
Anyone have any suggestions about where I should look next, or anyknown
SCR gotchas that I might be running into?
Well, from far outside, it must be said, that a service may come andgo
at any time for any reason...
Now, this doesn't help you, of course. But without a more in-depthlookinto the situation I cannot tell much, other than: It should workactually.
Let me quickly recapitulate:
Your component under test is a SCR component which also isregistered as
a service: Is this a delayed service component or a service factory
component ?

It's not a factory, and it is configured to start immediately. Theseare the annotations on the service in question:


@Component(immediate=true)
@Service

SCR stops and immediately restarts the service: Are you updating
configuration admin configuration supplied to the component during the
test ?

Not as far as I know. I should point out that I'm not 100% sure thatmy claim of the start/stop/start behaviour is in fact true. I seeevidence that this happens - I've seen log output from the SCRannouncing the activation of a component, then another messageindicating that it's deactivated, then another one saying it'sactivated again. I've instrumented the activate/deactivate methods onthe service implementation and confirmed that this is in facthappening, and that the second activation is a new instance of theservice class. However, even in cases where I don't see thishappening, I still am seeing the NPE from time to time (more on this).

You talk about a @Reference annotation: Does this mean the SCRcomponent

under test has a reference to another service ? Is this reference
mandatory or optional ? Is it static or dynamic ? Is there a change in
the referenced service during the test ?

Yes, so to be a bit more concrete I have the service my functionaltest is exercising, call it ServiceA. This service is annotated asabove and contains a dependency on a second service, ServiceB. Theclass definition looks something like this:


@Component(immediate=true)
@Service
public class ServiceAImpl implements ServiceA {
    @Reference
    ServiceB serviceB;

    ...
}

ServiceB is similarly defined, and (maybe this is relevant) has itsown dependencies on other services, also injected via the @Referenceannotation as above, but at least one of these is not itself managedvia SCR (that is, it's a regular old service registered via a bundleactivator). Further (again, not sure if it's relevant), this non-DSservice is actually provided via a ServiceFactory...

Last but not least: What DS implementation are you using ? Or: what
version of Felix SCR are you using ?


We're using Felix SCR 1.2.0.

One other data point: I'm pretty sure the issue is at least somewhattiming related. The reason is that I have tried the build on twoseparate computers, one a (relatively) slow laptop (a macbook pro),the other a quite powerful workstation (quad-core, 8GB RAM, runninglinux). The tests reliably *pass* on the slow laptop, but fairlyreliably *fails* on the fast machine. My theory is that the fastmachine is winning the race against SCR in this start/stop/start loop(and therefore losing) while the slow machine is losing that race...

I'm still running experiments; I'll update with any new data I findthat I think might be helpful. I realize issues like this are hardenough to figure out when you're watching them happen, so I don'treally expect you to know what's going on here (although if you dothat would be awesome). Mostly I'm just wondering if this setup istickling a known timing issue or something along those lines...


Thanks again,

Kris

Regards
Felix


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: SCR interaction with ServiceTracker

Reply via email to