Re: [DAS] Adjacent feature extension

Jonathan Warren Mon, 07 Mar 2011 02:58:47 -0800


On 7 Mar 2011, at 10:35, Thomas Down wrote:

On Mon, Mar 7, 2011 at 10:04 AM, Andy Jenkinson <[email protected]>wrote:
Hi Thomas,
Thanks for this. Regarding the option of whether to return just onefeature
per side or all overlapping features, the only other advantage that
immediately springs to mind for the latter (in addition to somemeasure ofconsistency, as you mention) is that it allows the client toimmediatelyrender the exact region of that feature without triggering anotherrequest.It would generally mean changing zoom level. I'm can't say ifclients arelikely to follow this mechanism as opposed to, say, pan and centreon thefeature, but if they wanted to it would be more efficient (andpossibly alittle bit more efficient anyway depending on how your client doesits
requests).
Yep, I agree. I'd be interested to learn whether there are anyclients thatwould seriously consider taking advantage of this. My own thinkingis thateven if we do adjust zoom level (as Dalliance sometimes does, e.g.in the"jump to gene..." navigation op), clients are much more likely tozoom to aview that contains the target feature plus a "sensible" amount offlankingsequence, rather than a view where the target feature is perfectlyframed.
Furthermore, this rather seems like optimizing for the case whereonly one
annotation source is active.   Surely we're talking about the
*distributed*annotation system, and clients will still have to go off
and query all the
other annotation sources, even if they are able to skip the one which
responded to the "adjacent" query. So long as there's some kind ofquery
parallelization in place, this probably isn't a performance issue.

My vote would ideally to change feature_by_id to return one featureand have the adjacent_feature as returning one feature. This in myopinion would mean these capabilities on servers do "exactly as theysay on the tin" and would be easier to implement for data providersand are thus more likely to be implemented?If the feature_id capability as it stands is needed it could bechanged to something more akin to what it means like feature_id_regionbut I would bet no one would bother to change it/use it?

However the reality is that we are too late to change the oldfeature_by_id, but I don't think we need to make the same mistaketwice by repeating it for adjacent_features?

Do any other client developers feel differently?
Disadvantages I can think of:
- "adjacent" request takes marginally longer
- not quite as obvious what clients should put in their UI controls- need
to pick a feature to be able to do "jump to BRCA1"
- risk of servers not implementing it correctly and only returningonefeature anyway (although I don't think this is likely as theconcept is
different to "feature-by-id")

Some things to further define:
- servers can't return a fake feature
Yep, will clarify this.
- should servers return features on different reference sequencesif there
are none one the current one?
In my opinion, absolutely yes. Otherwise the "10 features in thegenome"
case remains a massive pain (and potentially a disaster, for
inhomogeneous-dstributed data; won't someone think of the MHC tilingarrays?:-). And even worse for the "10 features in UniProt" case (where Ican also
see this feature being quite interesting).
I've tried to be explicit about this in my proposal (see thepenultimateparagraph + example 3), but any suggestions for furtherclarifications are
welcome.
- how should servers treat features that overlap the adjacentrange? Treatthem as the adjacent feature to return, or only include featurescompletelyoutside the query range? What if the next feature completelyoutside thequery range is part of the same feature hierarchy (e.g. an exonoutside the
current window).
It's a point rather than a range, but yes I agree this is still anopenquestion. I'd actually written the spec such that overlappingfeatures doget returned (on the assumption that clients will do "trivial" casesofnext/previous feature in-memory without a network round trip), butagain if
other client developers do things differently, I'd like to know.
I think "include overlapping" will have less special-cases to worryabout,though. e.g. the PART/PARENT issue you allude to. Let clients dealwith
that ("dumb servers, smart clients").

                Thomas.
_______________________________________________
DAS mailing list
[email protected]
http://lists.open-bio.org/mailman/listinfo/das


Jonathan Warren
Senior Developer and DAS coordinator
blog: http://biodasman.wordpress.com/
[email protected]
Ext: 2314
Telephone: 01223 492314










--

The Wellcome Trust Sanger Institute is operated by Genome ResearchLimited, a charity registered in England with number 1021457 and acompany registered in England with number 2742969, whose registeredoffice is 215 Euston Road, London, NW1 2BE._______________________________________________

DAS mailing list
[email protected]
http://lists.open-bio.org/mailman/listinfo/das

Re: [DAS] Adjacent feature extension

Reply via email to