Re: Integrating Annotator into Memex

Benjamin Young Thu, 23 Apr 2020 07:03:15 -0700

Thanks for posting that here, Oliver!

We'll probably have to take those questions one at a time. 🙂 I'd actually like 
to start with the last one--as it spells out some target APIs and code.


> 3. What do you think will be the challenges to get Apache Annotator work on 
> both a reader and a full-html version?

If I understand the use case correctly, you're wanting annotation on something 
like Firefox's "reader view" (where the original HTML is stripped away, and 
only the content remains) and wanting those same annotations to be 
re-anchor-able on the original HTML (and vice versa).

If that's indeed what you're after, then the "hard" part is making sure we have 
a way for implementations to "opt-in" to fuzzy anchoring when they both create 
and use an annotation.

For starters, you could simply store the TextQuoteSelector which *should* 
re-anchor on both those representations (and possibly even on a PDF), but it 
would come at the cost of performance on large documents. So, what you'd want 
to follow that up with is additional, narrower, more brittle selectors which 
would (knowingly) fail when you switch representations, but would give you 
better performance on a specific representation--i.e. you'd have an XPath or 
CSS selector for the original HTML which would fail on the "reader" and/or 
"PDF" view at which point you'd (knowingly) fall back to the TextQuoteSelector.

I think the core "plumbing" for that is already available, but Randall or 
Gerben would know better. 🙂

Is that what you're after?

Cheers,
Benjamin


--

http://bigbluehat.com/

http://linkedin.com/in/benjaminyoung

________________________________
From: Oliver Sauter <o...@worldbrain.io>
Sent: Wednesday, April 22, 2020 12:17 PM
To: dev@annotator.incubator.apache.org <dev@annotator.incubator.apache.org>
Subject: Integrating Annotator into Memex

Hey folks,

I just had a call with Benjamin and we talked about the ability to integrate 
annotator into getmemex.com <http://getmemex.com/>
Right now we use the Hypothes.is <http://hypothes.is/> library but it is 
causing us some troubles (mainly the ram usage for hooking it into each tab)

But also we are about to start the development of the Pocket-style 
offline-reader for desktop and mobile on which we want to also integrate 
annotation capabilities.
This means there is an anticipated use case where people annotate on a 
reader-version and want to see the annotations also successfully anchored on a 
live html page. Annotating a reader-version will be missing a lot of details 
usually used for anchoring the annotations, so the challenge would be to make 
those interoperable with Apache Annotator.

So the questions I have:
1. How mature is Annotator in terms of its ability to replace the hypothesis 
annotation library? What still needs to be done (and how much work for that? 
Where do you need help?)
2. How much work do you anticipate for a replacement?
3. What do you think will be the challenges to get Apache Annotator work on 
both a reader and a full-html version?

I’ve been looking forward to find a way to collaborate so hopefully this time 
is the time!
Cheers
Oliver

Re: Integrating Annotator into Memex

Reply via email to