On 2/3/07, Erik Hatcher <[EMAIL PROTECTED]> wrote:
On Feb 2, 2007, at 4:29 PM, Yonik Seeley wrote:
> One downside of doing joins is that it makes it pretty hard to
> distribute/federate in the future because a document doesn't stand
> alone.
The connection between objects is key in our library domain though.
> A flat structure for tagging could be to add a
> taguser and tag field to the actual document each time a user
> tagged a document.
I've been contemplating how that would look and work. But the
downsides you mention are sorta show-stoppers for our needs:
The main one being query or facet by all tags for a specific user?
That's doable I think.
I assume an annotation is a comment (like a few sentences)?
If you search on comments, do you just get the comments back with a
pointer to the original doc, or do you get the original doc back?
Storing comments on a document:
- could lead to increased relevancy... all comments from all users would be
considered together for term-freq
- easy to get comments for a list of documents in a single query
- can use lucene syntax across "A" fields like tite, and commentary.
+title:solr+comments:great
- harder to search for comments from a specific user only
(need sloppy phrase or span queries to do this?)
Storing comments separately:
- if you search in comments, you get the exact comment that matched... if you
stored all comments on the A doc, you wouldn't know which matched
(but highlighting
could help with that).
- easy to search comments only from a specific user
Do comments need to be included in faceting in any way?
-Yonik
ps: If I'm making less sense than usual, it might just be because it's
the time of the year that kids bring home nasty germs, and I'm feeling
rather fuzzy headed :-)