On 2/3/07, Erik Hatcher <[EMAIL PROTECTED]> wrote:
On Feb 2, 2007, at 4:29 PM, Yonik Seeley wrote:
> One downside of doing joins is that it makes it pretty hard to
> distribute/federate in the future because a document doesn't stand
> alone.

The connection between objects is key in our library domain though.

> A flat structure for tagging could be to add a
> taguser and tag field to the actual document each time a user
> tagged a document.

I've been contemplating how that would look and work.  But the
downsides you mention are sorta show-stoppers for our needs:

The main one being query or facet by all tags for a specific user?
That's doable I think.

I assume an annotation is a comment (like a few sentences)?
If you search on comments, do you just get the comments back with a
pointer to the original doc, or do you get the original doc back?

Storing comments on a document:
- could lead to increased relevancy... all comments from all users would be
  considered together for term-freq
- easy to get comments for a list of documents in a single query
- can use lucene syntax across "A" fields like tite, and commentary.
   +title:solr+comments:great
- harder to search for comments from a specific user only
  (need sloppy phrase or span queries to do this?)

Storing comments separately:
- if you search in comments, you get the exact comment that matched... if you
  stored all comments on the A doc, you wouldn't know which matched
(but highlighting
  could help with that).
- easy to search comments only from a specific user

Do comments need to be included in faceting in any way?

-Yonik

ps: If I'm making less sense than usual, it might just be because it's
the time of the year that kids bring home nasty germs, and I'm feeling
rather fuzzy headed :-)

Reply via email to