Thanks. That's very abstract and old, but perhaps I could work something out using this.

Any other pointers / opinions welcome...


Itamar.


On 17/06/2011 03:26, Andrzej Bialecki wrote:

On 6/17/11 12:29 AM, Itamar Syn-Hershko wrote:
No, that was not what I meant.


I'm not interested in coloring the actual text, but in giving the user
an indication of how relevant the results are. Instead of displaying the
result score, I want to give some visual meter to show that. The highest
ranking result will be green if the it is really relevant to the query,
yellow if less and so on.


My question is if there's a good enough way to measure this - for
example if the first results is 20 times the score of the second, or
that's something I can't really do...

What you describe reminds me of a special case of a more general visualization technique called Tilebars. Tilebars are an N x M histogram, where N rows correspond to N query terms (or clauses), and M columns represent buckets of score contributions distributed over the sections of the document. Each cell is then colored according to the portion of score that it contains.

The coloring schema may be non-linear, and the document division into chunks may not be linear either.

You can then collapse Tilebars if you wish in one or both dimensions, e.g.:

* N x 1 represents relative score contributions from each query term
* 1 x M represents the distribution of score contributions along the length of document * 1 x 1 represents the relative score of all clauses for the whole document, relative to other documents (the difference is in coloring)

See this link for more details: http://people.ischool.berkeley.edu/~hearst/research/tilebars.html


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to