Marc G. Fournier wrote:
On Thu, 13 May 2010, Alvaro Herrera wrote:

Excerpts from Yeb Havinga's message of Thu May 13 15:06:53 -0400 2010:

My $0.02 - I like the whole 'don't sort, search' idea (or what did they
call it?): just let the inbox fill up, Google is fast enough. What would be
really interesting is to have some extra 'tags/headers' added to the
emails (document classification with e.g. a self-organizing map/Kohonen),
so my local filters could make labels based on that, instead of on perhaps
badly spelled keywords in subjects or message bodies.

I missed this when I read it the first time ... all list email does have an X-Mailing-List header added, so that you can label based on the list itself ... is that what you mean, or are you thinking of something else entirely?

Something else: if automatic classification of articles were in place, fewer mailing lists would be needed, depending on the quality of the classification.
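
For comparison, labeling on the list header mentioned above could look roughly like the following. This is only a minimal sketch using Python's standard `email` module; the function name `list_label`, the default label and the sample message are made up for illustration, and a real mail client filter would do the same thing in its own rule syntax.

```python
from email import message_from_string


def list_label(raw_message: str, default: str = "inbox") -> str:
    """Return a label taken from the X-Mailing-List header, or a default.

    Header lookup in the email module is case-insensitive, so
    'x-mailing-list' in the raw message would match as well.
    """
    msg = message_from_string(raw_message)
    value = msg.get("X-Mailing-List")
    if value is None:
        return default
    return value.strip().lower()


# Hypothetical message, only for demonstration.
sample = (
    "From: someone@example.org\n"
    "X-Mailing-List: pgsql-hackers\n"
    "Subject: example\n"
    "\n"
    "body text\n"
)

label = list_label(sample)  # "pgsql-hackers"
```

This labels per list only; it says nothing about what an individual article is about, which is the part I am after.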

IMHO the problem of handling the high volume of the lists is not solved by splitting them into more lists, since that does not decrease the number of posts that are interesting from the subscriber's perspective. It would only mean that posters are more likely to make mistakes, with a possible increase in crossposts or 'my question was not answered there so now I try here' on the sender's side, and a bigger chance of missing interesting articles on the subscriber's side. That my current mailing list setup works for me supports this claim: I did not subscribe to fewer lists, but managed to decrease the ms spent on 'handling' to a workable amount.

Though I do not believe algorithmic article classification/ranking would provide a 100% foolproof filter, it might help decrease the "ms spent per article" further. Take a look at how "carrot2" clusters results from the query "postgresql prepared transactions site:postgresql.org" - http://search.carrot2.org/stable/search?source=web&view=tree&skin=fancy-compact&query=postgresql+prepared+transactions+site%3Apostgresql.org&results=100&algorithm=lingo&EToolsDocumentSource.country=ALL&EToolsDocumentSource.language=ENGLISH&EToolsDocumentSource.safeSearch=false

I wonder if a clustering algorithm could tag articles with (multiple) keywords, e.g. 'hackers', 'prepared transaction', 'dba', etc. I could then build filters or rankings on those tags: hackers AND optimizer -> +10.
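
To illustrate the kind of ranking rule I have in mind, here is a rough Python sketch. It assumes the tags have already been attached to an article by some classifier (not shown); the rule table, the `score` function and the second rule are hypothetical and only there to show the AND semantics.

```python
from typing import FrozenSet, Iterable, List, Tuple

# Each rule: (tags that must ALL be present on the article, score adjustment).
Rule = Tuple[FrozenSet[str], int]

RULES: List[Rule] = [
    (frozenset({"hackers", "optimizer"}), 10),  # hackers AND optimizer -> +10
    (frozenset({"dba"}), 2),                    # hypothetical extra rule
]


def score(tags: Iterable[str], rules: List[Rule] = RULES) -> int:
    """Sum the adjustments of every rule whose required tags are all present."""
    present = {t.lower() for t in tags}
    # 'required <= present' is the subset test, i.e. a logical AND over tags.
    return sum(bonus for required, bonus in rules if required <= present)


# An article tagged by the (assumed) classifier:
article_tags = ["hackers", "optimizer", "prepared transaction"]
article_score = score(article_tags)  # 10
```

Articles could then be sorted by score, or only shown above some threshold, so the filter ranks rather than hides.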

regards,
Yeb Havinga


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
