Re: Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-06 Thread Felipe Sateler
On Sat, 06 Aug 2011 07:58:35 +0200, Andreas Tille wrote: On Fri, Aug 05, 2011 at 11:28:55PM +, Felipe Sateler wrote: This does not apply to our topic as well. These lists are not (publicly) archived (or am I missing something?) BTW, both gmane and the mail archive seem to be

Re: [Soc-coordination] Removing Spam from the Listarchives (was: Debian mailing lists archives as mbox

2011-08-05 Thread Christian PERRIER
Quoting Olly Betts (o...@survex.com): rather than having a separate flag state for those, but the reports have been reviewed by a human, so should be higher quality than the unfiltered reports from clicks on the Spam buttons. Reports we get through the Spam buttons *are* reviewed by humans.

Re: [Soc-coordination] Removing Spam from the Listarchives (was: Debian mailing lists archives as mbox

2011-08-05 Thread Iain Lane
Hiya, On Fri, Aug 05, 2011 at 09:34:45AM +0200, Christian PERRIER wrote: Quoting Olly Betts (o...@survex.com): rather than having a separate flag state for those, but the reports have been reviewed by a human, so should be higher quality than the unfiltered reports from clicks on the

Re: [Soc-coordination] Removing Spam from the Listarchives (was: Debian mailing lists archives as mbox

2011-08-05 Thread Alexander Wirt
Olly Betts wrote on Friday, 05 August 2011: On Thu, Aug 04, 2011 at 11:01:22AM +, Cord Beermann wrote: [we should separate the GSoC-project from the ListArchiveSpam efforts] Indeed, I've dropped soc-coordination. as I wrote the review-stuff explained in

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-05 Thread Bernhard R. Link
* Alexander Wirt formo...@formorer.de [110804 11:30]: P.S. I know it's nice to be open. But publishing real names and mail addresses is a problem and at least problematic under German law (and probably for other countries). While publishing real names and mailing addresses might be a problem,

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-05 Thread Alexander Wirt
Bernhard R. Link wrote on Friday, 05 August 2011: * Alexander Wirt formo...@formorer.de [110804 11:30]: P.S. I know it's nice to be open. But publishing real names and mail addresses is a problem and at least problematic under German law (and probably for other countries). While

Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Andreas Tille
On Fri, Aug 05, 2011 at 12:20:03PM +0200, Alexander Wirt wrote: And you are of course sure that everybody using reportbug knows their address will be published? We have several complaints a month about the problem. Please stick to the topic. In this thread it is about mailing lists. However,

Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Andreas Tille
[sorry for pressing wrong key and sending unfinished mail] On Fri, Aug 05, 2011 at 12:20:03PM +0200, Alexander Wirt wrote: And you are of course sure that everybody using reportbug knows their address will be published? We have several complaints a month about the problem. Please stick to the

Re: Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Alexander Wirt
Andreas Tille wrote on Friday, 05 August 2011: [sorry for pressing wrong key and sending unfinished mail] On Fri, Aug 05, 2011 at 12:20:03PM +0200, Alexander Wirt wrote: And you are of course sure that everybody using reportbug knows their address will be published? We have

Re: Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Martin Wuertele
* Andreas Tille andr...@an3as.eu [2011-08-05 12:39]: On Fri, Aug 05, 2011 at 12:20:03PM +0200, Alexander Wirt wrote: And you are of course sure that everybody using reportbug knows their address will be published? We have several complaints a month about the problem. Please stick to

Re: Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Andreas Tille
On Fri, Aug 05, 2011 at 01:56:16PM +0200, Martin Wuertele wrote: On Fri, Aug 05, 2011 at 12:20:03PM +0200, Alexander Wirt wrote: And you are of course sure that everybody using reportbug knows their address will be published? We have several complaints a month about the problem.

Re: Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Felipe Sateler
On Fri, 05 Aug 2011 14:15:34 +0200, Andreas Tille wrote: On Fri, Aug 05, 2011 at 01:56:16PM +0200, Martin Wuertele wrote: On Fri, Aug 05, 2011 at 12:20:03PM +0200, Alexander Wirt wrote: And you are of course sure that everybody using reportbug knows their address will be published? We

Re: Publishing mboxes of mailing list archives (Was: Debian mailing lists archives as mbox)

2011-08-05 Thread Andreas Tille
On Fri, Aug 05, 2011 at 11:28:55PM +, Felipe Sateler wrote: This does not apply to our topic as well. These lists are not (publicly) archived (or am I missing something?) BTW, both gmane and the mail archive seem to be archiving the bugs lists. I have no idea why people are really

Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Christian PERRIER
sad that you have to do this while Debian mailing lists archives available as mbox *do exist* (but they're only available for DDs with accounts in Debian machines). I understand there could be some reluctance to offering these completely anonymously but I wonder if there could be an intermediate way

Re: Debian mailing lists archives as mbox

2011-08-04 Thread Niels Thykier
for them. This allows us to I find it really sad that you have to do this while Debian mailing lists archives available as mbox *do exist* (but they're only available for DDs with accounts in Debian machines). I understand there could be some reluctance to offering these completely

Re: Debian mailing lists archives as mbox

2011-08-04 Thread Christian PERRIER
Quoting Niels Thykier (ni...@thykier.net): To my knowledge you can download them via: http://lists.debian.org/cgi-bin/mbox/list-MM Indeed. Well, it seems that at least Sukhbir, /me and Andreas Tille weren't aware of this (for Sukhbir, that's understandable, for Andreas and /me, only

Re: Debian mailing lists archives as mbox

2011-08-04 Thread Alexander Wirt
Christian PERRIER wrote on Thursday, 04 August 2011: Quoting Niels Thykier (ni...@thykier.net): To my knowledge you can download them via: http://lists.debian.org/cgi-bin/mbox/list-MM Indeed. Well, it seems that at least Sukhbir, /me and Andreas Tille weren't aware of

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Alexander Wirt
and then creates a mbox archive for them. This allows us to I find it really sad that you have to do this while Debian mailing lists archives available as mbox *do exist* (but they're only available for DDs with accounts in Debian machines). I understand there could be some reluctance to offering

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Andreas Tille
On Thu, Aug 04, 2011 at 09:44:49AM +0200, Alexander Wirt wrote: We had an ongoing discussion about privacy, spam and so on regarding the mboxes. We even managed to reach consensus yesterday. To bring some light into this I would like to publish the consensus we had: A filter needs to be

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Alexander Wirt
Andreas Tille wrote on Thursday, 04 August 2011: On Thu, Aug 04, 2011 at 09:44:49AM +0200, Alexander Wirt wrote: We had an ongoing discussion about privacy, spam and so on regarding the mboxes. We even managed to reach consensus yesterday. To bring some light into this I would like

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Andreas Tille
On Thu, Aug 04, 2011 at 10:31:33AM +0200, Alexander Wirt wrote: Just for the record. The mboxes are not meant to be published. We are currently working on getting more data privacy protection into the archive, so simply publishing the mboxes would be counterproductive. Thanks for clarifying

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Alexander Wirt
Andreas Tille wrote on Thursday, 04 August 2011: On Thu, Aug 04, 2011 at 10:31:33AM +0200, Alexander Wirt wrote: Just for the record. The mboxes are not meant to be published. We are currently working on getting more data privacy protection into the archive, so simply publishing the

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Andreas Tille
On Thu, Aug 04, 2011 at 11:02:20AM +0200, Alexander Wirt wrote: Just for the record. The mboxes are not meant to be published. We are currently working on getting more data privacy protection into the archive, so simply publishing the mboxes would be counterproductive. Thanks for

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Alexander Wirt
Andreas Tille wrote on Thursday, 04 August 2011: On Thu, Aug 04, 2011 at 11:02:20AM +0200, Alexander Wirt wrote: Just for the record. The mboxes are not meant to be published. We are currently working on getting more data privacy protection into the archive, so simply publishing

Re: [Soc-coordination] Debian mailing lists archives as mbox (was: Re: Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Alexander Wirt
Sukhbir Singh wrote on Thursday, 04 August 2011: Hi Alex, Can we have some prototype/format of the Message-IDs that you want us to strip? It would be beneficial for both sides because then we can show you what we will be handling and you can tell if something else needs to be taken

Re: [Soc-coordination] Debian mailing lists archives as mbox (was: Re: Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Sukhbir Singh
Hi Alex, Can we have some prototype/format of the Message-IDs that you want us to strip? It would be beneficial for both sides because then we can show you what we will be handling and you can tell if something else needs to be taken care of. Thanks for the help, -- Sukhbir

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Andreas Tille
On Thu, Aug 04, 2011 at 11:29:41AM +0200, Alexander Wirt wrote: As far as I know it is not only Christian and if we really want to get more people involved in Debian (including non-technicians) tasks like cleaning up the list archive from SPAM might be somehow interesting. Yeah, but I

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Alexander Wirt
Andreas Tille wrote on Thursday, 04 August 2011: On Thu, Aug 04, 2011 at 11:29:41AM +0200, Alexander Wirt wrote: As far as I know it is not only Christian and if we really want to get more people involved in Debian (including non-technicians) tasks like cleaning up the list

Removing Spam from the Listarchives (was: Debian mailing lists archives as mbox

2011-08-04 Thread Cord Beermann
Hello! You (Andreas Tille) wrote: [we should separate the GSoC-project from the ListArchiveSpam efforts] As far as I know it is not only Christian and if we really want to get more people involved in Debian (including non-technicians) tasks like cleaning up the list archive from

Re: Debian mailing lists archives as mbox (was: Re: [Soc-coordination] Debian Teams Activity Metrics - Report IV) [Update]

2011-08-04 Thread Michelle Konzack
Hello Andreas Tille, On 2011-08-04 10:22:43, you hacked down the following: - Parse the existing mboxes and strip them down to the following information Message-id: ID From: Name of poster e-mail@of.poster Date: Date Subject: Subject Content snip So far
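The stripping step quoted in this message (keep only Message-id, From, Date, Subject and the content) could look roughly like the following. This is only a minimal sketch using Python's standard `mailbox` module, not the tool discussed in the thread; the names `strip_mbox`, `body_text` and `KEPT_HEADERS` are made up for illustration, and treating "Content" as the first text/plain part is an assumption.

```python
import mailbox

# Headers named in the quoted proposal; everything else is dropped.
KEPT_HEADERS = ("Message-ID", "From", "Date", "Subject")


def body_text(msg):
    """Return the first text/plain part of a message as a string.

    Assumption: "Content" in the proposal means the plain-text body.
    """
    for part in msg.walk():
        if part.get_content_type() == "text/plain":
            payload = part.get_payload(decode=True)
            if payload is None:
                return ""
            charset = part.get_content_charset() or "utf-8"
            try:
                return payload.decode(charset, errors="replace")
            except LookupError:
                # Unknown charset label: fall back to UTF-8.
                return payload.decode("utf-8", errors="replace")
    return ""


def strip_mbox(path):
    """Yield one dict per message with only the kept headers and the body."""
    box = mailbox.mbox(path, create=False)
    try:
        for msg in box:
            record = {}
            for name in KEPT_HEADERS:
                value = msg[name]
                record[name] = None if value is None else str(value)
            record["Content"] = body_text(msg)
            yield record
    finally:
        box.close()
```

Something like `for record in strip_mbox("debian-devel-201108.mbox"): print(record["Subject"])` would then list the subjects; the file name here is hypothetical. Note that the From header still carries the poster's address, so this sketch does not address the privacy concerns raised elsewhere in the thread.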

Re: [Soc-coordination] Removing Spam from the Listarchives (was: Debian mailing lists archives as mbox

2011-08-04 Thread Olly Betts
On Thu, Aug 04, 2011 at 11:01:22AM +, Cord Beermann wrote: [we should separate the GSoC-project from the ListArchiveSpam efforts] Indeed, I've dropped soc-coordination. as I wrote the review-stuff explained in