Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-25 Thread Kevin D. Clark
Benjamin Scott writes:

   According to their FAQ, they keep mail indefinitely.  They say they only
 keep the most recent 3000 msgs in data or thread indexes; is that what you
 mean?  Or are they not working as advertised?

All I know is that I could only look back 196 days at that instant.
If the email isn't indexed, then it is as good as lost AFAIK.

--kevin
-- 
GnuPG ID: B280F24E And the madness of the crowd
alumni.unh.edu!kdc Is an epileptic fit
   -- Tom Waits
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-25 Thread Kevin D. Clark

Kevin D. Clark writes:

 Bill McGonigle [EMAIL PROTECTED] writes:

 This appears to be the main stumbling block.  Mailman has an e-mail
 address obfuscation feature but it's very sad ('user at domain').  If
 I were writing a harvester I think I'd throw in the 1-line regex to
 fix these so it's not really a worthwhile countermeasure.  I'm looking
 at what it would take to have Mailman just remove the e-mail address
 (display only the 'real name') or only show the user-part if there's
 no 'real name' provided so the offer is DOA unless I get that fixed.

 I wrote a program that did most of this and posted this to the list a
 while ago.  Unfortunately, I don't have my personal archive with me
 right now.  Anybody?  Otherwise I'll try to retreive this tonight.

Apologies if you get this more than once.  I sent this out last night
but I haven't seen it get delivered yet.

This program is a filter that strips out email addresses.  I hope that
this program can be useful in addressing our current issue.







To: gnhlug-discuss
Subject: Re: I HATE SPAM (was Re: Mouse swapping on a laptop)
From: [EMAIL PROTECTED] (Kevin D. Clark)
Date: 04 Aug 2003 08:52:30 -0400

I vote that we:

1:  Not require people to strip email addresses from the headers and
body of posts.  This is too much work.  Humans shouldn't have to
do this.

2:  Keep the archive going, keep it world-accessible

3:  Obfuscate email addresses in the web-archives.  Many, many lists
that I am on do this already -- it makes sense.


WRT (3) above, I propose that we either configure our web-archiver to
do this (if it already does this), or else we just do this ourselves.
To this end, I have written (***) a Perl filter that does this
automagically.  This filter could be used to filter our existing
archives, or it could be used from a procmail script inside the
web-archiving box in order to obfuscate all future postings.

*** This is basically Jeff Friedl's well-known email regexp, combined
with around 3 lines of my own code at the bottom.

Regards,

--kevin
-- 
Some people, when confronted with a problem, think ``I know,
 I'll use regular expressions.'' Now they have two problems.

  -- Jamie Zawinski




#!/usr/bin/perl

# Program to build a regex to match an internet email address,
# from Chapter 7 of _Mastering Regular Expressions_ (Friedl / O'Reilly)
# (http://www.ora.com/catalog/regexp/)
#
# Optimized version.
#
# Copyright 1997 O'Reilly  Associates, Inc.
#



# Some things for avoiding backslashitis later on.
$esc= '';   $Period  = '\.';
$space  = '\040';   $tab = '\t';
$OpenBR = '\['; $CloseBR = '\]';
$OpenParen  = '\('; $CloseParen  = '\)';
$NonASCII   = '\x80-\xff';  $ctrl= '\000-\037';
$CRlist = '\n\015';  # note: this should really be only \015.

# Items 19, 20, 21
$qtext = qq/[^$esc$NonASCII$CRlist\]/;   # for within ...
$dtext = qq/[^$esc$NonASCII$CRlist$OpenBR$CloseBR]/;  # for within [...]
$quoted_pair = qq $esc [^$NonASCII] ; # an escaped character

##
# Items 22 and 23, comment.
# Impossible to do properly with a regex, I make do by allowing at most one 
level of nesting.
$ctext   = qq [^$esc$NonASCII$CRlist()] ;

# $Cnested matches one non-nested comment.
# It is unrolled, with normal of $ctext, special of $quoted_pair.
$Cnested = qq
   $OpenParen#  (
  $ctext*# normal*
  (?: $quoted_pair $ctext* )*# (special normal*)*
   $CloseParen   #   )
;

# $comment allows one level of nested parentheses
# It is unrolled, with normal of $ctext, special of ($quoted_pair|$Cnested)
$comment = qq
   $OpenParen  #  (
   $ctext* # normal*
   (?: #   (
  (?: $quoted_pair | $Cnested )# special
   $ctext* # normal*
   )*  #)*
   $CloseParen #)
;

##

# $X is optional whitespace/comments.
$X = qq
   [$space$tab]*# Nab whitespace.
   (?: $comment [$space$tab]* )*# If comment found, allow more spaces.
;



# Item 10: atom
$atom_char   = qq/[^($space)\@,;:\.$esc$OpenBR$CloseBR$ctrl$NonASCII]/;
$atom = qq
  $atom_char+# some number of atom characters...
  (?!$atom_char) # ..not followed by something that could be part of an atom
;

# Item 11: doublequoted string, unrolled.
$quoted_str = qq
\ # 
   $qtext *#   normal
   (?: $quoted_pair $qtext * )*#   ( special normal* )*
\   

Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Bill McGonigle
On Jan 23, 2005, at 22:30, Kevin D. Clark wrote:
One thing that this highlights, to me, is that the archives to this
mailing list still aren't available online.
http://www.mail-archive.com/gnhlug-discuss@mail.gnhlug.org/msg08582.html
-
Bill McGonigle, Owner   Work: 603.448.4440
BFC Computing, LLC  Home: 603.448.1668
[EMAIL PROTECTED]   Cell: 603.252.2606
http://www.bfccomputing.com/Text: [EMAIL PROTECTED]
AIM: wpmcgonigleSkype: bill_mcgonigle
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Kevin D. Clark

Bill McGonigle [EMAIL PROTECTED] writes:

 On Jan 23, 2005, at 22:30, Kevin D. Clark wrote:

 One thing that this highlights, to me, is that the archives to this
 mailing list still aren't available online.

 http://www.mail-archive.com/gnhlug-discuss@mail.gnhlug.org/msg08582.html

The archives from mail-archive.com are regularly purged, and in fact
only go back to 196 days at this instant.

Regards,

--kevin
-- 
GnuPG ID: B280F24E And the madness of the crowd
alumni.unh.edu!kdc Is an epileptic fit
   -- Tom Waits
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Bill McGonigle
On Jan 24, 2005, at 12:03, Kevin D. Clark wrote:
The archives from mail-archive.com are regularly purged, and in fact
only go back to 196 days at this instant.
Ah, bugger.
I could mirror the list on my server, I already have mailman setup for 
DLSLUG and sufficient free disk space.  Does anybody have historical 
mbox files?

-Bill
-
Bill McGonigle, Owner   Work: 603.448.4440
BFC Computing, LLC  Home: 603.448.1668
[EMAIL PROTECTED]   Cell: 603.252.2606
http://www.bfccomputing.com/Text: [EMAIL PROTECTED]
AIM: wpmcgonigleSkype: bill_mcgonigle
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Drew Van Zandt
If anyone wants to suggest an archiver/web frontend, I can set it up
on my server; I can afford to store at least a few hundred meg of
archives, probably even a gig.

--DTVZ

Note: As anyone I've replied to has probably noticed, I usually send
the email intended for the list to an individual before I realize what
I've done, and then forward it to the list as I originally intended.


On Mon, 24 Jan 2005 12:03:08 -0500, Kevin D. Clark [EMAIL PROTECTED] wrote:

 Bill McGonigle [EMAIL PROTECTED] writes:

  On Jan 23, 2005, at 22:30, Kevin D. Clark wrote:
 
  One thing that this highlights, to me, is that the archives to this
  mailing list still aren't available online.
 
  http://www.mail-archive.com/gnhlug-discuss@mail.gnhlug.org/msg08582.html

 The archives from mail-archive.com are regularly purged, and in fact
 only go back to 196 days at this instant.

 Regards,

 --kevin
 --
 GnuPG ID: B280F24E And the madness of the crowd
 alumni.unh.edu!kdc Is an epileptic fit
-- Tom Waits
 ___
 gnhlug-discuss mailing list
 gnhlug-discuss@mail.gnhlug.org
 http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss

___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Bill McGonigle
On Jan 24, 2005, at 12:48, Kevin D. Clark wrote:
Some folks on this list are *very* concerned with their email
addresses appearing on a web-accessible archive (they don't want to
give anything to the spammers).  I share this concern, although I
think that there are folks on this list who are more adamant about
this than I am.  I am more interested in making the collective wisdom
of this list available for others to gain from.  Still, any mail
archive should have an email-address obfuscating feature, in order to
satisfy this (reasonable) request.
This appears to be the main stumbling block.  Mailman has an e-mail 
address obfuscation feature but it's very sad ('user at domain').  If I 
were writing a harvester I think I'd throw in the 1-line regex to fix 
these so it's not really a worthwhile countermeasure.  I'm looking at 
what it would take to have Mailman just remove the e-mail address 
(display only the 'real name') or only show the user-part if there's no 
'real name' provided so the offer is DOA unless I get that fixed.

Removing X-No-Archive messages should be a 10-line formail/procmail 
script, but if anyone has one done please send it along.

The trouble with the password-protected archives is that they don't add 
to the general knowledge-base of the 'net.  There's alot of great 
content from lots of smart folks here and it's a shame not to share it.

I'm curious - is there anyone here who expects total privacy with the 
address they use to post here?  If I were a spammer I'd subscribe to 
every list I could find via mailman,et.al. interfaces; we don't 
validate lurkers here, right?  I'm not arguing for showing the 
addresses in the archive at all - just asking if there's a witness 
protection program expectation here, that is if having your 'real name' 
but not your e-mail address exposed to the Internet would cause a 
problem.  I'm ignoring the mail-archive.com archive for the sake of the 
academic question.

-Bill
-
Bill McGonigle, Owner   Work: 603.448.4440
BFC Computing, LLC  Home: 603.448.1668
[EMAIL PROTECTED]   Cell: 603.252.2606
http://www.bfccomputing.com/Text: [EMAIL PROTECTED]
AIM: wpmcgonigleSkype: bill_mcgonigle
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Tom Buskey
 I'm curious - is there anyone here who expects total privacy with the
 address they use to post here?  If I were a spammer I'd subscribe to

Hahahaha.  :-(

I wish I had a nickel for everytime the email privacy/mail archive/etc
issue was rehashed  beaten to death.  You could look in the archives
for the history.

Seriously, if you can remove all email addresses from the archive, I'd
bet you'd solve 90% of the problems people have with creating an
archive.

The other 10% will beat the topic to death with (a few) real and (many
many many ) hypothetical situations until you give up in frustration.

A way around this might be a poll that put the archive idea to a vote,
then go with the majority.  Then put that in the charter so we can end
these debates by pointing at the charter.
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Kevin D. Clark

Bill McGonigle [EMAIL PROTECTED] writes:

 On Jan 24, 2005, at 12:48, Kevin D. Clark wrote:

 Some folks on this list are *very* concerned with their email
 addresses appearing on a web-accessible archive (they don't want to
 give anything to the spammers).  I share this concern, although I
 think that there are folks on this list who are more adamant about
 this than I am.  I am more interested in making the collective wisdom
 of this list available for others to gain from.  Still, any mail
 archive should have an email-address obfuscating feature, in order to
 satisfy this (reasonable) request.

 This appears to be the main stumbling block.  Mailman has an e-mail
 address obfuscation feature but it's very sad ('user at domain').  If
 I were writing a harvester I think I'd throw in the 1-line regex to
 fix these so it's not really a worthwhile countermeasure.  I'm looking
 at what it would take to have Mailman just remove the e-mail address
 (display only the 'real name') or only show the user-part if there's
 no 'real name' provided so the offer is DOA unless I get that fixed.

I wrote a program that did most of this and posted this to the list a
while ago.  Unfortunately, I don't have my personal archive with me
right now.  Anybody?  Otherwise I'll try to retreive this tonight.

[snip]
 The trouble with the password-protected archives is that they don't
 add to the general knowledge-base of the 'net.  There's alot of great
 content from lots of smart folks here and it's a shame not to share it.

You and I agree 100% here.

[snip]

Regards,

--kevin
-- 
GnuPG ID: B280F24E And the madness of the crowd
alumni.unh.edu!kdc Is an epileptic fit
   -- Tom Waits
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Jeff Macdonald
On Mon, 24 Jan 2005 15:44:52 -0500, Tom Buskey wrote:
snip
 
 A way around this might be a poll that put the archive idea to a vote,
 then go with the majority.  Then put that in the charter so we can end
 these debates by pointing at the charter.

I'd like the archive. However, would gmane be a better alternative?
http://www.gmane.org/

-- 
Jeff Macdonald
Ayer, MA
___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Derek Martin
On Mon, Jan 24, 2005 at 03:44:52PM -0500, Tom Buskey wrote:
 Seriously, if you can remove all email addresses from the archive, I'd
 bet you'd solve 90% of the problems people have with creating an
 archive.

I'm pretty sure I've historically been the most vocal complainer, and
I've followed the arguments pretty closely.  I can say that

 - I have no problem whatever with archives existing
 - I have no problem with /my/ posts being archived [but see below]
 - I do not feel that any such archive need be protected by passwords

My one single objection has always been to the fact that my e-mail
address appears in a public forum which is very easy to harvest, and
that this unquestionably leads to an inflation of received spam (i.e.
at the mail server, before any filtering).  This includes cases of my
e-mail address being included in attributions and quotes in other
people's posts, which I have no control over.  The above 3 statements
are true, PROVIDED all references to my e-mail address are removed
before inclusion in any such archive.

As things stand, purely from the standpoint of self-interest, I no
longer care about this issue.  The reason for that is the address with
which I post is not a valid e-mail address.  Mailman allows for
certain tricks which facilitate this.  From the perspective of my view
of how things should be in the universe, I do still think that the
archive should not be established unless references to e-mail
addresses are removed.  However for obvious reasons, I no longer feel
the need to argue the point quite so vocally.

Actually, I'd like to see this behavior on the physical messages which
are sent out by the mailing list software to the subscribers also; but
I'm not even going to go there...

 The other 10% will beat the topic to death with (a few) real and (many
 many many ) hypothetical situations until you give up in frustration.

Based on my recollection of the discussions, I believe removing
addresses will remove much closer to 100% of the complaints.

-- 
Derek D. Martinhttp://www.pizzashack.org/   GPG Key ID: 0xDFBEAD02
-=-=-=-=-
This message is posted from an invalid address.  Replying to it will result in
undeliverable mail.  Sorry for the inconvenience.  Thank the spammers.



pgpDuzwbIwecy.pgp
Description: PGP signature


Re: mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-24 Thread Benjamin Scott
On Mon, 24 Jan 2005, at 12:03pm, [EMAIL PROTECTED] wrote:
 http://www.mail-archive.com/gnhlug-discuss@mail.gnhlug.org/msg08582.html
 
 The archives from mail-archive.com are regularly purged, and in fact only
 go back to 196 days at this instant.

  According to their FAQ, they keep mail indefinitely.  They say they only
keep the most recent 3000 msgs in data or thread indexes; is that what you
mean?  Or are they not working as advertised?

-- 
Ben Scott [EMAIL PROTECTED]
| The opinions expressed in this message are those of the author and do  |
| not represent the views or policy of any other person or organization. |
| All information is provided without warranty of any kind.  |

___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss


mail archives (was: Another ACPI anecdote, plus footnotes)

2005-01-23 Thread Kevin D. Clark

Bill Sconce [EMAIL PROTECTED] writes:

 We've had threads here previously about ACPI. [1] [2] [3]

A very informative post.  I found all of this to be very interesting.
Thanks for sharing.

It seems like you did quite a bit of research into this problem.

One thing that this highlights, to me, is that the archives to this
mailing list still aren't available online.  So, a non-subscriber to
this list who encounters the same problem that you encountered won't
be able to make use of your excellent summary or the other interesting
posts related to this topic on the GNHLUG list.

--kevin
-- 
GnuPG ID: B280F24E And the madness of the crowd
alumni.unh.edu!kdc Is an epileptic fit
   -- Tom Waits

___
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss