Re: [Mailman-Users] attachements question

2005-10-06 Thread Manuel Kissoyan
hi!

Where exactly are saved the atatchement files, also in which format uencoded 
or just like the original attachement file?

thank you in advance!









 At 3:04 PM -0300 2005-09-27, Manuel Kissoyan wrote:

  My question is, how long are these files saved for people to download

 They are saved until some other process comes along to clean them out.

 
 and
  are they automatically deleted after a while?

 That's not part of the standard cron jobs, no.

 Is there a way for me
  to manually go in and delete them so they dont build up and take up
  server space?

 Sure, you can go manually delete them.  Or you can set up a cron job to go 
 delete any attachments more than a certain age.

 Im assuming they self delete after a while because
  clicking on one of the link now indicates its not there.

 That implies that something else is going on.  There is no standard cron 
 job that I know of to clean out the old attachments.

 -- 
 Brad Knowles, [EMAIL PROTECTED]

 Those who would give up essential Liberty, to purchase a little
 temporary Safety, deserve neither Liberty nor Safety.

 -- Benjamin Franklin (1706-1790), reply of the Pennsylvania
 Assembly to the Governor, November 11, 1755

   SAGE member since 1995.  See http://www.sage.org/ for more info.
 


--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-10-06 Thread Mark Sapiro
Manuel Kissoyan wrote:

Where exactly are saved the atatchement files, also in which format uencoded 
or just like the original attachement file?

Scrubbed attachments are stored in the directory
archives/private/listname/attachments. In most cases, they are just
the original attachment (decoded from the MIME
Content-Transfer-Encoding:), but in the case of scrubbed HTML parts,
the format depends on the mm_cfg.py setting of ARCHIVE_HTML_SANITIZER.

The default is to store HTML attachments 'HTML escaped' which means
things like ,  and  are replaced with lt;, gt; and amp;
respectively so that opening the attachment in a web browser shows
effectively the source HTML rather than the rendered page.

--
Mark Sapiro [EMAIL PROTECTED]   The highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-29 Thread Stephen J. Turnbull
 John == John W Baxter [EMAIL PROTECTED] writes:

John In addition, the Message-Id values would have to be
John filtered, if used as is, for URL-unfriendly characters

I don't think so.  AFAIK, that was fixed about 2000 RFCs ago.  When
used as URLs, conforming agents will URL-encode them.  How the archive
chooses to implement identifying a storage location from the decoded
URL is up to it.  The URL-encoded string as filename is an obvious
choice, but database-based archives might just use the raw Message-ID.

BTW, I just realized that Google probably does index Message-IDs
already (as long as the archived message includes that header), so
Googleability is _not_ an advantage of my scheme over the header
digest approach.  The ability to algorithmically construct URNs and
URLs however still is.

John There's also the small point that the presence of
John Message-Id: is a SHOULD, not a MUST (some compromise in the
John musty past of the IETF, no doubt).

I would guess that's for the convenience of folks who use telnet HOST
25 as their MTA.  Humans are notoriously bad at generating unique
anything.wink  Anyway, my algorithm just treats such messages as a
collision with all the others that don't have Message-IDs.


-- 
School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp
University of TsukubaTennodai 1-1-1 Tsukuba 305-8573 JAPAN
   Ask not how you can do free software business;
  ask what your business can do for free software.
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-29 Thread Brad Knowles
At 2:29 PM +0900 2005-09-29, Stephen J. Turnbull wrote:

  BradBecause some MUAs generate message-ids that are likely
  Brad to collide.

  Can we stop pandering to the broken mailers, please?  Are we not
  hackers?  We know how to handle collisions.

I don't really care how the algorithm is implemented.  I care 
that it works.  And I care that it is actually done, and checked into 
the source tree.

-- 
Brad Knowles, [EMAIL PROTECTED]

Those who would give up essential Liberty, to purchase a little
temporary Safety, deserve neither Liberty nor Safety.

 -- Benjamin Franklin (1706-1790), reply of the Pennsylvania
 Assembly to the Governor, November 11, 1755

   SAGE member since 1995.  See http://www.sage.org/ for more info.
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-28 Thread John W. Baxter
On 9/28/05 1:30 AM, Brad Knowles [EMAIL PROTECTED] wrote:

  Why archivers don't use Message-Id for the URL, I don't know.
 
 Because some MUAs generate message-ids that are likely to
 collide.

In addition, the Message-Id values would have to be filtered, if used as is,
for URL-unfriendly characters (eg, Outlook Express tosses in a couple of $
characters).  Just another example of not trusting input from untrusted
sources.

The hash of Message-Id:, Date:, and Received: (all of the Received: headers)
would do, except for the case of an insane MTA--the one generating the top
Received: header--feeding the same message into Mailman multiple times.
Perhaps a representation of the computer's clock would help there.

There's also the small point that the presence of Message-Id: is a SHOULD,
not a MUST (some compromise in the musty past of the IETF, no doubt).

  --John


--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-28 Thread Brad Knowles
At 8:50 AM -0700 2005-09-28, John W. Baxter wrote:

  The hash of Message-Id:, Date:, and Received: (all of the Received: headers)
  would do, except for the case of an insane MTA--the one generating the top
  Received: header--feeding the same message into Mailman multiple times.
  Perhaps a representation of the computer's clock would help there.

The top Received: header is added by the Mailman mailing list 
server MTA itself, and that should include the local date/time stamp 
as of the time of receipt of the message.

-- 
Brad Knowles, [EMAIL PROTECTED]

Those who would give up essential Liberty, to purchase a little
temporary Safety, deserve neither Liberty nor Safety.

 -- Benjamin Franklin (1706-1790), reply of the Pennsylvania
 Assembly to the Governor, November 11, 1755

   SAGE member since 1995.  See http://www.sage.org/ for more info.
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-28 Thread Stephen J. Turnbull
 Brad == Brad Knowles [EMAIL PROTECTED] writes:

Brad At 2:03 PM +0900 2005-09-28, Stephen J. Turnbull wrote:

 Why archivers don't use Message-Id for the URL, I don't know.

Brad   Because some MUAs generate message-ids that are likely
Brad to collide.

Can we stop pandering to the broken mailers, please?  Are we not
hackers?  We know how to handle collisions.  Here's the algorithm:

1)  Look for a unique ID in the (X-)List-Archive-Message-ID field.  If
not found:
a)  Generate a unique ID according to the usual algorithm as if
the post were about to be sent from the archive host.
b)  Add it to the header in the (X-)List-Archive-Message-ID field.

2)  Extract the message ID from the message.  If none, set the program
variable equal to the ID generated in 1, and (optionally) add it as
Message-ID to the message's header.

4)  Generate the URL for the archived message based on *Message-ID*.

5)  Check for collision.

6)  If there is a collision, make a directory (could be a file-system
directory, could be just an HTML file, could be a digest message)
with the URL generated in 4.  Generate URLs for the colliding
messages based on Message-ID plus List-Archive-Message-ID, and
include them in the directory.  Conforming implementations MAY also
extract MUA information and make nasty comments about the broken
MUAs, their implementers, and their users to go with the directory.

If Message-ID == List-Archive-Message-ID, go to 1a.  At this point
a conforming implementation MAY mail /vmunix to its implementer,
who obvious snafu'ed.

7) PUT the colliding messages at those URLs.

Rationale:

1.  You could actually derive an URN from this:
archived-message://list-archive.your.org/MESSAGE-ID.

2.  The URL is unique and will persist across regeneration of the
archive as long is the message is present.

3.  People who use conformant software implemented competently should
be given precedence.

4.  Users who don't subscribe to the archiver's client but somehow get
their hands on a message ID can use Google to find it (and the
rest of the thread).

5.  People who use software that doesn't conform will suffer.

Brad For some time now, I've been arguing that they should use a
Brad hash of the relevant information (maybe all the headers,
Brad maybe just selected headers, maybe the entire message,
Brad whatever is reasonable to assume will survive), making sure
Brad to at least include the value of the Date:, Message-ID:,
Brad and Received: headers as part of that input.

This gives 1 and 2, but not 3, 4, and 5.  (No, you can't generate a
Google search item from knowledge of the algorithm because you don't
necessarily have the Received headers.)  Seems like overkill for Step
1 of the algorithm, too.


-- 
School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp
University of TsukubaTennodai 1-1-1 Tsukuba 305-8573 JAPAN
   Ask not how you can do free software business;
  ask what your business can do for free software.
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


[Mailman-Users] attachements question

2005-09-27 Thread Manuel Kissoyan
Hi,

Wondering if someone know the answer of this, made for one of our clients... 

With the mailman software for mailing lists, it has the option of scrubbing 
attatchments to the list and converts them to a link for people who are on the 
list to click on to get the attatchment rather than everyone having to download 
it. 
My question is, how long are these files saved for people to download and are 
they automatically deleted after a while? Is there a way for me to manually go 
in and delete them so they dont build up and take up server space? Im assuming 
they self delete after a while because clicking on one of the link now 
indicates its not there.  
 
Thank you in advance! 
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-27 Thread Brad Knowles
At 3:04 PM -0300 2005-09-27, Manuel Kissoyan wrote:

  My question is, how long are these files saved for people to download

They are saved until some other process comes along to clean them out.

and
  are they automatically deleted after a while?

That's not part of the standard cron jobs, no.

 Is there a way for me
  to manually go in and delete them so they dont build up and take up
  server space?

Sure, you can go manually delete them.  Or you can set up a cron 
job to go delete any attachments more than a certain age.

 Im assuming they self delete after a while because
  clicking on one of the link now indicates its not there.

That implies that something else is going on.  There is no 
standard cron job that I know of to clean out the old attachments.

-- 
Brad Knowles, [EMAIL PROTECTED]

Those who would give up essential Liberty, to purchase a little
temporary Safety, deserve neither Liberty nor Safety.

 -- Benjamin Franklin (1706-1790), reply of the Pennsylvania
 Assembly to the Governor, November 11, 1755

   SAGE member since 1995.  See http://www.sage.org/ for more info.
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] attachements question

2005-09-27 Thread Stephen J. Turnbull
 Brad == Brad Knowles [EMAIL PROTECTED] writes:

 Im assuming they self delete after a while because clicking on
 one of the link now indicates its not there.

Brad   That implies that something else is going on.  There
Brad is no standard cron job that I know of to clean out the old
Brad attachments.

If somebody cleaned out spam or private information, and regenerated
the archives, that might change the order, and the link would break.
I know this happens with MHonArc (since the URLs it generates are
simply the serial number of processing concatenated with a prefix and
.html).  I don't know about pipermail but I wouldn't be surprised.

Why archivers don't use Message-Id for the URL, I don't know.


-- 
School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp
University of TsukubaTennodai 1-1-1 Tsukuba 305-8573 JAPAN
   Ask not how you can do free software business;
  ask what your business can do for free software.
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp