Re: [gentoo-user] Retrieving bounced list messages

2017-01-04 Thread Floyd Anderson


Thanks for both responses and your advise. I will investigate your hints 
further to see what is possible with my skills.


On Wed, 04 Jan 10:56:29 +0100
Hogren  wrote:


For me, this is the raw mail [1]. When you are on a message, click on
"Download message RAW".

For your specified example URL in [1] yes, it looks like a raw message 
to me too. But it seems to be only the case because the mentioned 
message is signed and the signature wrapped some header fields which 
make it looks like raw.


If I click “Download message RAW” on an other but unsigned message, it 
always shows me the pure text of that message (even if I use “Save Link 
As...” in a browser or curl to download it).




Use a Bash, Python, C++, or other language with curl/libcurl to filter
by date.
It was my bad because I expected there’s a “ready to use” solution which 
was simply overlooked by me. ;-)


I think I have to be more clear about why headers are important to me. 
Although the main important thing is of course the information a mailing 
list discussion provides (pure text). But now comes the second one into 
play. How to handle all those interesting stuff offline?


My idea is download it, feed Mutt (my MUA) with it and view it sorted by 
threads/dates (here is the header fields part) and maybe in the future, 
crawling/indexing it with Notmuch.


I don’t want to have fly around a couple of message archives downloaded 
as plain text (without headers and therefore lost the relations to each 
other) on my storage organised in more or less bad named folder 
structures. This will become sooner or later the same destiny as many 
browser bookmark collections I saw (not only mine) — unmaintained and 
confusing — especially with silly bookmark managers.


--
Best regards,
Floyd Anderson





Re: [gentoo-user] Retrieving bounced list messages

2017-01-04 Thread Hogren
Sorry I didn't read your response.


On 04/01/2017 08:58, Floyd Anderson wrote:
> I see no change to find any message on [1] by its message number.
> Although it were possible, my goal is to have the messages local
> stored and searchable while being offline. Also [2] doesn’t help here
> even though it offers a message download link — it’s not the raw email
> (with header fields). 

For me, this is the raw mail [1]. When you are on a message, click on
"Download message RAW".

> Now there is still one thing. How to get messages (better whole
> threads) for offline usage from a period before the list subscription
> when their message numbers is unknown? 

Use a Bash, Python, C++, or other language with curl/libcurl to filter
by date.
It's quiet easy :
- download the mailing page, with mounths date
- download subpage only if the dates you want are in these
- from subpages, retrieves ids for concerned messages and download it
with an URL like that [2]

[1] : https://marc.info/?l=gentoo-announce=148344860307130=raw
[2] : https://marc.info/?l=gentoo-announce=IDMESSAGE=raw

Hogren



Re: [gentoo-user] Retrieving bounced list messages

2017-01-04 Thread Hogren
Hello,


With a little search, I saw it (on MARC, software used by gentoo [1]) :

" Robot policy

In theory, we don't mind people snarfing down some MARC pages for
off-line reading. (I travel a lot, and sometimes want to pull down long
threads before hitting the road to read locally, etc.)

On the other hand... first, if we think you are a spam-bot
address-harvester, no death is slow or painful enough. Also, even if
well-intentioned, a robot crawling MARC can sometimes create a DoS; if
the robot sustains many parallel requests (or we happen to be hit by
multiple different robots at the same time) and doesn't back off if the
site starts to slow down, it can bog down the server. In a perfect world
MARC would scale better, and would automatically recognize abusive
robots 100% accurately, 100% of the time. But since it's not a perfect
world... we may throttle traffic from you if your IP, user-agent, or
IP/user-agent combination have misbehaved in the past.

If you want to crawl MARC, please be sure you have a delay between
requests, say one or two seconds. If you think we've mis-identified you
as a robot, please feel free to contact us. Please include information
we'll need to find your activity in our logs, such as the time you get
this message, the IP address(es) you are browsing from, and the
user-agent (web browser) you are using."


With this warning, you can use the MARC interface to try to dump what
you want. Try to start with at the mailing list page [2]. The on MARC
contains date, and others mail headers. Use libcurl and sed should be
sufficient for your needs.



[1] : https://marc.info/?q=about

[2] : https://marc.info/?l=gentoo-announce=1=2



Good luck :)


Hogren


On 04/01/2017 08:58, Floyd Anderson wrote:
> On Tue, 03 Jan 20:12:05 -0500
> Philip Webb  wrote:
>> 170104 Floyd Anderson wrote:
>>> Is it possible — and when how — to retrieve bounced mailing list
>>> messages,
>>> e.g. from  or this list) ?
>>
>> You can recover everything from the Gentoo lists' archive :
>>  http://archives.gentoo.org/ .
>
> Thanks for your response. I’ve already found [1] but it’s hard (even
> impossible) to figure out the bounced message(s) from there. Notice,
> the bouncing messages information from mailing list manager (mlmmj)
> looks like:
>
>> Some messages to you could not be delivered. If you're seeing this
>> message it means things are back to normal, and it′ merely for your
>> information.
>>
>> Here is the list of the bounced messages:
>> - 174956
>> - 174958
>
> I see no change to find any message on [1] by its message number.
> Although it were possible, my goal is to have the messages local
> stored and searchable while being offline. Also [2] doesn’t help here
> even though it offers a message download link — it’s not the raw email
> (with header fields).
>
> Anyway, you push me in the right direction. After digging somewhat
> deeper I found the ability to send a message request to e.g.
>  (where N is the message number).
>
> Now there is still one thing. How to get messages (better whole
> threads) for offline usage from a period before the list subscription
> when their message numbers is unknown?
>
>
> [1] 
> [2] 
>




Re: [gentoo-user] Retrieving bounced list messages

2017-01-03 Thread Floyd Anderson

On Tue, 03 Jan 20:12:05 -0500
Philip Webb  wrote:

170104 Floyd Anderson wrote:

Is it possible — and when how — to retrieve bounced mailing list messages,
e.g. from  or this list) ?


You can recover everything from the Gentoo lists' archive :
 http://archives.gentoo.org/ .


Thanks for your response. I’ve already found [1] but it’s hard (even 
impossible) to figure out the bounced message(s) from there. Notice, the 
bouncing messages information from mailing list manager (mlmmj) looks 
like:



Some messages to you could not be delivered. If you're seeing this
message it means things are back to normal, and it′ merely for your
information.

Here is the list of the bounced messages:
- 174956
- 174958


I see no change to find any message on [1] by its message number. 
Although it were possible, my goal is to have the messages local stored 
and searchable while being offline. Also [2] doesn’t help here even 
though it offers a message download link — it’s not the raw email (with 
header fields).


Anyway, you push me in the right direction. After digging somewhat 
deeper I found the ability to send a message request to e.g. 
 (where N is the message number).


Now there is still one thing. How to get messages (better whole threads) 
for offline usage from a period before the list subscription when their 
message numbers is unknown?



[1] 
[2] 

--
Best regards,
Floyd Anderson





Re: [gentoo-user] Retrieving bounced list messages

2017-01-03 Thread Philip Webb
170104 Floyd Anderson wrote:
> Is it possible — and when how — to retrieve bounced mailing list messages,
> e.g. from  or this list) ?

You can recover everything from the Gentoo lists' archive :
  http://archives.gentoo.org/ .

-- 
,,
SUPPORT ___//___,   Philip Webb
ELECTRIC   /] [] [] [] [] []|   Cities Centre, University of Toronto
TRANSIT`-O--O---'   purslowatchassdotutorontodotca




[gentoo-user] Retrieving bounced list messages

2017-01-03 Thread Floyd Anderson


Hello list,

is it possible — and when how — to request/retrieve bounced mailing list 
messages (e.g. from  or this list)?


If there is an approach, can it be also applied to get certain 
interesting threads from dates _before_ one has been subscribed to that 
Gentoo list?


I wish to have a complete mailing list and be able to locally store some 
informative user experience from the past which may be meaningful to me 
in the future. Thanks and…


--
Best regards,
Floyd Anderson