>What type of partial match search is supported on mail-archive?
Here's the short and long explanation of search syntax.
https://www.mail-archive.com/faq.html#search
https://www.mail-archive.com/searching.html
___
Gossip mailing list
Dear Friends,
The Mail Archive has been running for 20 years. It started
as hobby and grew into a small business 14 years ago.
We have now come full circle. The business will end on
December 31, 2018 and the service will revert to a hobby.
What happened? Well, traffic has steadily declined for
I know nobody cares, but I noticed that The Mail Archive
achieved over 99.99% uptime for a year. Kind of neat. We'll
see if this message jinxes it. For comparison, if you count
the recent total eclipse in the United States as a failure,
the sun had about 99.9995% uptime.
We use MHonArc to render emails for the web, which is open source.
So any impatient programmers who are really hungry for this feature
may want to dust off their Perl programming skills.
Jeff
___
Gossip mailing list
A lot of crazy stuff happened in 2016. Looking back, what parts
of that touched The Mail Archive?
Let's start with mundane computer stuff. Uptime was great, the service
was online for the entire year except for 9 hours, 12 minutes. Some
additional hard drives were converted to SSD, and some RAM
We’ve decided to make a policy change for The Mail Archive regarding
message removal. For the last eighteen years (time flies!) list
administrators have been responsible for decisions on removing content from
publicly archived lists. We have had this policy in place in order to
preserve the
Thanks for the heads up. Highly appreciated. I'm impressed that you know
the certificate
vendor for The Mail Archive. I was not aware of the drama going on with
StartCom.
Is it correct that the removal only applies to new certificates, and
therefore the
deadline for action is May 3, 2017 when the
Today I want to express my support and appreciation for Lars Magne
Ingebrigtsen. For those not familiar, Lars created and has been running
the utterly terrific email archiving service Gmane for 14 years. I've
long admired both Gmane's engineering excellence and integrity.
Unfortunately, one
Thanks for the detailed report. I made some changes and now
get a 'A' rating on the online test. Does this fix the Android
problems?
___
Gossip mailing list
https://www.mail-archive.com/gossip@mail-archive.com
Once upon a time, an ancient fish crawled out of the water and into
the mud. That was an important moment in the grand journey of life.
Its children learned to live on the land and eventually became us.
On December 21, I watched the a young company called SpaceX
successfully land a rocket booster
The spam filtering service we use (SpamHero) quarantined that message along
with some others. I've released the messages from quarantine and also
adjusted
the whitelist to hopefully reduce or prevent this from happening in the
future.
I'm sorry about this and we would definitely consider
Yes, you can safely leave out To, Message-id, and Received.
Consequences are what you'd expect, like the inability to do a
message-id search and find that particular message.
You are correct. Posting address is manually assigned during the bulk
import process, and automatically determined from
The only things indexed for search are: message-id, subject, date (usually
extracted from the Recieved: header), sender name (extracted
from From: header), posting address (for example, gossip@mail-archive.com),
archival message number, and message body. Every message is sorted and
organized
Statute of limitations is typically 3 kilomessages on a normal
non-import list, but should (I think) be unlimited on bulk import.
Conversion to unix newlines is required and is manual; doesn't
matter who does it.
Still prefer to do whole import at once especially if tricky; less
labor, also less
We're experimenting with a new user interface for search. It
works a little differently, what do people think?
To try this on your own list, just replace search in the URL
with searchdev.
Cheers,
Jeff
===
OLD
http://www.mail-archive.com/search?q=squirrell=cayugabirds-l%40cornell.edu
NEW
Happy Spring everyone. Here are some updates for The Mail Archive.
Search is about 10 times faster than before. This is due to a complete
rewrite that shaves off a ton of initialization time. I'm really happy
about this.
As an experiment, we're changing the way we serve ads. Previously,
direct
After investigation, this turned out to be an issue with an X-No-Archive:
Yes header
on the list itself.
Cheers,
Jeff
___
Gossip mailing list
https://www.mail-archive.com/gossip@mail-archive.com
http://mail-archive.com/cgi-bin/mailman/options/gossip
___
Gossip mailing list
https://www.mail-archive.com/gossip@mail-archive.com
http://mail-archive.com/cgi-bin/mailman/options/gossip
You may have noticed that archiving was suspended at
The Mail Archive recently. Things are fine now, read on
if you want gory details.
We are hosted at a professional datacenter, complete
with building wide uninterruptible power supply (UPS) and
backup generators. About 5 years ago, the
The gossip mailing list has a new name: gossip@mail-archive.com
It's been over fifteen years, but I've finally got around to moving this
list to where it should have been the whole time. All subscribers have
been migrated. If there are any issues, please let me know.
___
Gossip mailing list
Gossip@mail-archive.com
http://mail-archive.com/cgi-bin/mailman/listinfo/gossip
I've now had some time to spruce up the search feature. Amazing what
progress Lucene has made in the last few years. Search is slightly
faster now, there is less code on our end, and I found and fixed a couple of
rare bugs involving HTML escaping.
What I can't do is reproduce your problem; it
Individual list search is very important. Thank you for reporting
the problem. This turned out to be a configuration mistake on the
webserver, involving the MultiViews configuration directive.
Search should be working now. Let us know if you see any
anomalies, and in the meantime we're looking
Happy New Year.
As The Mail Archive enters its 15th year of operation, let's take a
quick look back. This year we had a record uptime percentage of
99.69%. That number jumps to 99.96% if you forgive the day we were
deliberately dark in protest of the proposed SOPA law in the United
States. There
Okay, so here's a quick status update. There are have been suggestions about
fonts. Font weight. Line spacing. Typeface. Line spacing is interesting,
a big design goal was to make more information available with less scrolling.
However, we've found several references that suggest 1.4 is good in
Thank you for the feedback, keep it coming.
Interesting screenshots. By the way, my everyday platform is
also Ubuntu (10.04 and 12.04). So far I can't reproduce font
problems with courier.
As for the ordering of font-family, that's a good question. Let
me check with graphic designer.
-Jeff
--
What I dislike is that visited links are indistinguishable from non
visited ones. The difference in color just is way too little.
I didn't notice this until you mentioned it. Now it is driving me crazy.
Thank you for the feedback.
--
To unsubscribe, send mail to gossip-unsubscr...@jab.org.
Hello all,
The Mail Archive is now 14 years old (that's a long time in dog
years) and we've been thinking about some design updates.
The mockup below is intended for visitors from global search
engines. Direct visitors will continue to have no advertisements.
I hope that the proposed design is
Okay, found the bug.
On June 10th I made a change to deal with archives containing more
than one million messages. A piece of code had decided that one
million was a really big number and was starting to write scientific
notation to some internal log files. Unfortunately my change was
flawed and
Fair enough, I found how to have http://cr.yp.to/ezmlm.html subscribe
something other than the envelope or header from address.
If you think this would be useful for others, please consider sharing either
here, or we can put in the http://mail-archive.com/faq.html if appropriate.
I was a bit
Thanks for the note, we'll take a look. By the way, there's a discussion
about using a shorter URL. Would that be useful to you?
http://www.mail-archive.com/mailman-developers@python.org/msg12770.html
The only change is all web pages are serving HTTP 503 (Service Temporarily
Unavailable) for the next 24 hours. There is no disruption to email
archival.
We noticed at least one other email archival service (marc.info) is also
participating.
-Jeff
Finally, one of the charter goals of Mail Archive, Inc. is to have fun
Mission accomplished.
http://www.airshipventures.com/sightings/1290/2012/01/08
--
To unsubscribe, send mail to gossip-unsubscr...@jab.org.
I'm pleased to announce The Mail Archive has passed the 100 million
message mark this past week. It took 13 years and eight generations of
hardware, but we made it.
-Jeff
--
To unsubscribe, send mail to gossip-unsubscr...@jab.org.
June has been a weird month. Some of the automatic software programs stopped
running, resulting in many search indexes falling behind. Plus we had a
crash this morning causing about 90 minutes downtime. Crazy!
I think I've finally traced the problem to a May 30 operating system update.
It
On Sun, Jun 12, 2011 at 3:11 PM, e-letter inp...@gmail.com wrote:
Unable to use the search feature, e.g. search for text or a
returns 0 results, clearly incorrect.
Works for me, from both the home page and in an individual archive.
Can you please supply the exact search URL that is giving
I assume the indexing is not running at their end for some reason.
That is exactly correct. If I run by hand it works fine, and that go
link will resolve now. Still looking into why the cron job that kicks
off indexing had trouble. Thanks for problem report.
--
To unsubscribe, send mail to
Perhaps it's time for us to revisit [Lucene for site-wide search]
A few weeks ago I tested Lucene's ability to search across multiple
indexes (MultiSearcher) and it is hopelessly slow; queries take 5
seconds across just a few hundred indexes.
Right now I'm trying index merging
I was doing some experiments today, and managed to briefly knock over
a server in the process. I looked at searching multiple indexes
(Lucene's MultiSearcher) and merging
(IndexWriter.addIndexesNoOptimize). The former is unusably slow. The
latter seems to be on track for about 6 hours if the
Oops, sorry for the more-or-less duplicate message. The extra factor
of 2 in time is because the temporary files are turning out twice as
big as I was expecting. Earl, good suggestion, and no we haven't
explored it (yet).
--
To unsubscribe, send mail to gossip-unsubscr...@jab.org.
Progress report:
We are receiving your messages but our system is failing to recognize
them as belonging to a new list. It took me a while to find them since
gg1 is not logged the same way as regular inbound messages. I will
take the sorting engine out back into the parking lot and try to
talk
Fixed faster than expected. Only a few hundred messages affected
across the entire corpus as far as I can tell, and they are all being
dealt with. One question: the gg1 address is really designed for
Google Groups, not for other stuff. Did you try the regular archival
address first and get some
Season's greetings.
Thank you all for sticking with The Mail Archive as we close out the
decade. Here's a quick rundown of the trials, tribulations, and
triumphs over the past year.
First, let's talk about infrastructure. Our uptime was 99.57% which is
similar to previous years. This year's main
On Wed, Dec 22, 2010 at 4:46 AM, e-letter inp...@gmail.com wrote:
Please offer the ability to search using international date format
(-mm-dd) as a search criterion
Done. (Actually, this has always worked).
The interesting question: is it possible? Are the originating
mails stored so that the visible html can be repaired?
A mathematician, a physicist, and an engineer walk into a bar.
The mathematician says The raw mail exists even the old stuff is in
offline
cold storage. It can be matched by
The Mail Archive does have to be very aggressive to obfuscate email
addresses, otherwise a lot of people go bonkers. But yes, it is dumb to
break a hyperlink, especially a hyperlink to The Mail Archive. Your feature
request is valid and if you are feeling eager, feel free to send in a patch.
You found a bug. In The Mail Archive's hash calculation, there is an
incorrect urlib.unquote run on the message id. This is escaping the minus
sign, and therefore calculating based on an incorrect message id. We're
going to have to regenerate the entire message-id index after the bug is
fixed.
The gossip malling list uses a somewhat obscure and very limited list server
called Enemies of Carlotta. Another fun fact is it runs on a 5 watt NSLU2,
which has a grand total of 32 megabytes of memory. That's less memory than
the very first hardware iteration of The Mail Archive, which was a 90
If you visit the home page on The Mail Archive, you may notice
search is broken. You can not currently search the entire corpus.
We're working on it but it will take some time.
The other search features still work, e.g. search works fine for an
individual archive, and you can still search list
Testing obfuscation, data below:
j...@jab.org
http://j...@jab.org
mailto:j...@jab.org
http://www.mail-archive.com/gossip@jab.org/msg01358.html
http://www.mail-archive.com/gossip@jab.org/
http://www.mail-archive.com/gossip@jab.org
http://mail-archive.com/gossip@jab.org/msg01358.html
I've finally completed localization of advanced search. If you speak a
language other than English, now is a great time to click around the user
interface and see if there are any silly language mistakes. (To get to
advanced search, do a regular search first, then you'll see a link)
Cheers,
Jeff
This is general interest, so we are responding publicly.
We've discovered that many of the advanced query results have been
leaking out through a fractured fiber optic line in the Gulf of
Mexico. It is hard to get a precise measurement, but we believe 13 to
20 thousand bits of information per day
Ok, it only took ... 4 years ... but we now have sort-by-date
available in the advanced search interface. Enjoy.
-Jeff
I wasn't clear. Is there some way to organize the search results?
When I used the trial search engine on the sundial list and typed in
oglesby the 550 results were all
Hi Andrius,
I wasn't aware that there were adult content ads being served at all. Can
you please supply a URL so I can take a look? If you don't want to supply to
the entire group, send to M-A staff at themailarch...@gmail.com. I'm not
keen on adjusting aspects of individual messages because that
An advanced search form would basically have a bunch of fields like
subject and date and from then string them together into a query
syntax described below. Then redirect that query to the standard search URL.
Implementation would most likely be in PHP. Hard part isn't the programming,
it is
The list name links to the info page; optionally not displayed if made
redundant by the nature of the logo.
This one requires too much per-list thinking; we'll only consider changes
that are fully automatic. The other parts sound reasonable to me, but it
would be nice to know if other people
I have a web site for the list rules and, very soon, a link to the
archives. Is there a header I can add to my messages that will let the
archiver pick up and display my site URL on its own line?
We'll honor RFC2369 headers. A quick glance suggests List-Help is most
appropriate.
Also, is
Bring it on. The Mail Archive gets tens of thousands of inbound messages
daily, and currently serves on the order of queries per second. So I don't
think there is any concern about swamping the service given the numbers
mentioned. Doesn't matter to use how many mbox files are involved for
Hi Randy,
Bad news first.It is not such a great idea for mail-archive.com to re-send
mail. That's asking for trouble with respect to abuse and spammers. On the
good side, we can get the same effect if the mail server is in cahoots with
the archiving service. A specification (RFC5604) is in place
Ok, I checked; the message was dropped because it was bigger than our size
limit. We limit inbound message size for a variety of reasons; one is
historically attachment heavy archives consume a lot of resources and -
statistically speaking - are much more likely to be spammy. (There were
some
Thanks for the problem report, Joseph.
Looks like a problem with the RAID filesystem after a power outage at
the datacenter. I'm running some integrity checks, and this is going
to take a while. In the meantime, I've mounted backup disk from two
days ago. This is going to be a little stale (the
Ok, that one got away from me. Let's try this again.
I've enabled an experimental full site search. It's definitely not
ready for prime time; too slow and we aren't going to update the index
regularly. And it might go away at any time. But if you want to play
around, have fun. We don't really
Hi Terrence,
The domain for the mailing list is lists.metaperl.com, not metaperl.com.
Sure, I think we can help you out. Let's move this to the customer
service address instead of using gossip. By the way, you've got an
X-No-Archive: yes header in your new setup, which is preventing
archiving.
Remember the info pages?
http://www.mail-archive.com/petbunny%40lsv.uky.edu/info.html
If you squint carefully, you can see a new field called hints. So
what's a hint? Hints are gentle way to tell the world what a list
is about.
Let's say you have a mailing list about pet bunnies. Then you add
As you may know, The Mail Archive has been an advertising supported
service for several years now. We have recently decided to shift things
around a bit. We are completely removing advertising for regulars.
This includes list admins, list members and lurkers - anyone that uses
The Mail Archive
On 4/10/07, Ronny Haryanto [EMAIL PROTECTED] wrote:
[...] Everything else looks OK to me (I speak Indonesian natively) except [...]
Thank you, your feedback is integrated and will be active for all new
messages. I don't know if we can easily localize the dates (the
M-A search engine currently
The Mail Archive is getting a lot of visitors from Indonesia.
Can someone who speaks Indonesian please take a look
at the localization and tell me if looks ok? Should any wording
be improved or replaced?
Example archive:
http://www.mail-archive.com/origami_indonesia%40yahoogroups.com/
Jeff Marshall recently pointed out that archiving latencies had
been rising again - to at least several hours and occasionally
the good part of a day. So I pulled out the programming hammer
and stated whacking off milliseconds here and there - eventually
it starts to add up. I'd guess I probably
The Mail Archive uses a third party to monitor availability. so if a
computer goes offline someone usually hears about it pretty quickly.
As a side effect, we also get weekly, monthly, quarterly and yearly
uptime reports. The 2006 numbers just came in, and I'm pleased to
report M-A significantly
On Sat, Dec 16, 2006 at 02:32:18PM -0500, Jeff Breidenbach wrote:
It is possible to move a migrate a list archive, but it is kind of
manual so you have to ask really nicely.
Good news!
The process just became semi-automated, so there should be less
risk of screwups. And it is no longer
The counter on the homepage for total number of messages (which
is somewhere around 40 million) is going to be a little erratic over the
next week or so. No cause for alarm, just taking advantage of the
quiet holiday season and moving a bunch of mail between various disks.
Jeff
--
To
You are right, list-post tends to be the dominant header for
the M-A sorting engine. It is possible to move a migrate a list
archive, but it is kind of manual so you have to ask really nicely.
Basically, what we do is move the archive to the new location,
then insert a HTTP redirect at the old
Does anyone have a mailman archive to mbox converter
script in their back pocket? And when I say mailman archive,
I'm talking about the gzip'd text like this:
http://listas.asteriskbrasil.org/pipermail/asteriskbrasil/
Note the lack of headers - ugh. I have no idea what the mailman
folks were
I've included the simple script I use below.
Thanks, Lars. I don't see the script - could you please resend?
Cheers,
Jeff
___
Discussion list for The Mail Archive
Gossip@jab.org
http://jab.org/cgi-bin/mailman/listinfo/gossip
Hi Chris,
Thanks for the report, I see what the problem is. Can you wait about
24-36 hours? This problem is affecting a number of active lists
(beginning with 'g' 'h' 'i' 'j' 'k' and 'l') and requires a fair amount of
computer processing time to fix.
Cheers,
Jeff
On 9/21/06, Chris McFarling
Around 9:20pm PST, I rebooted the primary webserver. Pretty
minor reason - just adding a boot parameter to work around
linux kernel bug #7068. Unfortunately, things didn't go as
planned and I ended driving out to the data center and spending
about an hour of quality time there. Add in driving
Fixed.
This is a good time to mention we switched mainline search
over to PyLucene yesterday afternoon.
Both search engine contenders are quite excellent; we could happily
go with either one. PyLucene has advantages in UTF-8 maturity
and more efficient disk usage. Xapian has advantages of
Lucene: it slices, it dices
Right. That's just what I'd expect from a program whose name sounds
like a brand of cheese. During the long cold war, children were inspired
by Superman. In late 2003, Californians elected The Terminator to be
their governor. These turbulent times demand strong,
as Xapian Omega crushes and destroys the competition,
finishing off queries in milliseconds. This probabilistic juggernaut is
a battle tested, email chewing reigning champion in Europe. Honed
for years and more hardened than quartz, Jeff Breidenbach will
drive Xapian Omega during this Battle Royale
Thanks!
Also, one of the people with slightly-broken mbox files suggested this:
perl -i -p -e '/^From / !/\d\d:\d\d:\d\d \d\d\d\d$/ s/(.+)/$1/' A_*
I'm continuously amazed at both Perl, and the people whose brains are
capable of understanding it. :)
-Jeff
Hi all,
When someone wants to import a bunch of messages into an archive,
the provide an mbox file. The mbox file format is simple, but has at
least one gotcha.
In order to avoid misinterpretation of lines in message bodies which
begin with the four characters From, followed by a
Thanks for the problem report. Even the manual override for search
indexing is acting really sluggish right now; there may be something
amiss. I'm looking into it.
Cheers,
Jeff
___
Discussion list for The Mail Archive
Gossip@jab.org
On 19 Jul 2006 11:46:45 -0700, Jeff Marshall [EMAIL PROTECTED] wrote:
I received a few private votes for keeping the .xml rss 2.0 feed, if we
eliminate
one of them.
I think we should deprecate the RDF variant (e.g. remove it from the FAQ) then
actually remove the feature once we're sure
Whatever happened with this? Did it get resolved? Does anyone else
have this problem?
On 19 Jun 2006 21:06:58 -0700, Jeff Marshall [EMAIL PROTECTED] wrote:
Yikes, that shouldn't be happening at all for anything other than well-known
spambot user agents.
I'll look into the stats on how often
Hi all,
I've adjusted message page layout slightly; there is now a little less
artwork in
the right margin. Should we keep it that way?
Cheers,
Jeff
___
Discussion list for The Mail Archive
Gossip@jab.org
Hmmm...
I took a look at the formatting problem Marcus was talking about,
which is a text reflow problem; shows up when the browser window
is relatively narrow. As an experiment, I bumped the CSS entry for
msgBody from width:60% to width:75%
Does this break anyone? Look in particular on message
Marcus (and everyone),
I want to publicly apologize for mischaracterizing the Green Travel
list. Basically,
I jumped to an incorrect conclusion from reading a couple of messages. The list
and archive are legit. Mea culpa.
Cheers,
Jeff Breidenbach
I'm going to take down www.mail-archive.com in a little bit
to add another small boatload of hard drives. Inbound mail
will be queued. Hopefully this won't take long.
Cheers,
Jeff
___
Discussion list for The Mail Archive
Gossip@jab.org
Maintenance complete; the service was offline for about half an
hour. All right! The primary server is now has 3.5TB raw disk onboard,
which should be enough for anyone.
-Jeff
___
Discussion list for The Mail Archive
Gossip@jab.org
Hi all. Over the last week or so we
* Experimented with an automatic language detection tool,
and localized nearly a thousand lists.
* Bumped up the setting one notch The Mail Archive's inbound
spam filter.
* Noticed not too many people have submitted custom logos.
Maybe nobody cares?
The winning new feature from yesterday's coding session was...
-- Custom Logos
-- http://www.mail-archive.com/faq.html#customize
Why logos?
We're hoping that list admins will have extra warm fuzzy feelings
seeing their archive customized with a particular image. Heck, maybe
this will be the
Mac Oglesby noticed an occasional hiccup in message threading
in one of the archives. I've added a FAQ entry explaining how this
can occur, and with Mac's blessing I am forwarding our discussion
to gossip.
Cheers,
Jeff
-- Forwarded message --
From: Mac Oglesby
Date: Nov 17, 2005
Reply is to gossip due to topic being of general interest.
I was trying to download the latest version of the mail-me GPL tarball
from the /contrib directory but I found nothing there. Was the source
code removed intentionally, or is it a glitch?
Wow. I had no idea anyone had even looked at
I'm happy to report that dozens of lists so far are taking
advantage of the new easier-to-use localization forms. For
example, this archive made a transition to German yesterday:
http://www.mail-archive.com/illustrator@domeus.de
http://www.mail-archive.com/illustrator@domeus.de/info.html
There's
Every month or two the following happens. Someone does a Google
search on their email address, sees a five-year-stale Google
search summary and panics. We had a very light obfuscation
technique back then, which you don't notice unless you look hard.
Anyway, the point is lots of people are very
On the other [list], the mail-archive shows up with the light
obfuscation, i.e. name [at] domain.com.
JF, you get the award for most mysterious problems. If M-A gets
a message with name [at] domain.com, it'll keep it. If M-A gets
an address with an @ symbol, it'll obfuscate. Good luck figuring
This is a test message to see if we've fixed archiving
for [EMAIL PROTECTED]
-Jeff
___
Discussion list for The Mail Archive
Gossip@jab.org
http://jab.org/cgi-bin/mailman/listinfo/gossip
Hi all,
We've been load testing a new server for the last couple of weeks, and
plan to switch over tomorrow. Archiving will be put on hold at 5:30am
US Pacific time (UTC -0700) and resume probably in the
afternoon. Hopefully there will be very little noticeable downtime.
This will be - believe
Hi all,
We're upgrading The Mail Archive's internal network to gigabit
ethernet this afternoon, and need to take the main webserver offline
to install a new network card. With luck total downtime will be well
under one hour. All inbound mail will be queued during this time.
Cheers,
Jeff
With Earl's help, the problem with pre-June19th subject lines
in index pages has been fixed. Index pages should look pretty
good now.
Cheers,
Jeff
PS. Happy 4th of July (I'm planning to watch fireworks in a couple
of hours).
___
Discussion list for
1 - 100 of 279 matches
Mail list logo