On Jun 13, 2018, at 3:12 PM, Richard Hipp <d...@sqlite.org> wrote:
> 
> I have found is that forum
> and ticket traffic far exceeds the amount of source code.

I just exported my local 6-year archive of the SQLite mailing list traffic to 
an mbox file, then gzipped it, yielding a 31 MB file.  The current 
sqlite.fossil file is 51 MB.

If we do a linear projection back to the start of the mailing list — August 
2002, according to MARC — then we’d expect the full ML archive to be roughly 
100 MB, not counting the -dev list and such.

That estimate includes the peak in traffic many years ago:

    https://imgur.com/a/UdQ5ziH

That chart shows that the SQLite-Users mailing list is “dying,” in that its 
message volume is currently lower than near the mailing list’s beginning, 
despite the much greater popularity of the library these days.  I ascribe that 
to an increasing percentage of questions having web-searchable answers now, 
plus a decreasing willingness to put up with mailing lists.

(I just corrected a funny autocorrect: malign lists. :) )

My gzipped mbox test is worse than a Fossil-based solution, which:

1. would not have all of those redundant RFC 822 mail header labels;

2. would not need many of the message header lines at all (e.g. X-Abuse-Info or 
List-Unsubscribe); and 

3. would not store many of the fields in inefficient plain text forms within 
the SQLite DB.   (e.g. Delivery-Date might be a time_t stored in an INTEGER 
column rather than the verbose text RFC 822 format, and if not, then it would 
probably be the more compact ISO 8601 format so as to make the SQLite date/time 
routines happy.)

There would be no extra disk space cost on sqlite.org, since I assume you’re 
already keeping a mailing list archive there.

The extra space on the wire during cloning can be solved in the same way /uv is 
handled today, if need be.

> this kind of traffic does not lend itself well to delta compression.

In fact, delta compression would benefit two use cases of the Fossil Forum 
Feature:

1. The post editing case, just as with Fossil wiki articles today.

2. Quotations in replies.

Granted, a forum feature would lower the current /stat compression ratios, but 
we’re not really after high compression ratios for their own sake, are we?  
We’re using compression to solve problems: inter-version redundancy, the low 
information density of prose and high-level language code, etc.

Forum posts would be highly compressible by zlib.  The uncompressed mbox file 
in the tests above is about 5x the size of the gzipped version.

> extraneous and noisy forum traffic.

One man’s noise is another man’s signal.

It would be very nice to be able to search the mailing list from within Fossil 
UI, getting only one copy of each matching message, not one from 
mail-archive.com, one from MARC, one from Nabble, one from …

I happen know where we can get a pretty good FTS implementation. ;)

This feature would be even more useful on private projects, where we cannot 
rely on the likes of mail-archive.com to gather and index our discussions.

> This is especially true if attachments are
> allowed on forum posts

You can fob that off on third-party services like Imgur, for the most part, 
just as Stack Exchange does.

If a post really really needs a repo-embedded attachment, it can use an 
embedded doc link, just as you’d do for a Fossil wiki article today.

>> 3. Forum posts can show up in the timeline.
> 
> Yikes.  I think I would certainly want that to be turned off by default.

The timeline search type field defaults to “Check-ins”, does it not?  You’d 
only see forum posts with “Any Type” selected or with a new “Forum Posts” 
filter selected.

Even then, it should be quite manageable.  With the default 100 event search 
window size, you’d still be seeing several days of activity with the current 
level of SQLite mailing list traffic, most of the time.  SQLite’s mailing list 
traffic must be somewhere over the 90th percentile in terms of daily traffic, 
so most Fossil-hosted software projects would show even more days of activity 
in that 100 event window.

To clarify, I don’t mean that the whole forum post would show up in the 
Timeline, just a brief summary, as currently happens with tickets and wiki 
article events.
_______________________________________________
fossil-users mailing list
fossil-users@lists.fossil-scm.org
http://lists.fossil-scm.org:8080/cgi-bin/mailman/listinfo/fossil-users

Reply via email to