Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-04-02 Thread Terri Oda
On 03/29/2012 11:58 PM, Shayan Md wrote: Okay then, can you please tell me how we can put this search code in best use of mailman3? I have a proposal to write, I am getting unsure of things day by day. Can you also tell me who is the mentor of this project? When it comes to writing your proposa

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Shayan Md
On Fri, Mar 30, 2012 at 10:48 AM, Stephen J. Turnbull wrote: > On Fri, Mar 30, 2012 at 12:24 PM, Shayan Md wrote: > > On Fri, Mar 30, 2012 at 5:05 AM, Stephen J. Turnbull >wrote: > > >> And (2) search and retrieval may > >> do a *lot* of message access, for example if you want to do data > >> mi

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Stephen J. Turnbull
On Fri, Mar 30, 2012 at 12:24 PM, Shayan Md wrote: > On Fri, Mar 30, 2012 at 5:05 AM, Stephen J. Turnbull > wrote: >> And (2) search and retrieval may >> do a *lot* of message access, for example if you want to do data >> mining (see Ana from Spain's thread). > Isn't it the purpose of index? Y

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Shayan Md
On Fri, Mar 30, 2012 at 5:05 AM, Stephen J. Turnbull wrote: > On Fri, Mar 30, 2012 at 3:55 AM, Shayan Md wrote: > > > Assuming that we have something like this (object-ID-addressable, if I am > > not wrong, mailman3 made it possible but not yet implemented as it's part > > of archiver), is it over

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Barry Warsaw
On Mar 30, 2012, at 02:18 AM, Stephen J. Turnbull wrote: >> I suspect that there will be plenty of mailing lists that get fed messages >> from programs, e.g. think vcs -commit diff lists.  Those programs can also >> be buggy, but again I'd prefer that Mailman not compromise on this issue >> for th

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Jeff Breidenbach
> An archiver should certainly provide an interface to look up a message by > [...] the hash. Including List-Id in the hash calculation allows the archiver to display a cross posted message in context. See http://www.mail-archive.com/faq.html#listserver Also, a gentle reminder that I put some c
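
To illustrate the point about including List-Id in the hash, here is a minimal sketch in Python. The helper name `archive_key`, the choice of SHA-1, the NUL separator, and the base32 encoding are all assumptions made for illustration; this is not claimed to be the scheme mail-archive.com or Mailman actually uses (see the FAQ linked above for that).

```python
import base64
import hashlib


def archive_key(message_id: str, list_id: str = "") -> str:
    """Return a stable lookup key for a message (hypothetical helper)."""
    # Drop whitespace and angle brackets so "<a@b>" and "a@b" give the same key.
    mid = message_id.strip().strip("<>")
    # List identifiers are compared case-insensitively in this sketch.
    lid = list_id.strip().strip("<>").lower()
    digest = hashlib.sha1()
    digest.update(mid.encode("utf-8"))
    # A separator keeps ("ab", "c") distinct from ("a", "bc").
    digest.update(b"\0")
    digest.update(lid.encode("utf-8"))
    return base64.b32encode(digest.digest()).decode("ascii")
```

With the list identifier mixed in, a message cross-posted to two lists gets two different keys, so an archiver could show each copy in the context of its own list; hashing the Message-ID alone would give one key shared by both copies.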

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Stephen J. Turnbull
On Fri, Mar 30, 2012 at 3:55 AM, Shayan Md wrote: > Assuming that we have something like this (object-ID-addressable, if I am > not wrong, mailman3 made it possible but not yet implemented as it's part > of archiver), is it over ambitious to plan to implement indexer/searcher > for mailman3 and a

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Shayan Md
On Wed, Mar 28, 2012 at 6:59 AM, Stephen J. Turnbull wrote: > On Wed, Mar 28, 2012 at 4:21 AM, Terri Oda wrote: > > >> Looks like archiver for mm3 is still in development stage. As far as I > >> understand searcher depends on the archiver, right? Not completely but > it > >> somewhat depends on a

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Stephen J. Turnbull
Barry writes: > I suspect that there will be plenty of mailing lists that get fed > messages from programs, e.g. think vcs -commit diff lists.  Those programs can > also be buggy, but again I'd prefer that Mailman not compromise on this issue > for their sake. I predict you will eventually l

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Richard Wackerbarth
On Mar 29, 2012, at 10:17 AM, Barry Warsaw wrote: > (Aside: Is there some reason why you To: me and CC: the list rather than > having the list address in the To: field? I ask because I'm wondering if it's > a gmail thing, or something about your MUA, and because I suppress the list > copy if I'm

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Barry Warsaw
(Aside: Is there some reason why you To: me and CC: the list rather than having the list address in the To: field? I ask because I'm wondering if it's a gmail thing, or something about your MUA, and because I suppress the list copy if I'm CC'd directly, I don't get a List-Post: header, so my MUA's

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-29 Thread Stephen J. Turnbull
On Thu, Mar 29, 2012 at 8:07 AM, Barry Warsaw wrote: > Mailman 3 itself requires unique Message-IDs. So? FWIW, I don't think I agree with that requirement (even RFC 5322 doesn't make it a "MUST"), but I'm not going to argue with you about Mailman 3 design, that's your pidgin. But there's nothi

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-28 Thread Barry Warsaw
On Mar 27, 2012, at 11:51 AM, Toshio Kuratomi wrote: >The searcher wouldn't be much use without an archiver. There is a sample >archiver in mailman core -- if enabled, it stores the messages to lists in >maildirs. It does not have a frontend for retrieving or otherwise >displaying the archives.
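
Since the sample archiver described above only stores messages in maildirs and has no retrieval frontend, a searcher would need its own way to read them. The sketch below uses only Python's standard `mailbox` module; the function name `find_by_message_id` and the linear scan are assumptions for illustration, not part of Mailman's API, and a real indexer would build an index rather than scan every message.

```python
import mailbox
from typing import Optional


def find_by_message_id(maildir_path: str,
                       message_id: str) -> Optional[mailbox.MaildirMessage]:
    """Linearly scan a maildir for a message with the given Message-ID.

    Hypothetical helper; returns None when no message matches.
    """
    # create=False raises mailbox.NoSuchMailboxError instead of silently
    # creating an empty maildir at a mistyped path.
    box = mailbox.Maildir(maildir_path, create=False)
    try:
        for msg in box:
            if msg.get("Message-ID", "").strip() == message_id:
                return msg
        return None
    finally:
        box.close()
```

The scan costs one file read per stored message for every lookup, which is the kind of message access an index is meant to avoid.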

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-28 Thread Barry Warsaw
On Mar 28, 2012, at 06:06 PM, Bill Janssen wrote: >Right. UpLib has a 'message-store', which the threading code interacts >with to generate threads as data referring to document IDs. The >message-store API can take both message-IDs or UpLib document IDs and >resolve them. Say Bill, how would yo

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-28 Thread Barry Warsaw
On Mar 28, 2012, at 10:29 AM, Stephen J. Turnbull wrote: >The only tricky issue is that we *do* have to worry about message-ID >collisions of truly different messages and about messages without message >IDs, especially for converted historical archives. So the API needs to be >able to deal with t

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-28 Thread Bill Janssen
Stephen J. Turnbull wrote: > On Wed, Mar 28, 2012 at 4:21 AM, Terri Oda wrote: > > >> Looks like archiver for mm3 is still in development stage. As far as I > >> understand searcher depends on the archiver, right? Not completely but it > >> somewhat depends on archiver. I am not sure if searche

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-27 Thread Stephen J. Turnbull
On Wed, Mar 28, 2012 at 4:21 AM, Terri Oda wrote: >> Looks like archiver for mm3 is still in development stage. As far as I >> understand searcher depends on the archiver, right? Not completely but it >> somewhat depends on archiver. I am not sure if searcher can be implemented >> without archive

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-27 Thread Terri Oda
On 03/27/2012 03:31 AM, Shayan Md wrote: I was working on mm3. But systers' indexer/searcher was implemented for mailman2. So it must be easy to integrate it with mm2. Actually, the systers indexer was designed to work with mboxes (because I had a pile of data in that format that the stud

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-27 Thread Toshio Kuratomi
On Tue, Mar 27, 2012 at 03:01:41PM +0530, Shayan Md wrote: > > > On Tue, Mar 27, 2012 at 1:11 AM, Toshio Kuratomi wrote: > Is this integration to be done with mailman2 or mailman3? > > In mailman3, the archivers are separated from the mailman core. > > I was working on mm3. But systers

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-27 Thread Shayan Md
On Tue, Mar 27, 2012 at 1:11 AM, Toshio Kuratomi wrote: > On Mon, Mar 26, 2012 at 04:57:44PM +0530, Shayan Md wrote: > > Hi, > > > > I am Shayan, I am doing my masters from IISc Bangalore. I want to take > part > > in GSoC from mailman organization. I have fairly good experience in > python. > >

Re: [Mailman-Developers] Google Summer of Code: Integration of Search Code

2012-03-26 Thread Toshio Kuratomi
On Mon, Mar 26, 2012 at 04:57:44PM +0530, Shayan Md wrote: > Hi, > > I am Shayan, I am doing my masters from IISc Bangalore. I want to take part > in GSoC from mailman organization. I have fairly good experience in python. > I worked on whoosh library for my own project. I have experience with > d