On 04/17/2013 02:43 AM, Florian Fuchs wrote:
> Hi Manish, hi everyone,
>
> 2013/4/10 Manish Gill :
>> For the GSoC REST API project, I've been wondering about how
>> authentication would work.
>>
>> OAuth is a way to go if we want authenticated/signed requests. I have a
>> few questions regarding t
Hi,
I am working on "better user settings management" for past few weeks. As i
have worked on django earlier also, so i am enjoying it.I branched out
postorius few weeks back and sent a merge request but after having
discussion with terri and florian, i now know that extension of django's
User clas
for identifying an important message a classifier will be implemented. and
thanks for pointing out the issue regarding the delivery of the message, if
it is delivered twice then the existing implementation of delivery is
sufficient, but if we want to deliver it only once then for each person we
nee
In evaluating a proposal, we need to look at a number of factors:
First, will it work? -- Does the proposed design accomplish the stated
objective?
Next: Is it useful?
And: Can the candidate be expected to accomplish the task within the allotted
time frame?
Finally: Is it the best use for our li
thanks a lot for the information. Thing is that I don't think
that the Spam classifier by itself is going to be big enough so I came up
with this idea. Actually I also need to know what the community wants,
regarding the e-mail delivery. and regarding the classifier I don't think
that it
On 13-04-17 6:56 AM, Avik Pal wrote:
Meanwhile It would be much appreciated if someone can direct me to
an labeled dataset available on line.
Leaving aside entirely the question of whether we should (or will)
support any project that requires learning on this scale, as a former
anti
On 13-04-16 10:31 PM, Stephen J. Turnbull wrote:
Pratik Sarkar writes:
> I am working on the proposal.And how many slots are there for the filter
> project?
There are no slots for the filter project as such. The whole Mailman
project has slots, and they are somewhat fluid, since we operat
thanks a lot Terri, I think I will go with the Enron email dataset and
they are to be cross validated against publicly available legitimate
mailing list mails and Spam and (hopefully) python's regular expressions
will help me a lot building the synthetic set.
Avik Pal
Bengal Engineering & Scieen
> On 17 April 2013 21:02, Terri Oda wrote:
>
>>
>> On 13-04-17 6:56 AM, Avik Pal wrote:
>>
>>> Meanwhile It would be much appreciated if someone can direct
>>> me to
>>> an labeled dataset available on line.
>>>
>>> Leaving aside entirely the question of whether we should (or will)
>> s
On 13-04-17 10:10 AM, Avik Pal wrote:
Don't lose hope Terri, after digging for a couple of hours came across
this and its pretty much updated. http://untroubled.org/spam/
Finding sources of spam (like that one) isn't that hard; it's finding
sources of legit email combined with spam and classi
ya I get your point, but see these are part of any machine learning
project, and feature extraction has to be done considering the synthetic
data set.
On 17 April 2013 22:05, Terri Oda wrote:
>
>
> Finding sources of spam (like that one) isn't that hard; it's finding
> sources of legit email
I'm glad you're somewhat aware of the issues. I frequently encounter
folk who aren't aware of the issues in machine learning, so your "don't
lose hope" email set off all kinds of warning bells in my head.
Going back to GSoC-specific stuff:
- Enron is a very old data set
- If you're going to u
Avik Pal writes:
> Meanwhile It would be much appreciated if someone can direct me to
> an labeled dataset available on line.
By "labelled" you mean pre-classified into spam vs ham? I see you
already found one, but you could also check the SpamBayes and
SpamAssassin distributions.
> Here I h
On Apr 17, 2013, at 12:43 AM, Avik Pal wrote:
>also I would like to propose an idea of my own. Many of us set the preference
>in mailman to get all the emails of a day batched together, but sometimes
>this means we miss important mails(though we get it at the end of the day but
>we miss the moment
Hello !
By way of introduction I am *Adwait Sharma,* a final year computer science
undergraduate from Bangalore, India who code a lot specially in Python.
The idea I found most interesting in GSoC 2013 is Boilerplate stripper
-
http://wiki.list.org/display/DEV/Google+Summer+of+Code+2013#GoogleSu
Background for folk new to this discussion:
Currently, all user information is stored in Mailman core, but it's
minimal: a real name, a set of email addresses, subscription info, and
preferences. Barry suggests that it should stay minimal: only the
things Mailman needs to know to correctly de
thanks a lot Stephen for all the suggestions :)
Avik Pal
Bengal Engineering & Scieence University,Shibpur
github:https://github.com/avikpal
IRC:- irc://freenode/avikp,isnick
twitter:-https://twitter.com/avikpalme
On 17 April 2013 22:36, Stephen J. Turnbull wrote:
> Avik Pal writes:
>
> > M
+1 for this, i am desperately waiting for someone to do do this :)
On Thu, Apr 18, 2013 at 5:51 AM, Terri Oda wrote:
> Background for folk new to this discussion:
>
> Currently, all user information is stored in Mailman core, but it's
> minimal: a real name, a set of email addresses, subscripti
Hi,
some first thoughts on it:
1) It should be self-contained.
Meaning: It should not depend on any
mailman/mailman.client/postorius/hyperkitty packages.
2) Like the core, it should implement a Python-based webserver.
It doesn't need to run on Ports 80/443, so we don't have to care about
one of
19 matches
Mail list logo