Hi Kevin and SpamAssassin Dev Community,
Which one would be better for testing mechanisms; Travis CI or Cmake.

Thanks...
Saahil Sirowa
Indian Institute of Technology Hyderabad
B. Tech Computer Science and Engineering

On Mon 26 Mar, 2018, 09:58 Saahil Sirowa, <[email protected]> wrote:

>
> On Mon 26 Mar, 2018, 09:57 Saahil Sirowa, <[email protected]>
> wrote:
>
>> Hi Kevin and SpamAssassin Dev Community,
>> Which one would be better for testing mechanisms; Travis CI or Cmake.
>>
>> Thanks...
>> Saahil Sirowa
>> Indian Institute of Technology Hyderabad
>> B. Tech Computer Science and Engineering
>>
>> On Mon 26 Mar, 2018, 07:29 Saahil Sirowa, <[email protected]>
>> wrote:
>>
>>> Hi Kevin,
>>> I know you have already gone through the proposal once. But, I still
>>> request you to go through it. Your suggestions in this final phase will
>>> prove valuable.
>>>
>>> Awaiting for a favorable response.
>>>
>>> I intentionally didn't sent this mail in dev mailing list.
>>>
>>> Thanks...
>>> Saahil Sirowa
>>> B. Tech Computer Science and Engineering
>>> Indian Institute of Technology, Hyderabad
>>>
>>> On Mon, Mar 26, 2018 at 7:24 AM, Saahil Sirowa <
>>> [email protected]> wrote:
>>>
>>>> Hi Kevin and Spam Assassin Dev Community,
>>>> I have made some changes in the draft.
>>>> GSoC 2018 Proposal
>>>> <https://docs.google.com/document/d/1-OCNv79sHvVViKwnrRYtlMiKWLCzz4xUW4tNOlmaTmw/edit?usp=sharing>
>>>>
>>>> I request you all to rigorously review it and suggest appropriate
>>>> edits. As, this is the final phase of the application period(Deadline 27th
>>>> March 16:00 UTC), I would really appreciate it If you respond before this.
>>>> This will help me in incorporating the suggested changes in time.
>>>>
>>>> Thanks...
>>>> Saahil Sirowa
>>>> B. Tech Computer Science and Engineering
>>>> Indian Institute of Technology, Hyderabad
>>>>
>>>>
>>>> On Fri, Mar 23, 2018 at 7:55 PM, Saahil Sirowa <
>>>> [email protected]> wrote:
>>>>
>>>>> I had some in last 2-3 days. I will update the proposal draft  with
>>>>> required changes by tomorrow night(Sat night).
>>>>>
>>>>> Thanks...
>>>>> Saahil Sirowa
>>>>> B. Tech Computer Science and Engineering
>>>>> Indi@n Institute of Technology, Hyderabad
>>>>>
>>>>> On Fri 23 Mar, 2018, 18:01 Kevin A. McGrail, <[email protected]>
>>>>> wrote:
>>>>>
>>>>>> Wanted to check in and see how you are doing.  THis blog post has
>>>>>> gotten some praise
>>>>>>
>>>>>>
>>>>>> https://medium.com/@owtf/google-summer-of-code-writing-a-good-proposal-141b1376f076
>>>>>> .
>>>>>>
>>>>>> --
>>>>>> Kevin A. McGrail
>>>>>> Asst. Treasurer & VP Fundraising, Apache Software Foundation
>>>>>> Chair Emeritus Apache SpamAssassin Project
>>>>>> https://www.linkedin.com/in/kmcgrail - 703.798.0171
>>>>>>
>>>>>> On Wed, Mar 21, 2018 at 7:52 AM, Kevin A. McGrail <
>>>>>> [email protected]> wrote:
>>>>>>
>>>>>>> Comments allowed might be helpful though :-)
>>>>>>>
>>>>>>> --
>>>>>>> Kevin A. McGrail
>>>>>>> Asst. Treasurer & VP Fundraising, Apache Software Foundation
>>>>>>> Chair Emeritus Apache SpamAssassin Project
>>>>>>> https://www.linkedin.com/in/kmcgrail - 703.798.0171
>>>>>>> <(703)%20798-0171>
>>>>>>>
>>>>>>> On Wed, Mar 21, 2018 at 12:36 AM, Rajkiran Rajkumar <
>>>>>>> [email protected]> wrote:
>>>>>>>
>>>>>>>> @Saahil, kindly make your doc view-only for people with a link to
>>>>>>>> it. Giving edit permissions to the world is a bad idea.
>>>>>>>>
>>>>>>>> Thanks,
>>>>>>>> Rajkiran
>>>>>>>>
>>>>>>>> On Tue, Mar 20, 2018 at 5:17 PM, Kevin A. McGrail <
>>>>>>>> [email protected]> wrote:
>>>>>>>>
>>>>>>>>> +users
>>>>>>>>>
>>>>>>>>> All we give is feedback.  The submission to GSoC is what matters.
>>>>>>>>> So if you mentioned perl here that's not going to carryover to the
>>>>>>>>> reviewers.
>>>>>>>>>
>>>>>>>>> Can someone with fresh eyes take a look at this?  I read it too
>>>>>>>>> recently so I will gloss over it too much.
>>>>>>>>>
>>>>>>>>> Here are some posts the mentors list thought might be helpful.
>>>>>>>>> The first I believe covers someone's pov who did not get selected.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> https://medium.freecodecamp.org/hacking-gsoc-how-to-gain-real-life-experience-and-support-open-source-b1e6a664f6e4?source=linkShare-53ba2bb84284-1521381334
>>>>>>>>>
>>>>>>>>> https://sanatt.me/2017/12/30/cracking-google-summer-code-2018/
>>>>>>>>>
>>>>>>>>> Regards, KAM
>>>>>>>>>
>>>>>>>>> On Tue, Mar 20, 2018, 03:57 Saahil Sirowa <
>>>>>>>>> [email protected]> wrote:
>>>>>>>>>
>>>>>>>>>> Hi Kevin and Apache SpamAssassin Dev Community,
>>>>>>>>>>
>>>>>>>>>> I have resolved all the changes you suggested in the previous
>>>>>>>>>> draft.
>>>>>>>>>> 1) I mentioned about learning PERL a week before the community
>>>>>>>>>> bonding period. It will not take much time. I can assure you that 
>>>>>>>>>> language
>>>>>>>>>> is not going to be an issue.
>>>>>>>>>> 2) I updated the biography part a bit
>>>>>>>>>> 3) Significant changes have been made in the Timeline.
>>>>>>>>>> 4) I'm planning to used cmake/travis ci for automated testing. If
>>>>>>>>>> there is a better alternative please do suggest.
>>>>>>>>>> 5) I gave links to research papers that i will be reading in the
>>>>>>>>>> timeline.
>>>>>>>>>> 6) I updated the timeline by mentioning to gain advanced
>>>>>>>>>> information about email traffic and spams. I listed some links for 
>>>>>>>>>> the
>>>>>>>>>> purpose.
>>>>>>>>>> 7) I updated the credits
>>>>>>>>>> 8) There are other changes made in various parts of proposal.
>>>>>>>>>>
>>>>>>>>>> Thanks for your previous detailed feedback.
>>>>>>>>>>
>>>>>>>>>> Here is link to the updated proposal
>>>>>>>>>> GSoC 2018 proposal
>>>>>>>>>> <https://docs.google.com/document/d/1-OCNv79sHvVViKwnrRYtlMiKWLCzz4xUW4tNOlmaTmw/edit#heading=h.q7h3lddabdvh>
>>>>>>>>>> Please rigorously review it and suggest any changes that I should
>>>>>>>>>> make.
>>>>>>>>>>
>>>>>>>>>> Awaiting for a favorable response.
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Thanks...
>>>>>>>>>> Saahil Sirowa
>>>>>>>>>> B. Tech Computer Science and Engineering
>>>>>>>>>> Indian Institute of Technology, Hyderabd
>>>>>>>>>>
>>>>>>>>>> On Mon, Mar 19, 2018 at 3:27 AM, Kevin A. McGrail <
>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hi Saahil
>>>>>>>>>>>
>>>>>>>>>>> re: Perl. As the project is primarily in Perl and you do not
>>>>>>>>>>> list that in your Proficiencies or any similar languages like PHP, 
>>>>>>>>>>> I would
>>>>>>>>>>> address that.  The word Perl does not appear a single time.
>>>>>>>>>>>
>>>>>>>>>>> Your Biography is a little light on why this is something you
>>>>>>>>>>> feel you can implement.  The mentors will likely NOT be able to 
>>>>>>>>>>> help you
>>>>>>>>>>> with the science rather focusing on the community, processes, and 
>>>>>>>>>>> open
>>>>>>>>>>> source in general.
>>>>>>>>>>>
>>>>>>>>>>> re: Email and SPam, do you have any experience with email
>>>>>>>>>>> traffic or spam?  if so, add it.  If not, explain what you plan to 
>>>>>>>>>>> do to
>>>>>>>>>>> address that.
>>>>>>>>>>>
>>>>>>>>>>> Re: Deliverables, I think you'll need to propose the first draft
>>>>>>>>>>> of that.  But your goal will likely be a plugin for Apache 
>>>>>>>>>>> SpamAssassin
>>>>>>>>>>> that can be installed and configured to provide multiple 
>>>>>>>>>>> configurable
>>>>>>>>>>> statistical analysis algorithms to better identify ham (good email) 
>>>>>>>>>>> and/or
>>>>>>>>>>> spam (bad email)
>>>>>>>>>>>
>>>>>>>>>>> Please use Apache SpamAssassin to properly brand the title.
>>>>>>>>>>>
>>>>>>>>>>> Re: I have no input on the scheduling/timelines except that past
>>>>>>>>>>> proposal I have read have included more phases and do not add 
>>>>>>>>>>> "optional"
>>>>>>>>>>> items.  I'd prefer to see small increments to make sure you stay on
>>>>>>>>>>> schedule and don't get overwhelmed and find yourself way behind as 
>>>>>>>>>>> the time
>>>>>>>>>>> progresses.
>>>>>>>>>>>
>>>>>>>>>>> Re: Testing Methodology, this is likely the most critical
>>>>>>>>>>> missing part.  I am a fan of test driven development where you set 
>>>>>>>>>>> up tests
>>>>>>>>>>> that should pass and fall and use continuous testing as you add 
>>>>>>>>>>> code to
>>>>>>>>>>> confirm your development is progressing well.
>>>>>>>>>>>
>>>>>>>>>>> This is especially important because spam analysis often doesn't
>>>>>>>>>>> work the way people expect and tests w/statistics can help identify 
>>>>>>>>>>> issues.
>>>>>>>>>>>
>>>>>>>>>>> For example, this is a hypothesis that this statistical
>>>>>>>>>>> algorithms will be better than Bayes.  So you'll need a baseline for
>>>>>>>>>>> comparison.
>>>>>>>>>>>
>>>>>>>>>>> Additionally, even experts in the field are surprised when they
>>>>>>>>>>> think something will prove the hamminess of an email but in fact 
>>>>>>>>>>> shows the
>>>>>>>>>>> opposite.  Real world example, SPF is a policy when introduced was 
>>>>>>>>>>> supposed
>>>>>>>>>>> to allow an automated mechanism that says "this is an email from a
>>>>>>>>>>> legitimate mail server for my domain".
>>>>>>>>>>>
>>>>>>>>>>> However, the FIRST wave of people to adobt it were all
>>>>>>>>>>> spammers.  So it became a spam indicator more than a spam 
>>>>>>>>>>> indicator.  It
>>>>>>>>>>> was a very interesting outcome.
>>>>>>>>>>>
>>>>>>>>>>> Re: Corpora, you'll want a corpora of carefully hand sorted ham
>>>>>>>>>>> and spam.  Have you thought about how you'll get that?  I *might* 
>>>>>>>>>>> be able
>>>>>>>>>>> to help but it's 50/50.
>>>>>>>>>>>
>>>>>>>>>>> Re: You mention reading research papers on statisical algorithms
>>>>>>>>>>> from a previous proposal.  You'll want to list them to show which 
>>>>>>>>>>> ones you
>>>>>>>>>>> plan to study
>>>>>>>>>>>
>>>>>>>>>>> re: "Discussions with the SA community regarding the various
>>>>>>>>>>> types of spams that the present SA can handle." is unclear.  What 
>>>>>>>>>>> is a
>>>>>>>>>>> "type of spam" to you?  Do you have a list of types of spam?
>>>>>>>>>>>
>>>>>>>>>>> re: "Brainstorming with the mentors and SA community about the
>>>>>>>>>>> various input features and parameters that can have a huge impact 
>>>>>>>>>>> on the
>>>>>>>>>>> overall performance of the listed neural nets models." I think this 
>>>>>>>>>>> is
>>>>>>>>>>> flawed.  There won't be a ton of people who can discuss this with 
>>>>>>>>>>> you.
>>>>>>>>>>> You'll need to likely use scientific process to show what has a 
>>>>>>>>>>> performance
>>>>>>>>>>> impact.  This is not busy work or school work.  This is an 
>>>>>>>>>>> experiment that
>>>>>>>>>>> has not been tried at the SA project.
>>>>>>>>>>>
>>>>>>>>>>> re: "actively involved with the community." is a stretch.  A few
>>>>>>>>>>> emails do not active involvement make.
>>>>>>>>>>>
>>>>>>>>>>> re: Bonding, you might consider raising that to 1-2 major bugs
>>>>>>>>>>> and 10-20 minor bugs.
>>>>>>>>>>>
>>>>>>>>>>> Re: Credits/references, I would add more clarity about where
>>>>>>>>>>> each of those references are used.
>>>>>>>>>>>
>>>>>>>>>>> Regards,
>>>>>>>>>>> KAM
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>
>>>>
>>>

Reply via email to