Dear Tommaso, thank you for your kind reply.
I know I have a lot to study before actually starting any code and that's
why any suggestion is so valuable.
So, you're suggesting that a simplification of the system using only the
paramagnetic centers can be a good approach? (I'm not sure if I understood
it correctly).
My main idea was, at first, try to represent the systems as realistically
as possible (using coordinates). I know that the software will not know
what a bond is or what an intermolecular interaction is but, let's say,
after including 1000s of examples in the training, I was expecting that (as
an example) finding a C 0.000 and an H at 1.000 should start to "make
sense" because it leads to an experimental trend. And I totally agree that
my way to represent the system is not the better.

Thank you so much for all the help.

On Mon, Mar 27, 2017 at 4:15 PM, Tommaso Costanzo <
tommaso.costanz...@gmail.com> wrote:

> Dear Henrique,
>
>
> I agree with Robert on the use of a supervised algorithm and I would also
> suggest you to try a semisupervised one if you have trouble in labeling
> your data.
>
>
> Moreover, as a chemist I think that the input you are thinking to use is
> not the in the best form for machine learning because you are trying to
> predict coupling J(AB) but in the future space you have only coordinates
> (XYZ). What I suggest is to generate the pair of atoms externally and then
> use a matrix of the form (Mx3), where M are the pairs of atoms you want to
> predict your J and 3 are the features of the two atoms (distance, angle,
> unpaired electrons). For a supervised approach you will need a training set
> where the J is know so your training data will be of the form Mx4 and the
> fourth feature will be the J you know.
>
> Hope that this is clear, if not I will be happy to help more
>
>
> Sincerely
>
> Tommaso
>
> 2017-03-27 13:46 GMT-04:00 Henrique C. S. Junior <henrique...@gmail.com>:
>
>> Dear Robert, thank you. Yes, I'd like to talk about some specifics on the
>> project.
>> Thank you again.
>>
>> On Mon, Mar 27, 2017 at 2:25 PM, Robert Slater <rdsla...@gmail.com>
>> wrote:
>>
>>> You definitely can use some of the tools in sci-kit learn for supervised
>>> machine learning.  The real trick will be how well your training system is
>>> representative of your future predictions.  All of the various regression
>>> algorithms would be of some value and you make even consider an ensemble to
>>> help generalize.  There will be some important questions to answer--what
>>> kind of loss function do you want to look at?  I assumed regression
>>> (continuous response) but it could also classify--paramagnetic,
>>> diamagnetic, ferromagnetic, etc...
>>>
>>> Another task to think about might be dimension reduction.
>>> There is no guarantee you will get fantastic results--every problem is
>>> unique and much will depend on exactly what you want out of the
>>> solution--it may be that we get '10%' accuracy at best--for some systems
>>> that is quite good, others it is horrible.
>>>
>>> If you'd like to talk specifics, feel free to contact me at this email.
>>> I have a background in magnetism (PhD in magnetic multilayers--i was
>>> physics, but as you are probably aware chemisty and physics blend in this
>>> area) and have a fairly good knowledge of sci-kit learn and machine
>>> learning.
>>>
>>>
>>>
>>> On Mon, Mar 27, 2017 at 10:50 AM, Henrique C. S. Junior <
>>> henrique...@gmail.com> wrote:
>>>
>>>> I'm a chemist with some rudimentary programming skills (getting started
>>>> with python) and in the middle of the year I'll be starting a Ph.D. project
>>>> that uses computers to describe magnetism in molecular systems.
>>>>
>>>> Most of the time I get my results after several simulations and
>>>> experiments, so, I know that one of the hardest tasks in molecular
>>>> magnetism is to predict the nature of magnetic interactions. That's why
>>>> I'll try to tackle this problem with Machine Learning (because such
>>>> interactions are dependent, basically, of distances, angles and number of
>>>> unpaired electrons). The idea is to feed the computer with a large training
>>>> set (with number of unpaired electrons, XYZ coordinates of each molecule
>>>> and experimental magnetic couplings) and see if it can predict the magnetic
>>>> couplings (J(AB)) of new systems:
>>>> (see example in the attached image)
>>>>
>>>> Can Scikit-Learn handle the task, knowing that the matrix used to
>>>> represent atomic coordinates will probably have a different number of atoms
>>>> (because some molecules have more atoms than others)? Or is this a job
>>>> better suited for another software/approach? ​
>>>>
>>>>
>>>> --
>>>> *Henrique C. S. Junior*
>>>> Industrial Chemist - UFRRJ
>>>> M. Sc. Inorganic Chemistry - UFRRJ
>>>> Data Processing Center - PMP
>>>> Visite o Mundo Químico <http://mundoquimico.com.br>
>>>>
>>>> _______________________________________________
>>>> scikit-learn mailing list
>>>> scikit-learn@python.org
>>>> https://mail.python.org/mailman/listinfo/scikit-learn
>>>>
>>>>
>>>
>>> _______________________________________________
>>> scikit-learn mailing list
>>> scikit-learn@python.org
>>> https://mail.python.org/mailman/listinfo/scikit-learn
>>>
>>>
>>
>>
>> --
>> *Henrique C. S. Junior*
>> Industrial Chemist - UFRRJ
>> M. Sc. Inorganic Chemistry - UFRRJ
>> Data Processing Center - PMP
>> Visite o Mundo Químico <http://mundoquimico.com.br>
>>
>> _______________________________________________
>> scikit-learn mailing list
>> scikit-learn@python.org
>> https://mail.python.org/mailman/listinfo/scikit-learn
>>
>>
>
>
> --
> Please do NOT send Microsoft Office Attachments:
> http://www.gnu.org/philosophy/no-word-attachments.html
>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
>


-- 
*Henrique C. S. Junior*
Industrial Chemist - UFRRJ
M. Sc. Inorganic Chemistry - UFRRJ
Data Processing Center - PMP
Visite o Mundo Químico <http://mundoquimico.com.br>
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to