John, others,

On Mon, Oct 7, 2013 at 11:31 AM, John May <john.wilkinson...@gmail.com> wrote:
> I'm actually okay with ChemicalEntity but as a more appropriate name for the
> existing ChemObject (which AtomContainer already extends).

Now there is a class we must reconsider... ChemObject exists because
we decided to have some core properties and no one came up with a
better idea; it's the Java approach, I guess. And a Map on each atom
and bond is questionable at least... Similarly, we for long want to
remove all those *FooSet classes and just List<IAtomContainer>,
ummm... List<IMolecule>, ummm.... List<IChemicalEntity>...

Rajarshi, now that you have some time, want to push some of your ideas
in this area in the master?

> I think 1.7+ we'll need a long think about the object hierarchy and tighten
> up the definitions a bit. I have no problem representing the complexity of
> chemistry

Agreed. To me naming of classes is quite orthogonal to the
hierarchy... we can simply one, while keeping the other more complex.
(where the one is the hierarchy :)

> but it shouldn't try to encode an ontology in the classes.

We most certainly don't have to replicate a full ontology. I have also
the insight that while we should have ontologies to encode isotope
information, the running CDK should use a simpler format which is way
much faster at loading and smaller in size, and created from the
proper source.

John, we have a lot to do. But please don't forget, speed never was a
goal! Functionality was, and that is what is what made the CDK big.
Now we introduce speed, but have to be careful not to loose
functionality in the process. We *can* do both.

(In fact, a paper a year or two ago showed that the CDK was not really
that slow at all... and sometimes even faster than C/C++... the
reference should be somewhere in my CiteULike library... still, we can
be faster, and you have done amazing work here.)

Egon

-- 
Dr E.L. Willighagen
Postdoctoral Researcher
Department of Bioinformatics - BiGCaT
Maastricht University (http://www.bigcat.unimaas.nl/)
Homepage: http://egonw.github.com/
LinkedIn: http://se.linkedin.com/in/egonw
Blog: http://chem-bla-ics.blogspot.com/
PubList: http://www.citeulike.org/user/egonw/tag/papers
ORCID: 0000-0001-7542-0286

------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk
_______________________________________________
Cdk-user mailing list
Cdk-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to