Changing Python class/module layout, dropping --rename ?

Andi Vajda Fri, 13 Jul 2012 08:47:31 -0700


On Tue, 10 Jul 2012, Andi Vajda wrote:

I would also like to propose a change, to allow for more flexible
mechanism of generating Python class names. The patch doesn't change
the default pylucene behaviour, but it gives people a way to replace
class names with patterns. I have noticed that there are more
same-name classes from different packages in the new lucene (and it
becomes worse when one has to deal with both lucene and solr).
Another way to fix this is to reproduce the namespace hierarchy used inLucene, following along the Java packages, something I've been dreading todo. Lucene just loooooves a really long deeply nested class structure.
I'm not convinced yet it is bad enough to go down that route, though.
Your proposal to use patterns may in fact yield a much more convenientsolution. Thanks !

Rethinking this a bit, I'm prepared to change my mind on this. Yourpatterned rename patch shows that we're slowly but surely reaching the limitof the current setup that consists in throwing all wrapped classes under theone global 'lucene' namespace.

Lucene 4.0 has seen a large number of deeply nested classes with similarnames added since 3.x. Renaming these one by one (or excluding some) doesn'tscale. Using the proposed patterned rename scales more but makes itdifficult to know what got renamed and how.Ultimately, the more classes that are like-named, the more classes wouldhave instable names from one release to the next as more duplicated namesare encountered.

What if instead JCC supported the original Java namespaces all the way tothe Python inteface (still dropping the original 'org.apache' Java packagetree prefix) ?The world-rooted style of naming Java classes isn't Pythonic but using thesecond half of the package structure feels right at home in the Pythonworld.

JCC already re-creates the complete Java package structure in C++ asnamespaces for all the C++ code it generates, for both the JNI wrapperclasses and the C++/Python types. It's only the installation of the classnames into the Python VM that is done in the flat 'lucene' namespace.

I think it shouldn't be too hard to change the code that installs classes tocreate sub-modules of the lucene module and install classes in thesesubmodules instead (down to however many levels are in the original).


In other words:
  - from lucene import Document
would become
  - from lucene.document import Document

One could of course also say:
  - import lucene.document.Document as whateverOneLikes

If that proposal isn't mortally flawed somewhere, I'm prepared to dropsupport for --rename and replace it with this new Python class/modulelayout.

Since this is being talked about in the context of a major PyLucene release,version 4.0, and that all tests/samples have to be reworked anyway, thisbackwards compat break shouldn't be too controversial, hopefully.

If it is, the old --rename could be preserved for sure, but I'd prefersimplying the JCC interface than to accrete more to it.


What do you think ?

Andi..


Andi..


I can confirm the test_test_BinaryDocument.py crashes the JVM no more.

Roman


On Tue, Jul 10, 2012 at 8:54 AM, Andi Vajda <va...@apache.org> wrote:


 Hi Roman,


On Mon, 9 Jul 2012, Roman Chyla wrote:

Thanks, I am attaching a new patch that adds the missing test base.
Sorry for the tabs, I was probably messing around with a few editors
(some of them not configured properly)



I integrated your test class (renaming it to fit the naming scheme used).
Thanks !

So far, found one serious problem, crashes VM -- see. eg
test/test_BinaryDocument.py - when getting the document using:
reader.document(0)



test/test_BInaryDocument.py doesn't seem to crash the VM but fails because
of some API changes. I suspect the crash to be some issue related to using
an older jcc.

I see a comment saying: "couldn't find any combination with lucene4.0whereit would raise errors". Most of these unit tests are straight ports fromtheoriginal Java version. If you're stumped about a change, check theoriginal

Java test, it may have changed too.

Andi..

Changing Python class/module layout, dropping --rename ?

Reply via email to