Re: Pylucene release

2012-11-29 Thread Andi Vajda

On Nov 29, 2012, at 5:37, Shawn Grant shawn.gr...@orcatec.com wrote:

 Hi Andi, thanks for the explanation.
 
 The main problem I've come across so far is that it looks like the main 
 branch lucene has a lucene41 codec in it that does not appear to be part of 
 the 4.0 release and (I think) is causing problems creating and/or retrieving 
 term vectors.  I'm not a lucene expert and it's been hard to diagnose.  I 
 also can't use Luke due to the codec.

PyLucene trunk is currently tracking Lucene's branch 4.x. I'd expect the 
lucene41 codec to be available there.

 I tried to set the default codec to lucene40 but then my index writer 
 complained that lucene40 was only for reading.

You should ask on the lucene-user@ list. There are more people listening there 
who would know the details.

 I'll try to contribute to porting the unit tests to help move the release 
 along.

Cool !

Andi..

 
 On 11/13/2012 02:18 PM, Andi Vajda wrote:
 
 Hi Shawn,
 
 On Tue, 13 Nov 2012, Shawn Grant wrote:
 
 Hi Andi, I was just wondering if Pylucene is on its usual schedule to 
 release 4-6 weeks after Lucene.  I didn't see any discussion of it on the 
 mailing list or elsewhere.  I'm looking forward to 4.0!
 
 Normally, PyLucene is released a few days after a Lucene release but 4.0 has 
 seen so many API changes and removals that all tests and samples need to be 
 ported to the new API. Last week-end, I ported a few but lots remain to be.
 
 If no one helps, it either means that no one cares enough or that everyone 
 is willing to be patient :-)
 
 The PyLucene trunk svn repository is currently tracking the Lucene Core 4.x 
 branch and you're welcome to use it out of svn. In the ten or so unit tests 
 I ported so far, I didn't find any issues with PyLucene proper (or JCC). All 
 changes were due to the tests being out of date or using deprecated APIs now 
 removed. You might find that PyLucene out-of-trunk is quite usable.
 
 If people want to help with porting PyLucene unit tests, the ones under its 
 'test' directory not yet ported, feel free to ask questions here.
 The gist of it is:
  - fix the imports (look at the first few tests for example,
alphabetically)
  - fix the tests to pass by looking at the original Java tests for changes
as most of these tests were originally ported from Java Lucene.
 
 Once you're familiar with the new APIs, porting the sample code in samples 
 and in LuceneInAction should fairly straightforward. It's just that there is 
 a lot to port.
 
 Andi..


AW: Pylucene release

2012-11-14 Thread Thomas Koch
Hi Andi,
I still wanted to check the API changes related to 4.0 and could then help
with porting the example code (and/or unit tests). I hope there are more
people interested in helping to port PyLucene (or at least the 'related'
Python code) to the Lucene 4.0 level...

How can we best proceed? I assume you checked in the code that's adapted
already to SVN. Is there a list of code that needs to be ported (and can be
used to distribute tasks)? As said I don't have a an idea of the API changes
yet, so it's hard to estimate the time needed to get used to 4.0 (and fix
the code), but as you did that already maybe you can share your experience
with us. As with any new major release (e.g. Python 3.x) I guess many of us
are afraid to move forward to the new release and change our code base, but
certainly that's just a matter of time ...

Cheers,
Thomas

 -Ursprüngliche Nachricht-
 Von: Andi Vajda [mailto:va...@apache.org]
 Gesendet: Dienstag, 13. November 2012 23:18
 An: Shawn Grant
 Cc: pylucene-dev@lucene.apache.org
 Betreff: Re: Pylucene release
 
 
   Hi Shawn,
 
 On Tue, 13 Nov 2012, Shawn Grant wrote:
 
  Hi Andi, I was just wondering if Pylucene is on its usual schedule to
  release
  4-6 weeks after Lucene.  I didn't see any discussion of it on the
  mailing list or elsewhere.  I'm looking forward to 4.0!
 
 Normally, PyLucene is released a few days after a Lucene release but 4.0
has
 seen so many API changes and removals that all tests and samples need to
 be ported to the new API. Last week-end, I ported a few but lots remain to
 be.
 
 If no one helps, it either means that no one cares enough or that everyone
is
 willing to be patient :-)
 
 The PyLucene trunk svn repository is currently tracking the Lucene Core
4.x
 branch and you're welcome to use it out of svn. In the ten or so unit
tests I
 ported so far, I didn't find any issues with PyLucene proper (or JCC). All
 changes were due to the tests being out of date or using deprecated APIs
 now removed. You might find that PyLucene out-of-trunk is quite usable.
 
 If people want to help with porting PyLucene unit tests, the ones under
its
 'test' directory not yet ported, feel free to ask questions here.
 The gist of it is:
- fix the imports (look at the first few tests for example,
  alphabetically)
- fix the tests to pass by looking at the original Java tests for
changes
  as most of these tests were originally ported from Java Lucene.
 
 Once you're familiar with the new APIs, porting the sample code in samples
 and in LuceneInAction should fairly straightforward. It's just that there
is a lot
 to port.
 
 Andi..




Re: AW: Pylucene release

2012-11-14 Thread Andi Vajda


 Hi Thomas,

On Wed, 14 Nov 2012, Thomas Koch wrote:


I still wanted to check the API changes related to 4.0 and could then help
with porting the example code (and/or unit tests). I hope there are more
people interested in helping to port PyLucene (or at least the 'related'
Python code) to the Lucene 4.0 level...

How can we best proceed?


 1. Pick a test that fails (for example: python test/test_FuzzyQuery.py)
 2. Announce you're working on it on the list (so that only you does)
 3. Fix it
 4. Send in a patch


I assume you checked in the code that's adapted already to SVN.


Yes, all current code is checked in, including fixed or broken tests.


Is there a list of code that needs to be ported (and can be
used to distribute tasks)?


Currently, all tests in test up to test_FilteredQuery.py (alphabetically) 
pass. The test_ICU* ones also pass. You should use these as examples on how 
to fix failing ones.


As said I don't have a an idea of the API changes yet, so it's hard to 
estimate the time needed to get used to 4.0


No time estimated is expected from you.
It's best to proceed by example. Look at the tests that pass already (and 
thus that have been fixed) as examples.

The steps to fix a failing test are as follows:
  - fix import statements first (they're all changed since PyLucene 4.0
no longer uses a flat namespace but strictly follows the original Java
package structure now)
for example:
 from lucene import Document
becomes
 from org.apache.lucene.document import Document
If you don't know where a class is (and the Lucene tree is deeply
nested), find lucene src -name ClassName.java will usually give
you an idea of the package structure to import
  - when it makes sense (most of the time), use PyLuceneTestCase as the
parent test class. This will help with the complexities/boilerplate in
creating a test IndexWriter/Reader/Searcher using a RAMDirectory
  - if the tests still fails, look at the original Java test code for
possible changes in the API or the expected that behaviour that occurred
since the first port. The original Java test file is usually named
TestName.java when the Python test is named test_Name.py

Andi..

(and fix the code), but as you did that already maybe you can share your 
experience with us. As with any new major release (e.g. Python 3.x) I 
guess many of us are afraid to move forward to the new release and change 
our code base, but certainly that's just a matter of time ...


Cheers,
Thomas


-Ursprüngliche Nachricht-
Von: Andi Vajda [mailto:va...@apache.org]
Gesendet: Dienstag, 13. November 2012 23:18
An: Shawn Grant
Cc: pylucene-dev@lucene.apache.org
Betreff: Re: Pylucene release


  Hi Shawn,

On Tue, 13 Nov 2012, Shawn Grant wrote:


Hi Andi, I was just wondering if Pylucene is on its usual schedule to
release
4-6 weeks after Lucene.  I didn't see any discussion of it on the
mailing list or elsewhere.  I'm looking forward to 4.0!


Normally, PyLucene is released a few days after a Lucene release but 4.0

has

seen so many API changes and removals that all tests and samples need to
be ported to the new API. Last week-end, I ported a few but lots remain to
be.

If no one helps, it either means that no one cares enough or that everyone

is

willing to be patient :-)

The PyLucene trunk svn repository is currently tracking the Lucene Core

4.x

branch and you're welcome to use it out of svn. In the ten or so unit

tests I

ported so far, I didn't find any issues with PyLucene proper (or JCC). All
changes were due to the tests being out of date or using deprecated APIs
now removed. You might find that PyLucene out-of-trunk is quite usable.

If people want to help with porting PyLucene unit tests, the ones under

its

'test' directory not yet ported, feel free to ask questions here.
The gist of it is:
   - fix the imports (look at the first few tests for example,
 alphabetically)
   - fix the tests to pass by looking at the original Java tests for

changes

 as most of these tests were originally ported from Java Lucene.

Once you're familiar with the new APIs, porting the sample code in samples
and in LuceneInAction should fairly straightforward. It's just that there

is a lot

to port.

Andi..





Re: Pylucene release

2012-11-13 Thread Andi Vajda


 Hi Shawn,

On Tue, 13 Nov 2012, Shawn Grant wrote:

Hi Andi, I was just wondering if Pylucene is on its usual schedule to release 
4-6 weeks after Lucene.  I didn't see any discussion of it on the mailing 
list or elsewhere.  I'm looking forward to 4.0!


Normally, PyLucene is released a few days after a Lucene release but 4.0 has 
seen so many API changes and removals that all tests and samples need to be 
ported to the new API. Last week-end, I ported a few but lots remain to be.


If no one helps, it either means that no one cares enough or that everyone 
is willing to be patient :-)


The PyLucene trunk svn repository is currently tracking the Lucene Core 4.x 
branch and you're welcome to use it out of svn. In the ten or so unit tests 
I ported so far, I didn't find any issues with PyLucene proper (or JCC). All 
changes were due to the tests being out of date or using deprecated APIs now 
removed. You might find that PyLucene out-of-trunk is quite usable.


If people want to help with porting PyLucene unit tests, the ones under its 
'test' directory not yet ported, feel free to ask questions here.

The gist of it is:
  - fix the imports (look at the first few tests for example,
alphabetically)
  - fix the tests to pass by looking at the original Java tests for changes
as most of these tests were originally ported from Java Lucene.

Once you're familiar with the new APIs, porting the sample code in samples 
and in LuceneInAction should fairly straightforward. It's just that there is 
a lot to port.


Andi..


Re: preparing for a PyLucene release

2009-02-14 Thread Michael McCandless


Andi Vajda wrote:



On Fri, 13 Feb 2009, Michael McCandless wrote:


I think this is a good idea!

There were discussions late last year to do a 2.9 release for  
Lucene, but realistically

it's probably  a ways out still.


Even a 2.4.1 release, just bug fixes ?


Possibly we'll do a 2.4.1 before 2.9, but 2.4.0 has proven stable and I
don't think there've been discussions/requests for a 2.4.1.

Mike


Re: preparing for a PyLucene release

2009-02-13 Thread Michael McCandless


I think this is a good idea!

There were discussions late last year to do a 2.9 release for Lucene,  
but realistically

it's probably  a ways out still.

The Lucene java release checklist might be a good place to check for  
the typical things:


http://wiki.apache.org/lucene-java/ReleaseTodo

Alot of that is specific to Lucene java, but a good amount is generic  
and should be helpful.


I'd recommend keeping notes of what you did and then posting to the  
Wiki.


Once you have a release candidate posted, then you should call for a  
vote; you
should definitely encourage anyone  everyone to test/vote (not just  
PMC members).


Mike

Andi Vajda wrote:



Hi,

Apart from the JIRA project (issue INFRA-1861) which seems stuck,  
the transition for PyLucene from OSAF to Apache is now complete.


Historically, PyLucene has had official releases closely tracking  
Java Lucene's. At the moment, it doesn't appear that Java Lucene is  
preparing for a release anytime soon.


I'd like to propose creating an offical Apache PyLucene source release
now instead of waiting for the next Java Lucene release for the  
following purposes:

 - practising the process (new to me)
 - getting a release out there for people looking for it under the
   not-yet-created 'releases' menu under 'resources'
 - announcing the move to a larger audience such as Freshmeat

From a functionality standpoint, this release would still be built  
around Java Lucene 2.4.0 but would include the latest bug fixes  
since the last release made at OSAF. I propose calling this release  
PyLucene-2.4.0-3.


I read the documents at [1] and [2] and I'm ready to kick-off the  
process.

In particular, I fixed the NOTICE and LICENSE files again.

At what time in that process do I need a vote from the Lucene PMC ?
Is there anything else required besides the vote and what's  
described at [1] and [2] ?


Andi..

[1] http://www.apache.org/dev/mirror-step-by-step.html?Step-By-Step
[2] http://www.apache.org/dev/#releases