Hi, This is really encouraging Ferdy, thank you for explaining a bit about what angle you and Mathijs approach this from. In all honesty I do not use nutchgora in production, my main interest is in working to get the software at a more stable state as I have a vetted interest in getting JUnit integrity. Please see my comments as below.
On Mon, Oct 10, 2011 at 12:53 PM, Ferdy Galema <[email protected]>wrote: > ** > Hi guys, > > (Don't mind me adding my colleague Mathijs to the recipients) > Hi Mathijs > > Sure, I'm willing to dive into it. Although our knowledge of Nutchgora is > still fairly limited, we're committed to using it in our project so any > effort in trying to make a stable branch is both good for the community and > our own experience with Nutch. We have been a long time user of Nutch 1.x > but recently we decided to make a switch to Nutchgora in order to use HBase > as a crawling backend. > OK, then we are not a million miles away from how we are using nutchgora, however I would imagine we have different intended use cases, this is the beauty of this branch. It has great potential for a flexible approach to crawling and storage. > > Anyhow, I will try to run the tests and apply patches and update the Jira > where appropiate. Again, since we are still new to Nutchgora it will take > some time to get familiar with the codebase. > We as I said initially, I submitted a small patch which fixes 3 JUnit tests, nutchgora fails with TestGoraStirage as per here https://issues.apache.org/jira/browse/NUTCH-1135 This said, I am also very interested to hear about your opinions on Juliens suggestions here https://issues.apache.org/jira/browse/NUTCH-896 > > By the way, how come the current branch is not tripping over missing Gora > dependencies anymore? When I do a clean checkout I still have to provide > Gora in my local repository.. Has Jenkins been provided with a Gora build > somehow? > Well as far as I know Gora 1.1.0 was pushed to maven repo therefore the dependencies can be retrieved when we build nutchgora locally with any/ivy. As per Juliens commit below http://svn.apache.org/viewvc/nutch/branches/nutchgora/ivy/ivy.xml?view=markup <dependency org="org.apache.gora" name="gora-core" rev="0.1.1-incubating" conf="*->compile"/> I think it would be best to keep track of how this conversation goes if we could shift it to dev@ so I am going to add this to the thread. Thanks for now Lewis > > Ferdy. > > > On 10/08/2011 08:31 PM, lewis john mcgibbney wrote: > > Hi guys, > > I'm contacting you directly to see if we can knock our heads together and > maybe get NUTCH-1081 and sub-issues dealt with. > > I submitted a trivial patch for three of the tests which fix them, I would > appreciate if you could possibly test and provide some feedback. > > Further to this, the monster that is TestGoraStorage is still giving me > some pain. I have been trying on and off to get the the bottom of the > problem and have documented my early attempts in NUTCH-1135. > > To be honest, I am positive that both of you have a far better working > knowledge of Nutchgora, however none-the-less I am still extremely driven > towards getting all JUnit tests working and a stable CI build before the end > of the year. Although I cannot confirm, I 'think' that the above issues are > the only ones preventing us from doing so! > > Any thoughts? > > Thanks in advance > > -- > *Lewis* > > -- *Lewis*

