Ferdy Galema created NUTCH-1357:
-----------------------------------

             Summary: All gora mapreduce functionality should go through StorageUtils
                 Key: NUTCH-1357
                 URL: https://issues.apache.org/jira/browse/NUTCH-1357
             Project: Nutch
          Issue Type: Improvement
            Reporter: Ferdy Galema
             Fix For: nutchgora

I am trying to make the concept of crawlId work for ALL Nutch jobs. The biggest reason it does not work as expected seems to be the various ways Gora mapreduce is used in Nutch. Some jobs use StorageUtils, some use GoraMapper/GoraReducer, and some even use GoraInputFormat/GoraOutputFormat directly. But the only place where a crawlId is translated into a schema name is StorageUtils!

I am currently converting all Gora* mapreduce initialization code into StorageUtils calls.
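
To illustrate why a single entry point matters, here is a minimal sketch of the idea, not the actual Nutch or Gora code. Every name in it (CrawlSchemaSketch, schemaFor, initJob, the two configuration keys) is hypothetical, the configuration is a plain Map rather than a Hadoop Configuration, and the "<crawlId>_<base>" naming rule is an assumption made for the example. The point is only that a job which bypasses the shared initializer never gets the crawlId-to-schema translation applied.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: none of these names are real Nutch or Gora APIs.
final class CrawlSchemaSketch {
    // Assumed configuration keys, chosen for illustration only.
    static final String CRAWL_ID_KEY = "sketch.crawl.id";
    static final String SCHEMA_KEY = "sketch.schema.name";

    // Translates a crawlId into a schema name.
    // Assumed rule: "<crawlId>_<baseSchema>", or just the base schema if no crawlId is set.
    static String schemaFor(String crawlId, String baseSchema) {
        if (crawlId == null || crawlId.isEmpty()) {
            return baseSchema;
        }
        return crawlId + "_" + baseSchema;
    }

    // The single entry point every job would call instead of
    // configuring its input/output store directly.
    static Map<String, String> initJob(Map<String, String> conf, String baseSchema) {
        Map<String, String> jobConf = new HashMap<>(conf);
        jobConf.put(SCHEMA_KEY, schemaFor(conf.get(CRAWL_ID_KEY), baseSchema));
        return jobConf;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put(CRAWL_ID_KEY, "crawl1");
        // Prints the schema name resolved for this crawl.
        System.out.println(initJob(conf, "webpage").get(SCHEMA_KEY));
    }
}
```

In this sketch, a job that set up its store without calling initJob would keep using the base schema regardless of the crawlId, which is the kind of inconsistency described above.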