[jira] Updated: (NUTCH-952) fix outlink which started with '?' in html parser

2011-01-07 Thread Stondet (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stondet updated NUTCH-952:
--

Affects Version/s: (was: 1.3)
   2.0

 fix outlink which started with '?' in html parser
 -

 Key: NUTCH-952
 URL: https://issues.apache.org/jira/browse/NUTCH-952
 Project: Nutch
  Issue Type: Bug
  Components: parser
Affects Versions: 2.0
Reporter: Stondet
 Attachments: NUTCH-952-v2.patch


 a href=?w=ruby%20on%20railsty=csd=0 ruby on rails/a(a snippet from 
 http://bbs.soso.com/search?ty=csd=0w=rails)
 outlink parsed from above link: 
 http://bbs.soso.com/?w=ruby%20on%20railsty=csd=0
 but expected is http://bbs.soso.com/search?w=ruby%20on%20railsty=csd=0

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (NUTCH-952) fix outlink which started with '?' in html parser

2011-01-05 Thread Stondet (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stondet updated NUTCH-952:
--

Attachment: NUTCH-952.patch

fix outlink which started with '?'

 fix outlink which started with '?' in html parser
 -

 Key: NUTCH-952
 URL: https://issues.apache.org/jira/browse/NUTCH-952
 Project: Nutch
  Issue Type: Bug
  Components: parser
Reporter: Stondet
 Attachments: NUTCH-952.patch


 a href=?w=ruby%20on%20railsty=csd=0 ruby on rails/a(a snippet from 
 http://bbs.soso.com/search?ty=csd=0w=rails)
 outlink parsed from above link: 
 http://bbs.soso.com/?w=ruby%20on%20railsty=csd=0
 but expected is http://bbs.soso.com/search?w=ruby%20on%20railsty=csd=0

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.