[Nutch-dev] [jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516621 ] Doğacan Güney commented on NUTCH-530: - Yeah, you are right. +1 from me. Add a combiner to improve performance

[Nutch-dev] [jira] Commented: (NUTCH-532) CrawlDbMerger: wrong computation of last fetch time

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516623 ] Doğacan Güney commented on NUTCH-532: - res.getFetchTime() - Math.round(res.getFetchInterval() * 1000d); always

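[Nutch-dev] Letter to company management

2007-07-31 Thread Mr. Liu
To: Finance Department. Shenzhen Jinfeng Tax Agency Co., Ltd. (深圳市金峰税务代理有限公司) was founded in 1993. After more than ten years of steady growth, it now has branches in large and medium-sized cities nationwide and close business ties with hundreds of companies across the country. It offers to issue VAT invoices and customs VAT invoices (national/local tax) at a discount; invoice types covered: general merchandise sales, transport, advertising, catering, construction and installation, service consulting, and so on. Since its founding the company has held to "integrity" as its core principle, built a solid company image, and followed the business philosophy of "work together once, become friends for good". We can provide the invoice first, and your company or factory pays after receiving and verifying it,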

[Nutch-dev] [jira] Updated: (NUTCH-534) SegmentMerger: add -normalize option

2007-07-31 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emmanuel Joke updated NUTCH-534: Attachment: NUTCH-534.patch Patch provided SegmentMerger: add -normalize option

[Nutch-dev] [jira] Commented: (NUTCH-530) Add a combiner to improve performance on updatedb

2007-07-31 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516675 ] Emmanuel Joke commented on NUTCH-530: - Actually I don't re-use CrawlDbReducer, I've defined a new class as

[Nutch-dev] [jira] Resolved: (NUTCH-533) LinkDbMerger: url normalized is not updated in the key and inlinks list

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney resolved NUTCH-533. - Resolution: Fixed Fixed in rev. 561306. LinkDbMerger: url normalized is not updated in the key

[Nutch-dev] [jira] Closed: (NUTCH-533) LinkDbMerger: url normalized is not updated in the key and inlinks list

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney closed NUTCH-533. --- Resolved and committed. LinkDbMerger: url normalized is not updated in the key and inlinks list

[Nutch-dev] [jira] Updated: (NUTCH-442) Integrate Solr/Nutch

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-442: Attachment: RFC_multiple_search_backends.patch Here is my (very large - sorry) patch for this

[Nutch-dev] [jira] Closed: (NUTCH-520) A common infrastructure for different index backends

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney closed NUTCH-520. --- Resolution: Duplicate I am closing this as duplicate since NUTCH-442 (which has a patch that includes

[Nutch-dev] [jira] Updated: (NUTCH-442) Integrate Solr/Nutch

2007-07-31 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-442: Attachment: schema.xml This simple schema can be used to test solr integration. Integrate

[Nutch-dev] Getting together

2007-07-31 Thread 琴海
Meet more friends to travel, sing karaoke, chat, play sports, and so on together; in short, as long as we can get together, I think everyone will have a great time http://g1.flytf.com/?p=tuerk

[Nutch-dev] [jira] Commented: (NUTCH-533) LinkDbMerger: url normalized is not updated in the key and inlinks list

2007-07-31 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516870 ] Hudson commented on NUTCH-533: -- Integrated in Nutch-Nightly #167 (See