[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 深圳市顺发实业有限公司

2007-01-12 Thread 深圳市顺发有限公司
 尊敬的客户:
 您好!我是深圳市顺发实业有限公司。我公司是经深圳市财政局工商
 注册登记成立的税务代理公司,公司拥有多年专业的税务代理经验公司本着
 互惠互利的原则合理对外代开发票。
代开范围:(商品销售、广告、“电脑版”运输发票、其它服务、租赁、
 建筑安装、餐饮定额发票等!还有可抵扣增值税发票、海关缴款书;)
 欢迎各个新老客户来电与我司合作!
 
贵企业(公司)若有以下情况请来电联系: 
1.公司为一般纳税企业没有优惠政策而想减低税率的;
2.对外销售商品或提供技术服务而本公司暂未领正式发票的;
3.外出采购或公干而服务商没有提供可以报销的发票;
4.公司帐目进项与出项差额过大,需补充差额的。
5. 公司在做帐或进销存方面如需用到的。
公司承诺:受理谨慎 成功收费 为客户节省运作成本提供优质服务!如贵
 公司有代开发票方面的问题,欢迎来电或留言咨询.我们会在第一时间回复您!
  
   
   祝:生意兴隆
   步步高升
  联系人:王先生  (经理)
  手  机:15919464218
  地  址:深圳市罗湖区商业大厦1208
 深 圳 市 顺 发 实 业 有 限 公 司

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 财务专题

2007-01-12 Thread 陈先生
 你好!本公司现有发票可以代开。
 点数普通在.05―1增值3―5个点左右
 数量多点数可以商量。 验证后付款。
 欢迎来电咨询
  电话:135  3060  2800
  联系人:陈先生

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


Re: [Nutch-dev] sort result on different set of terms

2007-01-12 Thread DS jha

Thanks for your reply. I looked at scoring-opic plugin - but looks like it
gets called at parsing/index time and not during search time, correct?

I am classifying content and assigning a category (or categories) at parse
time and storing this information along with category score in the index.
Now, during query time when I display results for a particular category, I
would like to sort result based on this category score. I cannot say sort on
category score field, since a document can be classified against multiple
categories (and so multiple category scores) - At search time, for each
document that matches against say, 'category:A' - i will have to get corr
category score and use that for sorting.  Any thoughts?

Thanks,







On 1/11/07, Dennis Kubes [EMAIL PROTECTED] wrote:


You can write a scoring filter.  That is much easier than changing
NutchSimplicity.  Take a look at the scoring-opic plugin under src.
That will demostrate the default scoring algorithm.

Dennis Kubes

DS jha wrote:
 Hello -

 I would like to score  summarize results on a different set of
 words/conditions than my original search query criteria - is that
possible?

 I was thinking of extending NutchSimplicity class to modify how
documents
 are scored (basically change terms on which documents are scored) and
 plugin
 basicsummarizer for generating hit summary - does that sound like
logical
 extension points?
 .
 Let me know of anyone has solved this type of problem before.

 Thanks,


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] 代办税票

2007-01-12 Thread 代办税票
负责人:您好!

我公司是一家正常纳税的A级企业,在全国大、中、小城市均有。在与任何客户、单位的合作程

序都是按照国家法规进行,如有违规愿承担相关责任,本公司因需扩展市场的竞争性,为客户对

营业税收提供方便灵活、优惠应用;能够对贵公司提供优惠缴纳税款.可以帮客户代开代理发票:

一: 普通国税发票

1:商业销售(可以网上查)  2:货物统一销售   3:工业(企业)销售

二:普通地税发票

1:运输(电脑版运输、货运代理、装卸、联运、海运等)

2:其它服务(广告费、住宿费、会议费、咨询费等)

3:建筑安装   加工修理

4:有海关核销单出售,价格优惠.交接方便

5:其它(租赁,行政事业专用、机动车销售、房地产交易、税务代理)

等专用票据 。以上票据税点均在0.5%~1.5%目前在全国是最低之一

如需敬请致电:

   手  机: 13826592593

   联系人: 刘先生   

   E-mail:[EMAIL PROTECTED]  
  
 



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic

2007-01-12 Thread Sami Siren (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12464347
 ] 

Sami Siren commented on NUTCH-422:
--

Is there a reason for the two takarta-regexp-jars (v 1.2 and 1.3) in source 
package?

 index-extra plugin creates additional fields in the index, based on 
 configurable logic
 --

 Key: NUTCH-422
 URL: https://issues.apache.org/jira/browse/NUTCH-422
 Project: Nutch
  Issue Type: New Feature
  Components: indexer
Affects Versions: 0.8.1
 Environment: All environments
Reporter: Alan Tanaman
 Assigned To: Sami Siren
 Attachments: index-extra-v1.0-bin-java1.5.zip, 
 index-extra-v1.0-source.zip


 Extract from the Readme file:
 A.  Introduction
 The index-extra plugin allows you to configure additional fields that you 
 wish to be added to the index, based on one of the following sources:
   - The parsed text
   - Meta data fields
   - Previously created document-to-be-indexed fields
   - Plain constant string
   - Java expression combining one or more of the above, and resolving to 
 a string
 A regex can also be applied to any of the above, allowing fields to be 
 created based on patterns extracted from the source.
 B.  Installation
 1)  Binaries only:  Copy the 'index-extra' folder within 
 index-extra-v1.0-bin-java1.5.zip to NUTCHDIR/build
 Copy the 'index-extra-conf.xml' file to 
 NUTCHDIR/conf, and configure
 Enable the plugin by updating the nutch-site.xml file
 2)  Source code:Always refer to the Nutch wiki for detailed 
 instructions on building Nutch.  In short:
 Copy the 'index-extra' folder within 
 index-extra-v1.0-source.zip to NUTCHDIR/src/plugin
 Update the build.xml in NUTCHDIR/src/plugin to 
 include plugin
 Update the NUTCHDIR/default.properties file to 
 include plugin
 run ant to build
 Copy the 'index-extra-conf.xml' file to 
 NUTCHDIR/conf, and configure
 Enable the plugin by updating the nutch-site.xml file
 C.  Known Issues
 1)  For this plugin to work correctly on any document field, it is 
 necessary to run the other index filters
 first, so that all basic document fields are generated first.  To do 
 this, configure the indexingfilter.order
 property.  (Please see patch NUTCH-421 to enable indexingfilter.order 
 property. If this patch is not applied,
 the plugin will still work, but will not be able to use document fields 
 created by other index filter plugins.)
 2)  At this stage, field boost can not be used as Nutch scoring overrides 
 the field boost with its own
 document-level boost calculation.  This occurs at the end of 
 org.apache.nutch.indexer.Indexer's reduce method.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic

2007-01-12 Thread Sami Siren (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12464351
 ] 

Sami Siren commented on NUTCH-422:
--

couple of more points:
-source files use tabs for indentation
-headers of files are not consistent, should be updated
-module contains jdom which is already part of nutch, should instead use 
existing one
-no junit tests, not strictly a requirement but a big plus is to have some!

 index-extra plugin creates additional fields in the index, based on 
 configurable logic
 --

 Key: NUTCH-422
 URL: https://issues.apache.org/jira/browse/NUTCH-422
 Project: Nutch
  Issue Type: New Feature
  Components: indexer
Affects Versions: 0.8.1
 Environment: All environments
Reporter: Alan Tanaman
 Assigned To: Sami Siren
 Attachments: index-extra-v1.0-bin-java1.5.zip, 
 index-extra-v1.0-source.zip


 Extract from the Readme file:
 A.  Introduction
 The index-extra plugin allows you to configure additional fields that you 
 wish to be added to the index, based on one of the following sources:
   - The parsed text
   - Meta data fields
   - Previously created document-to-be-indexed fields
   - Plain constant string
   - Java expression combining one or more of the above, and resolving to 
 a string
 A regex can also be applied to any of the above, allowing fields to be 
 created based on patterns extracted from the source.
 B.  Installation
 1)  Binaries only:  Copy the 'index-extra' folder within 
 index-extra-v1.0-bin-java1.5.zip to NUTCHDIR/build
 Copy the 'index-extra-conf.xml' file to 
 NUTCHDIR/conf, and configure
 Enable the plugin by updating the nutch-site.xml file
 2)  Source code:Always refer to the Nutch wiki for detailed 
 instructions on building Nutch.  In short:
 Copy the 'index-extra' folder within 
 index-extra-v1.0-source.zip to NUTCHDIR/src/plugin
 Update the build.xml in NUTCHDIR/src/plugin to 
 include plugin
 Update the NUTCHDIR/default.properties file to 
 include plugin
 run ant to build
 Copy the 'index-extra-conf.xml' file to 
 NUTCHDIR/conf, and configure
 Enable the plugin by updating the nutch-site.xml file
 C.  Known Issues
 1)  For this plugin to work correctly on any document field, it is 
 necessary to run the other index filters
 first, so that all basic document fields are generated first.  To do 
 this, configure the indexingfilter.order
 property.  (Please see patch NUTCH-421 to enable indexingfilter.order 
 property. If this patch is not applied,
 the plugin will still work, but will not be able to use document fields 
 created by other index filter plugins.)
 2)  At this stage, field boost can not be used as Nutch scoring overrides 
 the field boost with its own
 document-level boost calculation.  This occurs at the end of 
 org.apache.nutch.indexer.Indexer's reduce method.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


Re: [Nutch-dev] sort result on different set of terms

2007-01-12 Thread Dennis Kubes


DS jha wrote:
 Thanks for your reply. I looked at scoring-opic plugin - but looks like it
 gets called at parsing/index time and not during search time, correct?

that is correct
 
 I am classifying content and assigning a category (or categories) at parse
 time and storing this information along with category score in the index.
 Now, during query time when I display results for a particular category, I
 would like to sort result based on this category score. I cannot say 
 sort on
 category score field, since a document can be classified against multiple
 categories (and so multiple category scores) - 

are you trying to filter out categories or do you actually have a 
different score for each category that content gets indexed under.  You 
can sort a query but only on a single field (I think).

At search time, for each
 document that matches against say, 'category:A' - i will have to get corr
 category score and use that for sorting.  Any thoughts?

You could populate the sort field dynamically but still only a single 
field.  Are you trying to sort on multiple category fields?

Dennis Kubes
 
 Thanks,
 
 
 
 
 
 
 
 On 1/11/07, Dennis Kubes [EMAIL PROTECTED] wrote:

 You can write a scoring filter.  That is much easier than changing
 NutchSimplicity.  Take a look at the scoring-opic plugin under src.
 That will demostrate the default scoring algorithm.

 Dennis Kubes

 DS jha wrote:
  Hello -
 
  I would like to score  summarize results on a different set of
  words/conditions than my original search query criteria - is that
 possible?
 
  I was thinking of extending NutchSimplicity class to modify how
 documents
  are scored (basically change terms on which documents are scored) and
  plugin
  basicsummarizer for generating hit summary - does that sound like
 logical
  extension points?
  .
  Let me know of anyone has solved this type of problem before.
 
  Thanks,
 

 

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] [jira] Resolved: (NUTCH-428) NullPointerException

2007-01-12 Thread Sami Siren (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sami Siren resolved NUTCH-428.
--

   Resolution: Fixed
Fix Version/s: 0.9.0

Most propably you dont have agent name configured in nutch-site.xml. I changed 
this situation to emit RuntimeException in trunk instead so it's easier to 
diagnose.

 NullPointerException
 

 Key: NUTCH-428
 URL: https://issues.apache.org/jira/browse/NUTCH-428
 Project: Nutch
  Issue Type: Bug
  Components: fetcher
Affects Versions: 0.8.1
 Environment: Windows XP
Reporter: Piyush
 Fix For: 0.9.0


 I am using the NUTCH.Bat provided in one one of the thread. (i am not using 
 CYGWIN) Whenever I try to fetch the Item, I am getting fetching failed 
 nullpointerexception 
 I have a URL Directory. which has urls.txt file. there is only one entry in 
 the file which is http://www.winzip.com/land_about.htm. 
 I have updated the crawl-urlfilter.txt with +^http://www.winzip.com/. 
 Is there any other settings I am missing?? Any help is greatly appreciated. 
 The command i used to  start the crawl is 
 nutch  crawl urls -dir crawlResults -depth 1
 Here is my log 
 crawl started in: crawlResult
 rootUrlDir = urls
 threads = 10
 depth = 1
 Injector: starting
 Injector: crawlDb: crawlResult/crawldb
 Injector: urlDir: urls
 Injector: Converting injected urls to crawl db entries.
 Injector: Merging injected urls into crawl db.
 Injector: done
 Generator: starting
 Generator: segment: crawlResult/segments/20070110085314
 Generator: Selecting best-scoring urls due for fetch.
 Generator: Partitioning selected urls by host, for politeness.
 Generator: done.
 Fetcher: starting
 Fetcher: segment: crawlResult/segments/20070110085314
 Fetcher: threads: 10
 fetching http://www.winzip.com/land_about.htm
 fetch of http://www.winzip.com/land_about.htm failed with: 
 java.lang.NullPointerException
 Fetcher: done
 CrawlDb update: starting
 CrawlDb update: db: crawlResult/crawldb
 CrawlDb update: segment: crawlResult/segments/20070110085314
 CrawlDb update: Merging segment data into db.
 CrawlDb update: done
 LinkDb: starting
 LinkDb: linkdb: crawlResult/linkdb
 LinkDb: adding segment: crawlResult/segments/20070110085314
 LinkDb: done
 Indexer: starting
 Indexer: linkdb: crawlResult/linkdb
 Indexer: adding segment: crawlResult/segments/20070110085314
 Optimizing index.
 Indexer: done
 Dedup: starting
 Dedup: adding indexes in: crawlResult/indexes
 Dedup: done
 Adding crawlResult/indexes/part-0
 crawl finished: crawlResult
  

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers


[Nutch-dev] ***性福中国成人用品商城欢迎您有空来坐坐***

2007-01-12 Thread 朋友
   您好!性福中国成人用品商城好消息:即日起,购买本商城任何一款产品,都有惊喜相送!欢迎选购http://www.dzcnx.com 
一次购物免费成为我们的会员,终身享受8折优惠!
   产品有:男女器具,名牌避孕套,护理洗液,催情助兴,壮阳延时,丰胸美乳,高档情趣内衣,进口缩阴器,MAXMAN增大产品(短时间内有效增大6-8CM)...
我们会给您最贴心的价格,最优质的服务,最保密的配送,成就你最贴身的情人!
客服QQ:男530885426   女605081148 手机:13188712690   E-mail:[EMAIL PROTECTED]  
网址:http://www.dzcnx.com
  如有打扰,致以万分的歉意!
  性福中国商城,给男人激情,让女人幸福!  

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers