[Nutch-dev] calendar year

2006-12-15 Thread Tobias Lane
Three weeks ago I released a simple picture of an system almost ready for release that might be of interest and got more hits from all over the world and DOD that was quite interesting. Although some scripting languages are compiled, most languages are interpreted. Another important point to

[Nutch-dev] (no subject)

2006-12-15 Thread jinling1688888
贵公司财务/经理您好: 本公司实力雄厚,与全国各地众多公司有业务往来,可以提供税务 机关代开发票相关信息咨询。咨询范围:商品普通销售发票、广告发票、 电脑版运输发票、其它服务发票、租赁发票、建筑安装发票等。郑重承 诺:所有票据可以上网验证或到税务局验证! 欢迎咨询! 如贵公司有代开发票方面的问题,欢迎来电或留言咨询.我们会在第一 时间回复您! 联系电话13926545703 联 系 人:林先生 Email [EMAIL PROTECTED]

[Nutch-dev] [jira] Created: (NUTCH-415) Generate should mark selected records in crawlDB

2006-12-15 Thread Andrzej Bialecki (JIRA)
Generate should mark selected records in crawlDB Key: NUTCH-415 URL: http://issues.apache.org/jira/browse/NUTCH-415 Project: Nutch Issue Type: Bug Affects Versions: 0.8.1, 0.8, 0.8.2,

[Nutch-dev] [jira] Created: (NUTCH-416) CrawlDatum status and CrawlDbReducer refactoring

2006-12-15 Thread Andrzej Bialecki (JIRA)
CrawlDatum status and CrawlDbReducer refactoring Key: NUTCH-416 URL: http://issues.apache.org/jira/browse/NUTCH-416 Project: Nutch Issue Type: Improvement Affects Versions: 0.9.0

[Nutch-dev] [jira] Created: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.

2006-12-15 Thread JIRA
After upgrade to hadoop-0.9.1, parsing and indexing doesn't work. - Key: NUTCH-417 URL: http://issues.apache.org/jira/browse/NUTCH-417 Project: Nutch Issue Type: Bug

[Nutch-dev] [jira] Commented: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.

2006-12-15 Thread JIRA
[ http://issues.apache.org/jira/browse/NUTCH-417?page=comments#action_12458794 ] Dogacan Güney commented on NUTCH-417: - Patch for indexer. Instead of using the FileSystem coming from getRecordWriter, use FileSystem.get(job) to get the file

[Nutch-dev] [jira] Updated: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.

2006-12-15 Thread JIRA
[ http://issues.apache.org/jira/browse/NUTCH-417?page=all ] Dogacan Güney updated NUTCH-417: Attachment: index.patch After upgrade to hadoop-0.9.1, parsing and indexing doesn't work. -

[Nutch-dev] [jira] Commented: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.

2006-12-15 Thread Andrzej Bialecki (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-417?page=comments#action_12458800 ] Andrzej Bialecki commented on NUTCH-417: - Regarding the issue in Indexer: a word of warning - you are running with mapred speculative execution set to

[Nutch-dev] [jira] Commented: (NUTCH-417) After upgrade to hadoop-0.9.1, parsing and indexing doesn't work.

2006-12-15 Thread JIRA
[ http://issues.apache.org/jira/browse/NUTCH-417?page=comments#action_12458811 ] Dogacan Güney commented on NUTCH-417: - Setting speculative execution to false also fixes my problem with parser. Thank you for the quick answer. I guess you

[Nutch-dev] [jira] Commented: (NUTCH-415) Generate should mark selected records in crawlDB

2006-12-15 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-415?page=comments#action_12458814 ] Sami Siren commented on NUTCH-415: -- Please also consider the performance implications. If this marking will add signifigant performance overhead then it would be

[Nutch-dev] Warning: set speculative execution to false

2006-12-15 Thread Andrzej Bialecki
Hi all, If you run recent trunk/ (after upgrade to Hadoop 0.9.1) please be sure to turn off mapred speculative execution (config property mapred.speculative.execution) - this feature is known to cause problems in this version of Hadoop, and it also causes problems in Nutch. See NUTCH-417 for

[Nutch-dev] [jira] Commented: (NUTCH-415) Generate should mark selected records in crawlDB

2006-12-15 Thread Andrzej Bialecki (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-415?page=comments#action_12458819 ] Andrzej Bialecki commented on NUTCH-415: - There will be some difference in performance, because crawldb needs to be re-written. However, I believe it's

[Nutch-dev] Java developers are finding all sorts of new ways to collaborate and participate across the world, across boundaries of all kinds.

2006-12-15 Thread Abel R. Spencer
Here you enter the Google license key you obtained earlier, along with some search query parameters. JavaServer Faces:Build your own pluggable, Ajax-enabled components. Factory Class: This class is always com. Note that the list displays all defined locale Bundle. The method

[Nutch-dev] 有发票可代开

2006-12-15 Thread [EMAIL PROTECTED]
您好: 我是广州市科飞贸易有限公司,本公司 现有剩余-咨询.运输.服务.商品.建筑广告 等各种普通发票可以代开只收2% 的税点 。 联 人: 林生 电 话: 020-33622299 (可先验票后付款) 手 机: 13826444669 回 邮:[EMAIL PROTECTED] - Take Surveys. Earn Cash. Influence the Future

[Nutch-dev] 2007年免费赠报开始(都市快报、钱江晚报)

2006-12-15 Thread 淳牌水站----品牌连锁服务
淳牌水站品牌连锁服务 2007年免费赠报开始(都市快报、钱江晚报) 赠报周期:2006年11月1日--2007年1月1日 赠送对象:2005、2006长年饮用淳牌桶装水的单位用户,2007年新增加年饮用桶装水在150桶以上的单位用户。 淳牌饮品成立于2001年,是千岛湖十佳水业代表企业,水源取自千岛湖深层水域,全套设备德国引进,年产高品质桶装天然水200万桶,QS质量、卫生达标连续5年评为优良合格,目前淳牌饮品在杭州日销1500桶,单位用户312家,为答谢新老客户,从2006年11月1日到2007年1月1日,淳牌公司免费赠送全年订报卡1500份,欢迎新老客户来电咨询!

[Nutch-dev] 代理公司

2006-12-15 Thread zxcvbnm456a
深圳佳达贸易有限公司 您好! 冒昧打扰,请多包涵! 本公司是一个固定纳税公司,本着互惠互利的合作原则联合多家公司通过税务 合作对外开余额发票。代开范围:(商品销售、运输、广告、建筑安装、租赁 、服务票等等)。本公司发票以低点数向外代开,欢迎客户来电洽谈。 由本公司代开发票可在网上或税务局验证 商祺: 联系人:陈先生 手机:13510702507 [EMAIL PROTECTED]

[Nutch-dev] 代理公司

2006-12-15 Thread zxcvbnm456a
深圳佳达贸易有限公司 您好! 冒昧打扰,请多包涵! 本公司是一个固定纳税公司,本着互惠互利的合作原则联合多家公司通过税务 合作对外开余额发票。代开范围:(商品销售、运输、广告、建筑安装、租赁 、服务票等等)。本公司发票以低点数向外代开,欢迎客户来电洽谈。 由本公司代开发票可在网上或税务局验证 商祺: 联系人:陈先生 手机:13510702507 [EMAIL PROTECTED]

[Nutch-dev] 代理公司

2006-12-15 Thread zxcvbnm456a
深圳佳达贸易有限公司 您好! 冒昧打扰,请多包涵! 本公司是一个固定纳税公司,本着互惠互利的合作原则联合多家公司通过税务 合作对外开余额发票。代开范围:(商品销售、运输、广告、建筑安装、租赁 、服务票等等)。本公司发票以低点数向外代开,欢迎客户来电洽谈。 由本公司代开发票可在网上或税务局验证 商祺: 联系人:陈先生 手机:13510702507 [EMAIL PROTECTED]