nutch-developers
Thread
Date
Earlier messages
Later messages
Messages by Date
2007/07/20
[Nutch-dev] [jira] Commented: (NUTCH-522) Use URLValidator in the Injector
JIRA
2007/07/19
[Nutch-dev] 您们好
gfegtuytujtedsarfg
2007/07/19
[Nutch-dev] [jira] Updated: (NUTCH-522) Use URLValidator in the Injector
Emmanuel Joke (JIRA)
2007/07/19
Re: [Nutch-dev] Looking to fix relative path issue in linkdb
Briggs
2007/07/19
Re: [Nutch-dev] Looking to fix relative path issue in linkdb
Robert Young
2007/07/19
[Nutch-dev] 德国少女情欲水★美臀夹阴2代
爱欲高潮
2007/07/19
[Nutch-dev] [jira] Commented: (NUTCH-522) Use URLValidator in the Injector
JIRA
2007/07/19
Re: [Nutch-dev] OOM error during parsing with nekohtml
Shailendra Mudgal
2007/07/19
Re: [Nutch-dev] Looking to fix relative path issue in linkdb
Briggs
2007/07/19
Re: [Nutch-dev] Looking to fix relative path issue in linkdb
Robert Young
2007/07/19
[Nutch-dev] [jira] Updated: (NUTCH-522) Use URLValidator in the Injector
Emmanuel Joke (JIRA)
2007/07/19
[Nutch-dev] [jira] Created: (NUTCH-522) Use URLValidator in the Injector
Emmanuel Joke (JIRA)
2007/07/19
[Nutch-dev] [jira] Commented: (NUTCH-521) Modified injector to allow newly injected CrawlDatum to overwrite original
JIRA
2007/07/19
Re: [Nutch-dev] Looking to fix relative path issue in linkdb
Andrzej Bialecki
2007/07/19
[Nutch-dev] Looking to fix relative path issue in linkdb
Robert Young
2007/07/19
[Nutch-dev] [jira] Updated: (NUTCH-521) Modified injector to allow newly injected CrawlDatum to overwrite original
Rob Young (JIRA)
2007/07/19
[Nutch-dev] [jira] Created: (NUTCH-521) Modified injector to allow newly injected CrawlDatum to overwrite original
Rob Young (JIRA)
2007/07/19
[Nutch-dev] [jira] Updated: (NUTCH-520) A common infrastructure for different index backends
JIRA
2007/07/19
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Andrzej Bialecki (JIRA)
2007/07/19
[Nutch-dev] [jira] Created: (NUTCH-520) A common infrastructure for different index backends
JIRA
2007/07/19
[Nutch-dev] resending this query on running nutch on nfs
prem kumar
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
JIRA
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
JIRA
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-517) build encoding should be UTF-8
Hudson (JIRA)
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Hudson (JIRA)
2007/07/18
[Nutch-dev] ready for the first assignment
Tsengtan A Shuy
2007/07/18
[Nutch-dev] [jira] Created: (NUTCH-519) prased incorrectly
Chris Hane (JIRA)
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Andrzej Bialecki (JIRA)
2007/07/18
[Nutch-dev] 送票上门!
sp
2007/07/18
[Nutch-dev] salty tripod
a.wurm
2007/07/18
Re: [Nutch-dev] no nutch script file under bin directory
Kai_testing Middleton
2007/07/18
Re: [Nutch-dev] no nutch script file under bin directory
Tsengtan A Shuy
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
JIRA
2007/07/18
Re: [Nutch-dev] no nutch script file under bin directory
Kai_testing Middleton
2007/07/18
[Nutch-dev] [jira] Reopened: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Andrzej Bialecki (JIRA)
2007/07/18
[Nutch-dev] [jira] Closed: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
JIRA
2007/07/18
[Nutch-dev] [jira] Resolved: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
JIRA
2007/07/18
[Nutch-dev] [jira] Closed: (NUTCH-517) build encoding should be UTF-8
JIRA
2007/07/18
[Nutch-dev] [jira] Resolved: (NUTCH-517) build encoding should be UTF-8
JIRA
2007/07/18
Re: [Nutch-dev] no nutch script file under bin directory
Tsengtan A Shuy
2007/07/18
Re: [Nutch-dev] no nutch script file under bin directory
Kai_testing Middleton
2007/07/18
[Nutch-dev] 企业行政文秘职业培训
126培训网
2007/07/18
[Nutch-dev] [jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Updated: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
2007/07/18
[Nutch-dev] [jira] Created: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Updated: (NUTCH-517) build encoding should be UTF-8
Enis Soztutar (JIRA)
2007/07/18
[Nutch-dev] [jira] Created: (NUTCH-517) build encoding should be UTF-8
Enis Soztutar (JIRA)
2007/07/17
[Nutch-dev] [jira] Updated: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
Emmanuel Joke (JIRA)
2007/07/17
[Nutch-dev] [jira] Commented: (NUTCH-515) Next fetch time is set incorrectly
Hudson (JIRA)
2007/07/17
[Nutch-dev] [jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop
Hudson (JIRA)
2007/07/17
Re: [Nutch-dev] no nutch script file under bin directory
Tsengtan A Shuy
2007/07/17
[Nutch-dev] 德国少女情欲水★美臀夹阴2代
性爱专家
2007/07/17
Re: [Nutch-dev] no nutch script file under bin directory
Tsengtan A Shuy
2007/07/17
[Nutch-dev] no nutch script file under bin directory
Tsengtan A Shuy
2007/07/17
[Nutch-dev] [jira] Closed: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
2007/07/17
[Nutch-dev] [jira] Resolved: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
2007/07/17
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
Andrzej Bialecki (JIRA)
2007/07/17
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
2007/07/17
[Nutch-dev] [jira] Updated: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
2007/07/17
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
2007/07/17
[Nutch-dev] [jira] Created: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
Emmanuel Joke (JIRA)
2007/07/16
Re: [Nutch-dev] OOM error during parsing with nekohtml
Doğacan Güney
2007/07/16
[Nutch-dev] [jira] Resolved: (NUTCH-515) Next fetch time is set incorrectly
JIRA
2007/07/16
[Nutch-dev] (no subject)
张世新
2007/07/16
Re: [Nutch-dev] OOM error during parsing with nekohtml
Shailendra Mudgal
2007/07/16
[Nutch-dev] [jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
2007/07/16
[Nutch-dev] [jira] Commented: (NUTCH-515) Next fetch time is set incorrectly
JIRA
2007/07/16
[Nutch-dev] [jira] Commented: (NUTCH-515) Next fetch time is set incorrectly
Andrzej Bialecki (JIRA)
2007/07/16
Re: [Nutch-dev] OOM error during parsing with nekohtml
Tsengtan A Shuy
2007/07/16
Re: [Nutch-dev] OOM error during parsing with nekohtml
Kai_testing Middleton
2007/07/16
[Nutch-dev] [jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring
JIRA
2007/07/16
[Nutch-dev] [jira] Updated: (NUTCH-515) Next fetch time is set incorrectly
JIRA
2007/07/16
[Nutch-dev] [jira] Created: (NUTCH-515) Next fetch time is set incorrectly
JIRA
2007/07/16
Re: [Nutch-dev] OOM error during parsing with nekohtml
Tsengtan A Shuy
2007/07/16
[Nutch-dev] OOM error during parsing with nekohtml
Shailendra Mudgal
2007/07/15
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #151
hudson
2007/07/15
[Nutch-dev] 程思柔
程思柔
2007/07/15
[Nutch-dev] 发票联系13824335239
机会
2007/07/15
[Nutch-dev] Nude Britney poster allowed in subway
nude britney
2007/07/15
[Nutch-dev] 送票上门!
sp
2007/07/15
[Nutch-dev] 上海市财税咨询代理
林先生
2007/07/14
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #150
hudson
2007/07/14
Re: [Nutch-dev] inject command fail on whole-web run
Tsengtan A Shuy
2007/07/14
[Nutch-dev] inject command fail on whole-web run
Tsengtan A Shuy
2007/07/14
[Nutch-dev] 欢迎收看!`
江生
2007/07/14
[Nutch-dev] [jira] Closed: (NUTCH-471) Fix synchronization in NutchBean creation
Dennis Kubes (JIRA)
2007/07/14
[Nutch-dev] [jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation
Dennis Kubes (JIRA)
2007/07/14
[Nutch-dev] [jira] Updated: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
2007/07/14
[Nutch-dev] [jira] Created: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
2007/07/14
[Nutch-dev] [jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation
JIRA
2007/07/13
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #149
hudson
2007/07/13
[Nutch-dev] [jira] Reopened: (NUTCH-471) Fix synchronization in NutchBean creation
Dennis Kubes (JIRA)
2007/07/13
[Nutch-dev] [jira] Closed: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
2007/07/13
[Nutch-dev] [jira] Resolved: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
2007/07/13
[Nutch-dev] 程思柔
程思柔
2007/07/13
[Nutch-dev] running nutch of nfs
prem kumar
2007/07/13
[Nutch-dev] NUTCH CONSULTANT NEEDED
Luca Rondanini
2007/07/13
[Nutch-dev] [jira] Commented: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
2007/07/13
[Nutch-dev] [jira] Closed: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/13
[Nutch-dev] 领导查收
刘先生
2007/07/12
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/12
[Nutch-dev] [jira] Created: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
2007/07/12
[Nutch-dev] [jira] Closed: (NUTCH-512) Search on date range
Andrzej Bialecki (JIRA)
2007/07/12
[Nutch-dev] [jira] Closed: (NUTCH-511) Recrawling
Andrzej Bialecki (JIRA)
2007/07/12
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Andrzej Bialecki (JIRA)
2007/07/12
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/12
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/12
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Espen Amble Kolstad (JIRA)
2007/07/12
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/12
[Nutch-dev] [jira] Created: (NUTCH-512) Search on date range
anuradha (JIRA)
2007/07/12
[Nutch-dev] [jira] Created: (NUTCH-511) Recrawling
anuradha (JIRA)
2007/07/12
[Nutch-dev] [jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
2007/07/11
[Nutch-dev] how can i fetch a site manual
Cuongnhc
2007/07/11
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Hudson (JIRA)
2007/07/11
[Nutch-dev] [jira] Commented: (NUTCH-510) IndexMerger delete working dir
Hudson (JIRA)
2007/07/11
[Nutch-dev] 商祺
sz001
2007/07/11
Re: [Nutch-dev] OPIC scoring differences
Andrzej Bialecki
2007/07/11
[Nutch-dev] 送票上门!
sp
2007/07/11
[Nutch-dev] [jira] Closed: (NUTCH-510) IndexMerger delete working dir
JIRA
2007/07/11
[Nutch-dev] [jira] Resolved: (NUTCH-510) IndexMerger delete working dir
JIRA
2007/07/11
Re: [Nutch-dev] OPIC scoring differences
Doğacan Güney
2007/07/11
[Nutch-dev] [jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
2007/07/11
[Nutch-dev] [jira] Resolved: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/10
Re: [Nutch-dev] Nutch nightly build and NUTCH-505 draft patch
Doğacan Güney
2007/07/10
[Nutch-dev] [jira] Issue Comment Edited: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/10
[Nutch-dev] [jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring
Enis Soztutar (JIRA)
2007/07/10
[Nutch-dev] [jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring
Enis Soztutar (JIRA)
2007/07/10
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/10
[Nutch-dev] [jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring
Enis Soztutar (JIRA)
2007/07/10
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Andrzej Bialecki (JIRA)
2007/07/10
Re: [Nutch-dev] Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u, \" with {:method=>:get}"
Andrzej Bialecki
2007/07/10
[Nutch-dev] Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u, \" with {:method=>:get}"
Erik Hatcher
2007/07/10
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
2007/07/10
Re: [Nutch-dev] Not renewing CrawlDatum on Inject
Robert Young
2007/07/10
[Nutch-dev] [jira] Commented: (NUTCH-439) Top Level Domains Indexing / Scoring
Andrzej Bialecki (JIRA)
2007/07/10
Re: [Nutch-dev] Not renewing CrawlDatum on Inject
Robert Young
2007/07/10
[Nutch-dev] [jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring
Enis Soztutar (JIRA)
2007/07/09
[Nutch-dev] 业务代理
合作事宜
2007/07/09
[Nutch-dev] [jira] Commented: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
Hudson (JIRA)
2007/07/09
[Nutch-dev] [jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists
Hudson (JIRA)
2007/07/09
Re: [Nutch-dev] Not renewing CrawlDatum on Inject
Andrzej Bialecki
2007/07/09
[Nutch-dev] Not renewing CrawlDatum on Inject
Robert Young
2007/07/09
Re: [Nutch-dev] URL Injection with another source than text files
Epo Jemba
2007/07/09
[Nutch-dev] [jira] Commented: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker
Enis Soztutar (JIRA)
2007/07/09
[Nutch-dev] [jira] Issue Comment Edited: (NUTCH-510) IndexMerger delete working dir
Enis Soztutar (JIRA)
2007/07/09
Re: [Nutch-dev] OPIC scoring differences
Andrzej Bialecki
2007/07/09
[Nutch-dev] spam detect
anton
2007/07/08
[Nutch-dev] [jira] Updated: (NUTCH-510) IndexMerger delete working dir
Enis Soztutar (JIRA)
2007/07/08
[Nutch-dev] [jira] Resolved: (NUTCH-503) Generator exits incorrectly for small fetchlists
JIRA
2007/07/08
[Nutch-dev] [jira] Created: (NUTCH-510) IndexMerger delete working dir
Enis Soztutar (JIRA)
2007/07/08
[Nutch-dev] [jira] Closed: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
JIRA
2007/07/08
[Nutch-dev] [jira] Resolved: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
JIRA
2007/07/08
[Nutch-dev] [jira] Closed: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
2007/07/08
[Nutch-dev] [jira] Commented: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
2007/07/08
[Nutch-dev] [jira] Commented: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
JIRA
2007/07/08
Re: [Nutch-dev] OPIC scoring differences
Doğacan Güney
2007/07/08
[Nutch-dev] 陈先生
li
2007/07/08
[Nutch-dev] OPIC scoring differences
Carl Cerecke
2007/07/08
[Nutch-dev] [jira] Updated: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
2007/07/08
[Nutch-dev] [jira] Created: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
2007/07/07
[Nutch-dev] mozdex as a backend search engine.
Tsengtan A Shuy
2007/07/07
[Nutch-dev] [jira] Created: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker
Emmanuel Joke (JIRA)
2007/07/07
[Nutch-dev] [jira] Created: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
Emmanuel Joke (JIRA)
2007/07/06
[Nutch-dev] 中国692位顶尖训练师/顾问详细联系方式
asos
2007/07/06
Re: [Nutch-dev] Plans on releasing another bug fix release?
Briggs
2007/07/05
Re: [Nutch-dev] Plans on releasing another bug fix release?
Ian Holsman
2007/07/05
[Nutch-dev] 送票上门!
sp
2007/07/05
[Nutch-dev] 您好!合作共赢!
ysl
2007/07/05
Re: [Nutch-dev] Plans on releasing another bug fix release?
rubdabadub
2007/07/04
[Nutch-dev] 网络赚钱新创意!
tony
2007/07/04
[Nutch-dev] 特别推介!:提供国际最流行的虚拟办公模式:月支出100元起,你将拥有设在广州的公司、分支机构、办公室、联络处。以最小的支出而达到你
纳 福
2007/07/04
[Nutch-dev] Faustino, Our product updates a body and a soul.
Faustino Price222189
2007/07/04
Re: [Nutch-dev] Plans on releasing another bug fix release?
Briggs
2007/07/04
[Nutch-dev] URL Injection with another source than text files
Epo Jemba
2007/07/04
[Nutch-dev] 你有事吗,
代办税票
2007/07/04
[Nutch-dev] 你有事吗,
代办税票
2007/07/04
[Nutch-dev] 你有事吗,
代办税票
2007/07/04
Re: [Nutch-dev] Plans on releasing another bug fix release?
Andrzej Bialecki
2007/07/04
[Nutch-dev] 你有事吗,
代办税票
2007/07/04
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #140
hudson
2007/07/04
Re: [Nutch-dev] Plans on releasing another bug fix release?
Nuther
2007/07/04
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #139
hudson
2007/07/03
Re: [Nutch-dev] Plans on releasing another bug fix release?
Andrzej Bialecki
2007/07/03
Re: [Nutch-dev] Plans on releasing another bug fix release?
Doug Cutting
2007/07/03
Re: [Nutch-dev] Plans on releasing another bug fix release?
Andrzej Bialecki
2007/07/03
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #138
hudson
2007/07/03
[Nutch-dev] Patch to skip hidden plugin directories
David Fuhry
2007/07/03
[Nutch-dev] 中国692位顶尖训练师/顾问详细联系方式
asos
2007/07/03
[Nutch-dev] 经营业务。
hu
2007/07/03
[Nutch-dev] 你有事找我吗?
代办税票
2007/07/03
[Nutch-dev] Plans on releasing another bug fix release?
Briggs
2007/07/03
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #137
hudson
2007/07/02
[Nutch-dev] (no subject)
詹小姐
2007/07/02
[Nutch-dev] 你有事找我吗?
代办税票
2007/07/02
[Nutch-dev] 你有事找我吗?
代办税票
2007/07/02
[Nutch-dev] Nutch nightly build and NUTCH-505 draft patch
Kai_testing Middleton
Earlier messages
Later messages