nutch-developers
Thread
Date
Earlier messages
Later messages
Messages by Thread
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
Andrzej Bialecki (JIRA)
[Nutch-dev] [jira] Updated: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
Emmanuel Joke (JIRA)
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
[Nutch-dev] [jira] Resolved: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
[Nutch-dev] [jira] Closed: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE
Hudson (JIRA)
[Nutch-dev] [jira] Created: (NUTCH-515) Next fetch time is set incorrectly
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-515) Next fetch time is set incorrectly
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-515) Next fetch time is set incorrectly
Andrzej Bialecki (JIRA)
[Nutch-dev] [jira] Commented: (NUTCH-515) Next fetch time is set incorrectly
JIRA
[Nutch-dev] [jira] Resolved: (NUTCH-515) Next fetch time is set incorrectly
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-515) Next fetch time is set incorrectly
Hudson (JIRA)
[Nutch-dev] OOM error during parsing with nekohtml
Shailendra Mudgal
Re: [Nutch-dev] OOM error during parsing with nekohtml
Tsengtan A Shuy
Re: [Nutch-dev] OOM error during parsing with nekohtml
Kai_testing Middleton
Re: [Nutch-dev] OOM error during parsing with nekohtml
Tsengtan A Shuy
Re: [Nutch-dev] OOM error during parsing with nekohtml
Shailendra Mudgal
Re: [Nutch-dev] OOM error during parsing with nekohtml
Doğacan Güney
Re: [Nutch-dev] OOM error during parsing with nekohtml
Shailendra Mudgal
[Nutch-dev] no nutch script file under bin directory
Tsengtan A Shuy
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #151
hudson
[Nutch-dev] 发票联系13824335239
机会
[Nutch-dev] Nude Britney poster allowed in subway
nude britney
[Nutch-dev] 上海市财税咨询代理
林先生
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #150
hudson
[Nutch-dev] 欢迎收看!`
江生
[Nutch-dev] [jira] Created: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
Andrzej Bialecki (JIRA)
[Nutch-dev] [jira] Closed: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
[Nutch-dev] [jira] Resolved: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS
Hudson (JIRA)
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #149
hudson
[Nutch-dev] 程思柔
程思柔
[Nutch-dev] 程思柔
程思柔
[Nutch-dev] 程思柔
程思柔
[Nutch-dev] running nutch of nfs
prem kumar
[Nutch-dev] NUTCH CONSULTANT NEEDED
Luca Rondanini
[Nutch-dev] 领导查收
刘先生
[Nutch-dev] [jira] Created: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
[Nutch-dev] [jira] Resolved: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
[Nutch-dev] [jira] Closed: (NUTCH-513) suffix-urlfilter.txt does not have a template
JIRA
[Nutch-dev] [jira] Created: (NUTCH-512) Search on date range
anuradha (JIRA)
[Nutch-dev] [jira] Closed: (NUTCH-512) Search on date range
Andrzej Bialecki (JIRA)
[Nutch-dev] [jira] Created: (NUTCH-511) Recrawling
anuradha (JIRA)
[Nutch-dev] [jira] Closed: (NUTCH-511) Recrawling
Andrzej Bialecki (JIRA)
[Nutch-dev] how can i fetch a site manual
Cuongnhc
[Nutch-dev] 商祺
sz001
[Nutch-dev] Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u, \" with {:method=>:get}"
Erik Hatcher
Re: [Nutch-dev] Fwd: [Collex] application#index (ActionController::RoutingError) "no route found to match \"/nines/ escape(document.title) u, \" with {:method=>:get}"
Andrzej Bialecki
[Nutch-dev] Not renewing CrawlDatum on Inject
Robert Young
Re: [Nutch-dev] Not renewing CrawlDatum on Inject
Andrzej Bialecki
Re: [Nutch-dev] Not renewing CrawlDatum on Inject
Robert Young
Re: [Nutch-dev] Not renewing CrawlDatum on Inject
Robert Young
[Nutch-dev] [jira] Created: (NUTCH-510) IndexMerger delete working dir
Enis Soztutar (JIRA)
[Nutch-dev] [jira] Updated: (NUTCH-510) IndexMerger delete working dir
Enis Soztutar (JIRA)
[Nutch-dev] [jira] Issue Comment Edited: (NUTCH-510) IndexMerger delete working dir
Enis Soztutar (JIRA)
[Nutch-dev] [jira] Resolved: (NUTCH-510) IndexMerger delete working dir
JIRA
[Nutch-dev] [jira] Closed: (NUTCH-510) IndexMerger delete working dir
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-510) IndexMerger delete working dir
Hudson (JIRA)
[Nutch-dev] 陈先生
li
[Nutch-dev] 陈先生
li
[Nutch-dev] OPIC scoring differences
Carl Cerecke
Re: [Nutch-dev] OPIC scoring differences
Doğacan Güney
Re: [Nutch-dev] OPIC scoring differences
Andrzej Bialecki
Re: [Nutch-dev] OPIC scoring differences
Doğacan Güney
Re: [Nutch-dev] OPIC scoring differences
Andrzej Bialecki
[Nutch-dev] [jira] Created: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
[Nutch-dev] [jira] Updated: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
[Nutch-dev] [jira] Commented: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
[Nutch-dev] [jira] Closed: (NUTCH-509) Update Crawldb: avoid to start a job if there is no valid segment
Emmanuel Joke (JIRA)
[Nutch-dev] [jira] Created: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker
Emmanuel Joke (JIRA)
[Nutch-dev] mozdex as a backend search engine.
Tsengtan A Shuy
[Nutch-dev] [jira] Commented: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker
Enis Soztutar (JIRA)
[Nutch-dev] [jira] Created: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
Emmanuel Joke (JIRA)
[Nutch-dev] [jira] Resolved: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
JIRA
[Nutch-dev] [jira] Closed: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-507) lib-lucene-analyzers jar defintion is wrong in plugin.xml
Hudson (JIRA)
[Nutch-dev] 您好!合作共赢!
ysl
[Nutch-dev] 网络赚钱新创意!
tony
[Nutch-dev] 特别推介!:提供国际最流行的虚拟办公模式:月支出100元起,你将拥有设在广州的公司、分支机构、办公室、联络处。以最小的支出而达到你
纳 福
[Nutch-dev] Faustino, Our product updates a body and a soul.
Faustino Price222189
[Nutch-dev] URL Injection with another source than text files
Epo Jemba
Re: [Nutch-dev] URL Injection with another source than text files
Epo Jemba
[Nutch-dev] 你有事吗,
代办税票
[Nutch-dev] 你有事吗,
代办税票
[Nutch-dev] 你有事吗,
代办税票
[Nutch-dev] 你有事吗,
代办税票
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #140
hudson
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #139
hudson
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #138
hudson
[Nutch-dev] Patch to skip hidden plugin directories
David Fuhry
[Nutch-dev] 中国692位顶尖训练师/顾问详细联系方式
asos
[Nutch-dev] 中国692位顶尖训练师/顾问详细联系方式
asos
[Nutch-dev] 经营业务。
hu
[Nutch-dev] Plans on releasing another bug fix release?
Briggs
Re: [Nutch-dev] Plans on releasing another bug fix release?
Andrzej Bialecki
Re: [Nutch-dev] Plans on releasing another bug fix release?
Doug Cutting
Re: [Nutch-dev] Plans on releasing another bug fix release?
Andrzej Bialecki
Re: [Nutch-dev] Plans on releasing another bug fix release?
Nuther
Re: [Nutch-dev] Plans on releasing another bug fix release?
Andrzej Bialecki
Re: [Nutch-dev] Plans on releasing another bug fix release?
Briggs
Re: [Nutch-dev] Plans on releasing another bug fix release?
rubdabadub
Re: [Nutch-dev] Plans on releasing another bug fix release?
Ian Holsman
Re: [Nutch-dev] Plans on releasing another bug fix release?
Briggs
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #137
hudson
[Nutch-dev] 你有事找我吗?
代办税票
[Nutch-dev] 你有事找我吗?
代办税票
[Nutch-dev] 你有事找我吗?
代办税票
[Nutch-dev] Nutch nightly build and NUTCH-505 draft patch
Kai_testing Middleton
Re: [Nutch-dev] Nutch nightly build and NUTCH-505 draft patch
Doğacan Güney
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #136
hudson
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #135
hudson
[Nutch-dev] in top-secret lingerie film
Kate Moss stars in top-secret lingerie film
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #134
hudson
[Nutch-dev] 通知
home66
[Nutch-dev] Fwd: failed to subscribe 'nutch-user' maillist
Oscar
Re: [Nutch-dev] failed to subscribe 'nutch-user' maillist
Susam Pal
[Nutch-dev] 请 回 电!
胡先生
[Nutch-dev] problem running "bin/nutch crawl urls -dir crawl -depth 3 -topN 50" command
Tsengtan A Shuy
Re: [Nutch-dev] problem running "bin/nutch crawl urls -dir crawl -depth 3 -topN 50" command
Tsengtan A Shuy
[Nutch-dev] [jira] Created: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Issue Comment Edited: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Resolved: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Closed: (NUTCH-506) Nutch should delegate compression to Hadoop
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-506) Nutch should delegate compression to Hadoop
Hudson (JIRA)
[Nutch-dev] 贵公司负责人收:
李先生
[Nutch-dev] 贵公司负责人收:
李先生
[Nutch-dev] new lists
Dickson R Karen
[Nutch-dev] problem with nutch 0.8.1 compile
Tsengtan A Shuy
Re: [Nutch-dev] problem with nutch 0.8.1 compile
Tsengtan A Shuy
Re: [Nutch-dev] problem with nutch 0.8.1 compile
Tsengtan A Shuy
[Nutch-dev] [jira] Updated: (NUTCH-392) OutputFormat implementations should pass on Progressable
JIRA
[Nutch-dev] AD:俊衡贸易
陈先生
[Nutch-dev] 《深圳恒天贸易公司》
吕伟哥
[Nutch-dev] 《深圳恒天贸易公司》
吕伟哥
[Nutch-dev] 优惠代开发~~票!
凯达实业有限公司
[Nutch-dev] 您好,您找我有事吗?
代办税票
[Nutch-dev] 您好,您找我有事吗?
代办税票
[Nutch-dev] 您好,您找我有事吗?
代办税票
[Nutch-dev] 您好,您找我有事吗?
代办税票
[Nutch-dev] 看点精彩的
爱人
[Nutch-dev] 信用卡空卡贷款
paul
[Nutch-dev] 信用卡空卡贷款
paul
[Nutch-dev] JIRA email question
Doğacan Güney
Re: [Nutch-dev] JIRA email question
Doug Cutting
[Nutch-dev] [jira] Commented: (NUTCH-289) CrawlDatum should store IP address
JIRA
[Nutch-dev] NUTCH-119 :: how hard to fix
Kai_testing Middleton
Re: [Nutch-dev] NUTCH-119 :: how hard to fix
Doğacan Güney
Re: [Nutch-dev] NUTCH-119 :: how hard to fix
Kai_testing Middleton
Re: [Nutch-dev] NUTCH-119 :: how hard to fix
Doğacan Güney
Re: [Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Kai_testing Middleton
[Nutch-dev] You've received a postcard from a family member!
notme.hk
[Nutch-dev] Re-crawling Problem
Luca Rondanini
[Nutch-dev] 想看就看
爱人
Re: [Nutch-dev] svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-...
Chris Mattmann
Re: [Nutch-dev] svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-...
Dennis Kubes
Re: [Nutch-dev] svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-...
Chris Mattmann
[Nutch-dev] 艾尔产品设计公司资料(公司负责人收)
艾尔产品设计
[Nutch-dev] [jira] Updated: (NUTCH-356) Plugin repository cache can lead to memory leak
JIRA
[Nutch-dev] 信息咨询!
liude
[Nutch-dev] Hudson build is back to normal: Nutch-Nightly #127
hudson
[Nutch-dev] 公司
fan9415
[Nutch-dev] [jira] Created: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Andrzej Bialecki (JIRA)
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Issue Comment Edited: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Resolved: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Hudson (JIRA)
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Espen Amble Kolstad (JIRA)
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Updated: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
Andrzej Bialecki (JIRA)
[Nutch-dev] [jira] Commented: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] [jira] Closed: (NUTCH-505) Outlink urls should be validated
JIRA
[Nutch-dev] 你好.有事找我吗?
代办税票
[Nutch-dev] 你好.有事找我吗?
代办税票
[Nutch-dev] 你好.有事找我吗?
代办税票
[Nutch-dev] Build failed in Hudson: Nutch-Nightly #126
hudson
[Nutch-dev] Homeland and International Security
Nadia Duffy
[Nutch-dev] 叶伟,向你咨询
szsjfsygs
[Nutch-dev] 叶伟,向你咨询
szsjfsygs
[Nutch-dev] 叶伟,向你咨询
szsjfsygs
[Nutch-dev] 叶伟,向你咨询
szsjfsygs
[Nutch-dev] 叶伟,向你咨询
szsjfsygs
Earlier messages
Later messages