nutch-dev
Thread
Date
Earlier messages
Later messages
Messages by Date
2010/02/08
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0.
Dawid Weiss (JIRA)
2010/02/07
Hudson build is back to normal : Nutch-trunk #1062
Apache Hudson Server
2010/02/07
plugin dev trouble
Sahil Shah
2010/02/06
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0.
Dawid Weiss (JIRA)
2010/02/06
[jira] Updated: (NUTCH-787) Upgrade Lucene to 3.0.0.
Dawid Weiss (JIRA)
2010/02/05
[jira] Commented: (NUTCH-786) Better list of suffix domains
Ken Krugler (JIRA)
2010/02/05
[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0.
Dawid Weiss (JIRA)
2010/02/05
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0
Dawid Weiss (JIRA)
2010/02/05
[jira] Created: (NUTCH-787) Upgrade Lucene to 3.0.0.
Dawid Weiss (JIRA)
2010/02/05
[jira] Closed: (NUTCH-786) Better list of suffix domains
Julien Nioche (JIRA)
2010/02/05
[jira] Updated: (NUTCH-786) Better list of suffix domains
Julien Nioche (JIRA)
2010/02/05
[jira] Created: (NUTCH-786) Better list of suffix domains
Julien Nioche (JIRA)
2010/02/05
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0
Andrzej Bialecki (JIRA)
2010/02/05
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0
Sami Siren (JIRA)
2010/02/05
[jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0
Dawid Weiss (JIRA)
2010/02/03
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again
Serykh Evgeniy (JIRA)
2010/02/03
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again
Serykh Evgeniy (JIRA)
2010/02/03
[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again
Serykh Evgeniy (JIRA)
2010/02/02
[jira] Commented: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Hudson (JIRA)
2010/02/02
Logging to the terminal
Santiago Pérez
2010/02/02
[jira] Commented: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Sami Siren (JIRA)
2010/02/02
[jira] Commented: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Julien Nioche (JIRA)
2010/02/01
[jira] Commented: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Hudson (JIRA)
2010/02/01
[jira] Commented: (NUTCH-775) Enhance Searcher interface
Hudson (JIRA)
2010/02/01
[jira] Commented: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Sami Siren (JIRA)
2010/02/01
[jira] Resolved: (NUTCH-775) Enhance Searcher interface
Sami Siren (JIRA)
2010/02/01
[jira] Updated: (NUTCH-785) Fetcher : copy metadata from origin URL when redirecting + call scfilters.initialScore on newly created URL
Julien Nioche (JIRA)
2010/02/01
[jira] Created: (NUTCH-785) Fetcher : copy metadata from origin URL when redirecting + call scfilters.initialScore on newly created URL
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-784) CrawlDBScanner
Julien Nioche (JIRA)
2010/02/01
[jira] Created: (NUTCH-784) CrawlDBScanner
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Julien Nioche (JIRA)
2010/02/01
[jira] Assigned: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-783) IndexerChecker Utilty
Julien Nioche (JIRA)
2010/02/01
[jira] Assigned: (NUTCH-783) IndexerChecker Utilty
Julien Nioche (JIRA)
2010/02/01
[jira] Created: (NUTCH-783) IndexerChecker Utilty
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-782) Ability to order htmlparsefilters
Julien Nioche (JIRA)
2010/02/01
[jira] Created: (NUTCH-782) Ability to order htmlparsefilters
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/02/01
[jira] Updated: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/02/01
[jira] Closed: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Julien Nioche (JIRA)
2010/02/01
[jira] Resolved: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Julien Nioche (JIRA)
2010/02/01
[jira] Created: (NUTCH-781) Update Tika to v0.6 for the MimeType detection
Julien Nioche (JIRA)
2010/01/31
NativeCodeLoader - unable to load native-hadoop library for your platform
kraman
2010/01/28
Configuration - bad conf file - element not property
kraman
2010/01/28
[jira] Commented: (NUTCH-775) Enhance Searcher interface
Sami Siren (JIRA)
2010/01/28
[jira] Commented: (NUTCH-775) Enhance Searcher interface
Andrzej Bialecki (JIRA)
2010/01/28
[jira] Commented: (NUTCH-775) Enhance Searcher interface
Sami Siren (JIRA)
2010/01/28
[Nutch Wiki] Update of "Support" by OtisGospodnetic
Apache Wiki
2010/01/28
[jira] Updated: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/01/28
[jira] Commented: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/01/27
[jira] Commented: (NUTCH-766) Tika parser
Sami Siren (JIRA)
2010/01/26
[jira] Updated: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/26
[jira] Updated: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/26
[jira] Issue Comment Edited: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/26
[jira] Issue Comment Edited: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/26
Page search2.net deleted from Nutch Wiki
Apache Wiki
2010/01/25
[jira] Updated: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/25
[Nutch Wiki] Update of "FrontPage" by JohnWhelan
Apache Wiki
2010/01/25
[jira] Commented: (NUTCH-766) Tika parser
Andrzej Bialecki (JIRA)
2010/01/25
[jira] Commented: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2010/01/25
[jira] Commented: (NUTCH-766) Tika parser
Sami Siren (JIRA)
2010/01/25
Java Heap Limit Exceeded
Withanage, Dulip
2010/01/23
Re: State of nutchbase
xiao yang
2010/01/22
[jira] Issue Comment Edited: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2010/01/22
[jira] Commented: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2010/01/22
[jira] Commented: (NUTCH-766) Tika parser
Sami Siren (JIRA)
2010/01/22
[jira] Commented: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/01/22
[jira] Commented: (NUTCH-766) Tika parser
Sami Siren (JIRA)
2010/01/22
[jira] Resolved: (NUTCH-778) Running Nutch On linux having whoami exception?
Julien Nioche (JIRA)
2010/01/22
[jira] Issue Comment Edited: (NUTCH-650) Hbase Integration
Xiao Yang (JIRA)
2010/01/21
[jira] Updated: (NUTCH-650) Hbase Integration
Xiao Yang (JIRA)
2010/01/21
Re: Tried to run Crawl with depth of only 2 and getting IOException
kraman
2010/01/20
[jira] Issue Comment Edited: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/20
[jira] Updated: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/20
[jira] Commented: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/20
[jira] Commented: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/20
[jira] Commented: (NUTCH-650) Hbase Integration
Xiao Yang (JIRA)
2010/01/20
Re: [jira] Commented: (NUTCH-650) Hbase Integration
xiao yang
2010/01/20
[jira] Updated: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/20
[jira] Created: (NUTCH-780) Nutch crawler did not read configuration files
Vu Hoang (JIRA)
2010/01/20
Re: [jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
MilleBii
2010/01/20
Re: Alt text of images as anchor text
axi
2010/01/20
Re: Alt text of images as anchor text
Nutch Newbie
2010/01/20
Re: Alt text of images as anchor text
axi
2010/01/20
Re: Alt text of images as anchor text
Nutch Newbie
2010/01/20
Re: Injecting urls and define Inlink
Nutch Newbie
2010/01/20
Re: Tried to run Crawl with depth of only 2 and getting IOException
Nutch Newbie
2010/01/20
Alt text of images as anchor text
axi
2010/01/20
Nofollow links on nutch
axi
2010/01/20
Re: Injecting urls and define Inlink
MyD
2010/01/19
Re: Injecting urls and define Inlink
MilleBii
2010/01/19
Injecting urls and define Inlink
MyD
2010/01/19
[jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Andrzej Bialecki (JIRA)
2010/01/19
[jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Julien Nioche (JIRA)
2010/01/18
[jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Andrzej Bialecki (JIRA)
2010/01/18
[jira] Updated: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Julien Nioche (JIRA)
2010/01/18
[jira] Created: (NUTCH-779) Mechanism for passing metadata from parse to crawldb
Julien Nioche (JIRA)
2010/01/17
[Nutch Wiki] Update of "RunningNutchAndSolr" by GeoffBe ntley
Apache Wiki
2010/01/13
Re: Injecting URLs and define Inlink?
MyD
2010/01/11
Re: [jira] Commented: (NUTCH-650) Hbase Integration
xiao yang
2010/01/11
[jira] Commented: (NUTCH-767) Update Tika to v0.5 for the MimeType detection
Hudson (JIRA)
2010/01/11
unsubscribe
Ahmad Dahlan
2010/01/11
[jira] Commented: (NUTCH-751) Upgrade version of HttpClient
Ken Krugler (JIRA)
2010/01/11
[jira] Commented: (NUTCH-766) Tika parser
Julien Nioche (JIRA)
2010/01/11
[jira] Commented: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2010/01/11
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche
Apache Wiki
2010/01/11
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche
Apache Wiki
2010/01/11
[jira] Resolved: (NUTCH-751) Upgrade version of HttpClient
Julien Nioche (JIRA)
2010/01/11
Nutch on eclipse ant
dhamu
2010/01/11
[jira] Closed: (NUTCH-767) Update Tika to v0.5 for the MimeType detection
Julien Nioche (JIRA)
2010/01/09
[jira] Created: (NUTCH-778) Running Nutch On linux having whoami exception?
Prakash Panjwani (JIRA)
2010/01/08
[jira] Commented: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count
Hudson (JIRA)
2010/01/08
[jira] Resolved: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count
Julien Nioche (JIRA)
2010/01/08
[jira] Commented: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count
Julien Nioche (JIRA)
2010/01/08
[jira] Assigned: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count
Julien Nioche (JIRA)
2010/01/08
Hudson build is back to normal: Nutch-trunk #1033
Apache Hudson Server
2010/01/08
[jira] Updated: (NUTCH-774) Retry interval in crawl date is set to 0
Reinhard Schwab (JIRA)
2010/01/08
Why rebuild the index for each crawl?
xiao yang
2010/01/07
Re: help for hadoop and hbase
xiao yang
2010/01/07
Re: Injecting URLs and define Inlink?
xiao yang
2010/01/07
Build failed in Hudson: Nutch-trunk #1032
Apache Hudson Server
2010/01/07
Injecting URLs and define Inlink?
MyD
2010/01/07
Potential Bug: Index documents with incorrect segment numbers
igor.k
2010/01/07
[Nutch Wiki] Trivial Update of "PublicServers" by Geoff reyMcCaleb
Apache Wiki
2010/01/07
Re: [jira] Commented: (NUTCH-776) Configurable queue depth
MilleBii
2010/01/07
help for hadoop and hbase
wnkdu
2010/01/07
[jira] Commented: (NUTCH-776) Configurable queue depth
Julien Nioche (JIRA)
2010/01/06
[Nutch Wiki] Update of "FAQ" by GodmarBack
Apache Wiki
2010/01/06
[Nutch Wiki] Update of "FAQ" by GodmarBack
Apache Wiki
2010/01/06
[Nutch Wiki] Update of "FAQ" by GodmarBack
Apache Wiki
2010/01/06
[jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable
Godmar Back (JIRA)
2010/01/06
[Nutch Wiki] Update of "FAQ" by GodmarBack
Apache Wiki
2010/01/06
[jira] Resolved: (NUTCH-655) Injecting Crawl metadata
Julien Nioche (JIRA)
2010/01/06
[jira] Closed: (NUTCH-655) Injecting Crawl metadata
Julien Nioche (JIRA)
2010/01/06
[jira] Commented: (NUTCH-655) Injecting Crawl metadata
Julien Nioche (JIRA)
2010/01/05
[jira] Commented: (NUTCH-655) Injecting Crawl metadata
Andrzej Bialecki (JIRA)
2010/01/05
[jira] Assigned: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19
Julien Nioche (JIRA)
2010/01/05
[jira] Assigned: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB
Julien Nioche (JIRA)
2010/01/05
[jira] Commented: (NUTCH-655) Injecting Crawl metadata
Julien Nioche (JIRA)
2010/01/05
[jira] Assigned: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Julien Nioche (JIRA)
2010/01/05
[jira] Assigned: (NUTCH-655) Injecting Crawl metadata
Julien Nioche (JIRA)
2010/01/05
Nutch Developers needed for a Nutch powered search engine
SC Interactive Global Media SRL
2010/01/05
[jira] Closed: (NUTCH-658) Add Counter for # of doc fetched in Reporter
Julien Nioche (JIRA)
2010/01/05
[jira] Resolved: (NUTCH-658) Add Counter for # of doc fetched in Reporter
Julien Nioche (JIRA)
2010/01/04
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Julien Nioche (JIRA)
2010/01/04
[jira] Commented: (NUTCH-658) Add Counter for # of doc fetched in Reporter
Julien Nioche (JIRA)
2010/01/03
Debug Nutch Web Site In Eclipse?
Jason DeMorrow
2009/12/31
Happy New Year 2010
Raghavendra Neelekani
2009/12/30
unsubscribe
Futebol DotInfo
2009/12/30
[jira] Updated: (NUTCH-775) Enhance Searcher interface
Sami Siren (JIRA)
2009/12/30
Re: [jira] Updated: (NUTCH-755) DomainURLFilter crashes on malformed URL
Futebol DotInfo
2009/12/29
[jira] Updated: (NUTCH-755) DomainURLFilter crashes on malformed URL
Mike Baranczak (JIRA)
2009/12/29
[jira] Commented: (NUTCH-755) DomainURLFilter crashes on malformed URL
Mike Baranczak (JIRA)
2009/12/28
RE: Mutithreaded parsing
Santiago Pérez
2009/12/28
RE: Mutithreaded parsing
Fuad Efendi
2009/12/28
Mutithreaded parsing
Santiago Pérez
2009/12/27
[jira] Commented: (NUTCH-385) Server delay feature conflicts with maxThreadsPerHost
Mike Baranczak (JIRA)
2009/12/27
[Nutch Wiki] Update of "search2.net" by search2.net
Apache Wiki
2009/12/26
[Nutch Wiki] Update of "PublicServers" by search2.net
Apache Wiki
2009/12/25
Re: [ANNOUNCE] New Nutch Committer: Julien Nioche
Futebol DotInfo
2009/12/25
Re: [ANNOUNCE] New Nutch Committer: Julien Nioche
Doğacan Güney
2009/12/25
Re: [ANNOUNCE] New Nutch Committer: Julien Nioche
Julien Nioche
2009/12/25
[Nutch Wiki] Update of "PublicServers" by RBalmes
Apache Wiki
2009/12/24
[Nutch Wiki] Update of "PublicServers" by search2.net
Apache Wiki
2009/12/24
[Nutch Wiki] Update of "PublicServers" by search2.net
Apache Wiki
2009/12/24
[ANNOUNCE] New Nutch Committer: Julien Nioche
Mattmann, Chris A (388J)
2009/12/21
unsubscribe
宫照
2009/12/19
答复: unsubscribe
Boycott
2009/12/18
[jira] Commented: (NUTCH-777) Upgrading to jetty6 broke unit tests
Hudson (JIRA)
2009/12/18
[jira] Commented: (NUTCH-768) Upgrade Nutch 1.0 to use Hadoop 0.20
Hudson (JIRA)
2009/12/18
Hudson build is back to normal: Nutch-trunk #1015
Apache Hudson Server
2009/12/18
[jira] Updated: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2009/12/18
[jira] Resolved: (NUTCH-777) Upgrading to jetty6 broke unit tests
Chris A. Mattmann (JIRA)
2009/12/18
[jira] Commented: (NUTCH-777) Upgrading to jetty6 broke unit tests
Chris A. Mattmann (JIRA)
2009/12/18
[jira] Commented: (NUTCH-777) Upgrading to jetty6 broke unit tests
Chris A. Mattmann (JIRA)
2009/12/18
[jira] Commented: (NUTCH-777) Upgrading to jetty6 broke unit tests
Chris A. Mattmann (JIRA)
2009/12/18
[jira] Work started: (NUTCH-777) Upgrading to jetty6 broke unit tests
Chris A. Mattmann (JIRA)
2009/12/18
[jira] Created: (NUTCH-777) Upgrading to jetty6 broke unit tests
Chris A. Mattmann (JIRA)
2009/12/18
Creating an alternative Linkdb with part of the outlinks
Santiago Pérez
2009/12/17
Build failed in Hudson: Nutch-trunk #1014
Apache Hudson Server
2009/12/17
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Dennis Kubes (JIRA)
2009/12/17
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Andrzej Bialecki (JIRA)
2009/12/17
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Dennis Kubes (JIRA)
2009/12/17
[jira] Updated: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Dennis Kubes (JIRA)
2009/12/17
[jira] Created: (NUTCH-776) Configurable queue depth
MilleBii (JIRA)
2009/12/16
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Sami Siren (JIRA)
2009/12/16
unsubscribe
malsmith
2009/12/16
Build failed in Hudson: Nutch-trunk #1013
Apache Hudson Server
2009/12/16
Build failed in Hudson: Nutch-trunk #1012
Apache Hudson Server
2009/12/16
[jira] Commented: (NUTCH-775) Enhance Searcher interface
Andrzej Bialecki (JIRA)
2009/12/16
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche
Apache Wiki
2009/12/15
[jira] Created: (NUTCH-775) Enhance Searcher interface
Sami Siren (JIRA)
2009/12/15
[jira] Work started: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2009/12/15
[jira] Assigned: (NUTCH-766) Tika parser
Chris A. Mattmann (JIRA)
2009/12/15
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche
Apache Wiki
2009/12/14
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation.
Vincent Couturier (JIRA)
2009/12/14
[jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool
Andrzej Bialecki (JIRA)
2009/12/14
Re: Build failed in Hudson: Nutch-trunk #1011
Dennis Kubes
2009/12/14
[jira] Commented: (NUTCH-768) Upgrade Nutch 1.0 to use Hadoop 0.20
Dennis Kubes (JIRA)
Earlier messages
Later messages