nutch-dev
Thread
Date
Earlier messages
Later messages
Messages by Thread
nutch gui on github
Marko Bauhardt
[Nutch Wiki] Update of "PublicServers" by ReinierBattenberg
Apache Wiki
codeformatting
Marko Bauhardt
Re: codeformatting
Andrzej Bialecki
Re: codeformatting
Marko Bauhardt
How to see System.out.println() values Featcher.java
ranjeet98
Re: How to see System.out.println() values Featcher.java
Marko Bauhardt
Re: How to see System.out.println() values Featcher.java
ranjeet98
How to enter data in to the Crawldb
Sailaja Dhiviti
Re: How to enter data in to the Crawldb
Marko Bauhardt
[jira] Created: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls
Marko Bauhardt (JIRA)
[jira] Updated: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls
Marko Bauhardt (JIRA)
[jira] Commented: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls
Marko Bauhardt (JIRA)
[jira] Updated: (NUTCH-747) inject&Index metadatas and inherit these metadatas to all matching suburls
Chris A. Mattmann (JIRA)
serializing and deserializing lucene query
ilayaraja
About NUTCH-650 (hbase integration)
Doğacan Güney
Re: About NUTCH-650 (hbase integration)
Andrzej Bialecki
Can I add a url to be crawled without putting it in a file and feeding it to "Inject"?
Paul Tomblin
Re: Can I add a url to be crawled without putting it in a file and feeding it to "Inject"?
Marko Bauhardt
MeetUp topic list posted
Ken Krugler
Re: MeetUp topic list posted
Andrzej Bialecki
Re: MeetUp topic list posted
Ken Krugler
Re: MeetUp topic list posted
Ken Krugler
[Nutch Wiki] Trivial Update of "ApacheConUs2009MeetUp" by KenKrugler
Apache Wiki
[Nutch Wiki] Trivial Update of "ApacheConUs2009MeetUp" by KenKrugler
Apache Wiki
[Nutch Wiki] Update of "ApacheConUs2009MeetUp" by KenKrugler
Apache Wiki
[Nutch Wiki] Trivial Update of "FrontPage" by KenKrugler
Apache Wiki
[Nutch Wiki] Trivial Update of "FrontPage" by KenKrugler
Apache Wiki
[Nutch Wiki] Trivial Update of "FrontPage" by KenKrugler
Apache Wiki
OSGi progress
Kirby Bohling
Re: OSGi progress
Andrzej Bialecki
Re: OSGi progress
Kirby Bohling
Web Crawler MeetUp info on wiki
Ken Krugler
Re: Web Crawler MeetUp info on wiki
Andrzej Bialecki
Meetup at ApacheCon US 2009
Ken Krugler
[Nutch Wiki] Update of "PublicServers" by stoicleo
Apache Wiki
[Nutch Wiki] Update of "07CommandLineOptions" by AlexMc
Apache Wiki
[Nutch Wiki] Update of "FrontPage" by AlexMc
Apache Wiki
[Nutch Wiki] Update of "CommandLineOptions" by AlexMc
Apache Wiki
New Extension Points?
Marko Bauhardt
Re: New Extension Points?
Andrzej Bialecki
Re: New Extension Points?
Marko Bauhardt
Wiki errors?
Alex McLintock
[Nutch Wiki] Trivial Update of "bin/nutch readdb" by AlexMc
Apache Wiki
[Nutch Wiki] Update of "bin/nutch readdb" by AlexMc
Apache Wiki
Running the Crawl without using bin/nutch in side a scala program
Sailaja Dhiviti
Re: Running the Crawl without using bin/nutch in side a scala program
Doğacan Güney
RE: Running the Crawl without using bin/nutch in side a scala program
Sailaja Dhiviti
[jira] Created: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Kirby Bohling (JIRA)
[jira] Updated: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Kirby Bohling (JIRA)
[jira] Updated: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Otis Gospodnetic (JIRA)
[jira] Closed: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Hudson (JIRA)
Server suggestion
fredericoagent
Re: Server suggestion
Dennis Kubes
Re: Server suggestion
Doğacan Güney
Re: Server suggestion
Dennis Kubes
[ApacheCon US] Travel Assistance
Grant Ingersoll
How to test searcher of nutch 1.0?
xiao yang
Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Doğacan Güney
Re: Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Doğacan Güney
Re: Nutch dev. plans
Dennis Kubes
Re: Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Kirby Bohling
Re: Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Kirby Bohling
Re: Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Doğacan Güney
Re: Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Kirby Bohling
Re: Nutch dev. plans
Andrzej Bialecki
Re: Nutch dev. plans
Ken Krugler
Re: Nutch dev. plans
Enis Soztutar
[Nutch Wiki] Update of "FrontPage" by DanielZhou
Apache Wiki
[jira] Created: (NUTCH-745) MyHtmlParser getParse return not null,so all Analyzer-(zh|fr) cannot run
jcore_XiaTian (JIRA)
[jira] Created: (NUTCH-744) indexing items in rss-feed in seperate page
Tarun Agrawal (JIRA)
[jira] Closed: (NUTCH-744) indexing items in rss-feed in seperate page
JIRA
[jira] Commented: (NUTCH-744) indexing items in rss-feed in seperate page
Tarun (JIRA)
Test Mail <EOM>
Sailaja Dhiviti
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic
Alex McLintock (JIRA)
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic
Morille Jerome (JIRA)
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic
garpinc (JIRA)
Upgrade to hadoop 0.20?
Doğacan Güney
Re: Upgrade to hadoop 0.20?
Julien Nioche
Re: Upgrade to hadoop 0.20?
Doğacan Güney
Re: Upgrade to hadoop 0.20?
Julien Nioche
adding fields to index
Beats
what is Non DFS Used in cluster summary? how to delete Non DFS Used data
Pravin Karne
what is diff between "mapred.map.tasks" and "mapred.tasktracker.map.tasks.maximum"
Pravin Karne
Nutch is very slow....what does following graph shows
Pravin Karne
test mail
Pravin Karne
Getting Crawl Depth During Runtime
MyD
How to optimize nutch's fetch perfotmance
Pravin Karne
Per-host fetch-interval
Sandeep Tata
Re: Per-host fetch-interval
Andrzej Bialecki
Re: Per-host fetch-interval
Sandeep Tata
[jira] Created: (NUTCH-743) Site search powered by Lucene/Solr
Sami Siren (JIRA)
[jira] Updated: (NUTCH-743) Site search powered by Lucene/Solr
Sami Siren (JIRA)
[jira] Commented: (NUTCH-743) Site search powered by Lucene/Solr
Andrzej Bialecki (JIRA)
[jira] Resolved: (NUTCH-743) Site search powered by Lucene/Solr
Sami Siren (JIRA)
[jira] Commented: (NUTCH-743) Site search powered by Lucene/Solr
Hudson (JIRA)
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
[jira] Created: (NUTCH-742) Checksum Error
mawanqiang (JIRA)
[jira] Resolved: (NUTCH-742) Checksum Error
Otis Gospodnetic (JIRA)
[jira] Resolved: (NUTCH-101) RobotRulesParser
Otis Gospodnetic (JIRA)
[jira] Commented: (NUTCH-101) RobotRulesParser
Ken Krugler (JIRA)
Language plugin tokenizers in Indexer?
Aaron Binns
Plugins: when to perform web service requests, on fetch or on index?
caezar
Re: Plugins: when to perform web service requests, on fetch or on index?
joel gump
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
Re: Plugins: when to perform web service requests, on fetch or on index?
joel gump
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
Re: Plugins: when to perform web service requests, on fetch or on index?
Kirby Bohling
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
Re: Plugins: when to perform web service requests, on fetch or on index?
Stefan Dlugolinsky
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
Re: Plugins: when to perform web service requests, on fetch or on index?
Stefan Dlugolinsky
[Nutch Wiki] Update of "HttpAuthenticationSchemes" by wobbet
Apache Wiki
[Nutch Wiki] Update of "Support" by Justin Gilbreath
Apache Wiki
[Nutch Wiki] Update of "Support" by Justin Gilbreath
Apache Wiki
[Nutch Wiki] Update of "Support" by Justin Gilbreath
Apache Wiki
a nutch Chinese language processing problem
fashengliu
Re: a nutch Chinese language processing problem
joel.gump
Why does TestNodeWalker keep failing?
Doğacan Güney
Re: Why does TestNodeWalker keep failing?
Andrzej Bialecki
Re: Why does TestNodeWalker keep failing?
Doğacan Güney
Antwort: Re: Why does TestNodeWalker keep failing?
marcel . schnippe
Re: Antwort: Re: Why does TestNodeWalker keep failing?
Andrzej Bialecki
Build failed in Hudson: Nutch-trunk #840
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #841
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #842
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #843
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #844
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #845
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #846
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #847
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #848
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #849
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #850
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #851
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #852
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #853
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #854
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #855
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #856
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #857
Apache Hudson Server
Re: Build failed in Hudson: Nutch-trunk #857
Doğacan Güney
Re: Build failed in Hudson: Nutch-trunk #857
Dennis Kubes
Build failed in Hudson: Nutch-trunk #858
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #859
Apache Hudson Server
Build failed in Hudson: Nutch-trunk #860
Apache Hudson Server
Hudson build is back to normal: Nutch-trunk #861
Apache Hudson Server
[Nutch Wiki] Update of "IntranetRecrawl" by susam
Apache Wiki
[Nutch Wiki] Update of "IntranetRecrawl" by susam
Apache Wiki
Software to Evaluate Algorithms
kloc4mif
org.apache.nutch.protocol.file.FileError: File Error: 404
Mr Shore
Extending Nutch to create HTML text summaries
Rodrigo Reyes C.
anyone sucessfully debug nutch1.0 in ecli...@windows?
Mr Shore
[Nutch Wiki] Update of "GettingNutchRunningWithWindows" by JohnWhelan
Apache Wiki
[Nutch Wiki] Update of "FrontPage" by JohnWhelan
Apache Wiki
[Nutch Wiki] Update of "FrontPage" by JohnWhelan
Apache Wiki
IOException in dedup
Nic M
Re: IOException in dedup
Ken Krugler
Re: IOException in dedup
Nic M
Re: IOException in dedup
Ken Krugler
Re: IOException in dedup
MyD
Re: IOException in dedup
Doğacan Güney
Re: IOException in dedup
Nic M
[Nutch Wiki] Update of "Support" by JulienNioche
Apache Wiki
debugging problem of nutch10
Mr Shore
How can I get startted with Nutch 1.0
逐鹿
Re: How can I get startted with Nutch 1.0
Susam Pal
Ranking & Scoring Algorithm Pseudocode
atencorps
Re: Ranking & Scoring Algorithm Pseudocode
Dennis Kubes
[jira] Created: (NUTCH-741) Job file includes multiple copies of nutch config files.
Kirby Bohling (JIRA)
[jira] Updated: (NUTCH-741) Job file includes multiple copies of nutch config files.
Kirby Bohling (JIRA)
[jira] Commented: (NUTCH-741) Job file includes multiple copies of nutch config files.
Andrzej Bialecki (JIRA)
[jira] Closed: (NUTCH-741) Job file includes multiple copies of nutch config files.
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-741) Job file includes multiple copies of nutch config files.
Hudson (JIRA)
Eclipse Nutch1.0 IOException
Georg Kirschner
Re: Eclipse Nutch1.0 IOException
Marko Bauhardt
Re: Eclipse Nutch1.0 IOException
Frank McCown
Re: Eclipse Nutch1.0 IOException
Georg Kirschner
[jira] Created: (NUTCH-740) Configuration option to override default language for fetched pages.
Marcin Okraszewski (JIRA)
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Marcin Okraszewski (JIRA)
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Otis Gospodnetic (JIRA)
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages.
JIRA
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Marcin Okraszewski (JIRA)
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages.
Julien Nioche (JIRA)
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Otis Gospodnetic (JIRA)
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Julien Nioche (JIRA)
[jira] Closed: (NUTCH-740) Configuration option to override default language for fetched pages.
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages.
Hudson (JIRA)
Remove duplicate nutch conf files from .job file
Kirby Bohling
Earlier messages
Later messages