Thanks!
Unfortunately my work is only in russian - anyway here is the link
https://github.com/volkov/diploma/blob/master/main.pdf?raw=true
Actually it is technical work contains some specific optimizations for
crawling news sites. At this moment my english writing skills aren't
good, but i'll try to represent some interesting aspects of my graduate
paper somehow=)
On Wed 16 Nov 2011 05:16:05 AM MSK, Lewis John Mcgibbney wrote:
---------- Forwarded message ----------
From: Lewis John Mcgibbney<[email protected]>
Date: Wed, Nov 16, 2011 at 1:15 AM
Subject: Re: Nutch project and my Ph.D. thesis.
To: [email protected]
Hi Sergey,
There was a Professor from somewhere in S America that posted recently
rearding some work he did, if you search the archives you may get a taster
for work related to Nutch.
Also can you provide a link to your work? I would be very intersted in
having a look at the areas you have been working on. Also feel free to add
your work to the wiki page references for others to see.
Thank you.
http://wiki.apache.org/nutch/AcademicArticles
On Wed, Nov 16, 2011 at 12:39 AM, Sergey A Volkov<[email protected]
wrote:
Hi!
I am postgraduate student in Saint Petersburg State University. I was
working with Nutch for about 3 years, have written my graduate work based
on it, and now I don't know what to do in my Ph.D work. (Nobody in my
department (System Programming) deals with web crawling)
I hope someone knows problems in web crawling, whose solutions can help
Nutch project and me in my future Ph.D. thesis.
Any ideas?
Thanks,
Sergey.