Sorry about that i copy and pasted the name of the field and ended up sending the link.. Oppps

Shane.

On 31/03/14 13:57, Shane Wood wrote:
Thanks for your reply

I made no changes except i editored the db.fetch.schedule.class and replaced the default value with "org.apache.nutch.crawl.AdaptiveFetchSchedule" Now the modifiedTime <http://cbm8bit.com/phpmyadmin/sql.php?db=websearch&table=webpage&sql_query=SELECT+%2A+FROM+%60webpage%60+ORDER+BY+%60webpage%60.%60modifiedTime%60+ASC&token=fbe95d5ad41f1dc57ac94d5bfb3c1ec0> field in the database is being filled, i am still reading on the Adaptive fetch. Before the modifieldTime field was always Null... The prevModifiedTime <http://cbm8bit.com/phpmyadmin/sql.php?db=websearch&table=webpage&sql_query=SELECT+%2A+FROM+%60webpage%60+ORDER+BY+%60webpage%60.%60prevModifiedTime%60+ASC&token=fbe95d5ad41f1dc57ac94d5bfb3c1ec0> field is un affected and remains Null so far.

Cheers
Shane.



On 30/03/14 20:32, Lewis John Mcgibbney wrote:
Hi Shane,

On Sat, Mar 29, 2014 at 10:15 PM,<[email protected]> wrote:

How do you use the readdb command when using MYSQL there is no crawldb
created ?

A physical crawldb residing on HDFS is non-existent. It's equivalent in 2.x
is the WebPage table which you will see is created depending on which
mapping you use within your data model.
You can take a look into /conf/gora-sql-mapping.xml to get an idea of how
physical mapping's work into MySQL.


Can you list the command too use.

The command use can be found in the 2.x section here
https://wiki.apache.org/nutch/bin/nutch%20readdb


Or does Nutch still create a crawldb but i cant find it, where is it
created ? i have /crawl folder but nothing appears in there.

No there is no crawldb as explained above. MySQL stores the WebPage data in
a table.


hth
Lewis




Reply via email to