[ https://issues.apache.org/jira/browse/NUTCH-780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vu Hoang updated NUTCH-780:
---------------------------

    Component/s:     (was: ndfs)
                 fetcher

> Nutch crawler did not read configuration files
> ----------------------------------------------
>
>                 Key: NUTCH-780
>                 URL: https://issues.apache.org/jira/browse/NUTCH-780
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 1.0.0
>            Reporter: Vu Hoang
>
> The Nutch searcher can read properties in its constructor ...
> {code:java|title=NutchSearcher.java|borderStyle=solid}
> NutchBean bean = new NutchBean(getFilesystem().getConf(), fs);
> ... // put search engine code here
> {code}
> ... but the Nutch crawler cannot; it only reads data from arguments.
> {code:java|title=NutchCrawler.java|borderStyle=solid}
> StringBuilder builder = new StringBuilder();
> builder.append(domainlist + SPACE);
> builder.append(ARGUMENT_CRAWL_DIR);
> builder.append(domainlist + SUBFIX_CRAWLED + SPACE);
> builder.append(ARGUMENT_CRAWL_THREADS);
> builder.append(threads + SPACE);
> builder.append(ARGUMENT_CRAWL_DEPTH);
> builder.append(depth + SPACE);
> builder.append(ARGUMENT_CRAWL_TOPN);
> builder.append(topN + SPACE);
> Crawl.main(builder.toString().split(SPACE));
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
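
The NutchCrawler.java snippet in the report joins its values with SPACE and then splits the result on SPACE again. If SPACE is a single space character (its value is not shown in the report), any value that itself contains a space would be broken into several arguments. A minimal sketch of building the argument array directly is given here. It assumes the ARGUMENT_CRAWL_* constants correspond to the -dir, -threads, -depth and -topN flags of the crawl tool; the report does not show their values. CrawlArgs and build are hypothetical names used only for illustration and are not part of Nutch.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration only; not part of Nutch. */
final class CrawlArgs {
    private CrawlArgs() {}

    /**
     * Builds the argument array in the same order as the snippet in the report.
     * The flag strings are assumptions standing in for ARGUMENT_CRAWL_DIR,
     * ARGUMENT_CRAWL_THREADS, ARGUMENT_CRAWL_DEPTH and ARGUMENT_CRAWL_TOPN.
     */
    static String[] build(String urlDir, String crawlDir,
                          int threads, int depth, long topN) {
        List<String> args = new ArrayList<>();
        args.add(urlDir);                       // seed URL directory
        args.add("-dir");                       // assumed flag name
        args.add(crawlDir);
        args.add("-threads");                   // assumed flag name
        args.add(Integer.toString(threads));
        args.add("-depth");                     // assumed flag name
        args.add(Integer.toString(depth));
        args.add("-topN");                      // assumed flag name
        args.add(Long.toString(topN));
        return args.toArray(new String[0]);
    }

    public static void main(String[] unused) {
        // Values containing spaces stay intact because nothing is split.
        String[] args = build("my urls", "my urls-crawled", 10, 3, 50L);
        System.out.println(String.join("|", args));
        // The array could then be passed to Crawl.main(args) as in the report.
    }
}
```

Each value is added as its own list element, so no separator character is needed and nothing has to be parsed back out of a string.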