Oh I have asked a silly question about regex, hehe. 2006/2/23, Jack Tang <[EMAIL PROTECTED]>: > > Hi > > I think in the url-filter it uses "contain" rather than "match". > > /Jack > > On 2/23/06, Elwin <[EMAIL PROTECTED]> wrote: > > # accept hosts in MY.DOMAIN.NAME > > +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/ > > > > Will this pattern accept url like this > http://MY.DOMAIN.NAME/([a-z0-9]*\.)*/? > > I think it's not, but in fact nutch can crawl and get urls like that in > > intranet crawl. Why? > > > > > > > -- > Keep Discovering ... ... > http://www.jroller.com/page/jmars >
-- 《盖世豪侠》好评如潮,让无线收视居高不下, 无线高兴之余,仍未重用。周星驰岂是池中物, 喜剧天分既然崭露,当然不甘心受冷落,于是 转投电影界,在大银幕上一展风采。无线既得 千里马,又失千里马,当然后悔莫及。
