Oh, I asked a silly question about regex, hehe.

2006/2/23, Jack Tang <[EMAIL PROTECTED]>:
>
> Hi
>
> I think the url-filter checks whether the URL "contains" the pattern
> rather than whether the whole URL "matches" it.
>
> /Jack
>
> On 2/23/06, Elwin <[EMAIL PROTECTED]> wrote:
> > # accept hosts in MY.DOMAIN.NAME
> > +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
> >
> > Will this pattern accept a URL like this:
> > http://MY.DOMAIN.NAME/([a-z0-9]*\.)*/?
> > I don't think it should, but in fact Nutch can crawl and fetch URLs like
> > that in an intranet crawl. Why?
> >
> >
>
>
> --
> Keep Discovering ... ...
> http://www.jroller.com/page/jmars
>
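
A minimal Java sketch of the "contain" vs. "match" point quoted above. It
assumes the filter applies each rule with find-style semantics, as Jack
suggests; it is not Nutch's actual code, and the class and method names
are made up for illustration. The leading '+' of the rule is left out
because it is the accept marker in the filter file, not part of the regex.

```java
import java.util.regex.Pattern;

class UrlFilterSketch {
    // Regex part of the rule from the thread (hypothetical constant name).
    static final Pattern RULE =
        Pattern.compile("^http://([a-z0-9]*\\.)*MY.DOMAIN.NAME/");

    // "contain" semantics: the pattern only has to be found in the URL.
    // Because of '^' it must start at the beginning, but anything may follow.
    static boolean contains(String url) {
        return RULE.matcher(url).find();
    }

    // "match" semantics: the pattern has to cover the entire URL.
    static boolean matchesWhole(String url) {
        return RULE.matcher(url).matches();
    }

    public static void main(String[] args) {
        String url = "http://MY.DOMAIN.NAME/some/path.html";
        System.out.println(contains(url));      // true
        System.out.println(matchesWhole(url));  // false
    }
}
```

Under find-style semantics the rule acts as a prefix test, so every URL
below http://MY.DOMAIN.NAME/ is accepted, whatever comes after the slash.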



--
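《盖世豪侠》 won rave reviews and kept TVB's ratings high, yet, pleased as
TVB was, it still did not give him major roles. 周星驰 (Stephen Chow) was
never one to stay in a small pond; with his comedic talent now plain to
see, he was unwilling to be sidelined, so he moved into film to show his
flair on the big screen. TVB had gained a thoroughbred and then lost it,
and naturally regretted it when it was too late.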
