Try this one:
^http://123.com/456/789/[^/]+/$
With that you allow all characters except / between the forward slashes.

Besides, a good web page for online regular expression testing can be found at:
http://www.fileformat.info/tool/regex.htm

Kind regards,
Martina


-----Ursprüngliche Nachricht-----
Von: Alex Basa [mailto:[email protected]] 
Gesendet: Montag, 16. Februar 2009 18:08
An: [email protected]
Betreff: Re: regex for a folder only crawl

I actually tried that but it also picks up

http://123.com/456/789/folderA/folderB/
http://123.com/456/789/folderA/folderB/folderC/

what I really need is something to say the first slash after the previous one.

--- On Mon, 2/16/09, Cool The Breezer <[email protected]> wrote:

> From: Cool The Breezer <[email protected]>
> Subject: Re: regex for a folder only crawl
> To: [email protected]
> Date: Monday, February 16, 2009, 9:47 AM
> Try 
> ^http://123.com/456/789/.*/$, which says end should be /
> 
> - RB
> 
> --- On Mon, 2/16/09, Alex Basa <[email protected]>
> wrote:
> 
> > From: Alex Basa <[email protected]>
> > Subject: regex for a folder only crawl
> > To: [email protected]
> > Date: Monday, February 16, 2009, 9:54 AM
> > Hi guys,
> > 
> > I'm trying to make a regex to only crawl a folder.
>  So
> > if I was crawling 123.com/456/789
> > 
> > I would only want to crawl
> > ^http://123.com/456/789/(.*)/
> > 
> > I tried
> > ^http://123.com/456/789/*\.*
> > 
> > but there are many web pages with no file extensions.
> > 
> > I'm not sure how to specify only one forward slash
> > after in the regex.  Any ideas?
> > 
> > Thanks as always in advance,
> > 
> > Alex


      

Reply via email to