Find:

(?<=/)(d(?:[^\s/]|\\\x20)*?\.zip)\b

Or commented:

(?x)                (?# Free-spacing mode: ignore whitespace, allow comments)
(?<=/)              (?# Look behind a slash not including it in the match)
(                   (?# Start of capture \1)
    d               (?# Literal 'd')
    (?:             (?# Start non capturing parentheses)
        [^\s/]      (?# NOT [whitespace or slash] character)
    |               (?# or)
        \\\x20      (?# Backslash escaped space)
    )               (?# End non capturing parentheses)
    *?              (?# Match 0 or more, lazily)
    \.              (?# Literal '.')
    zip             (?# Literal 'zip')
)                   (?# End of capture \1)
\b                  (?# Word boundary)

Should match:
ftp://ftp.scene.org/pub/demos/artists/0xf/drunkchessboard.zip
ftp://ftp.scene.org/pub/demos/artists/0xf/d.zip 
"ftp://ftp.scene.org/pub/demos/artists/0xf/d0xf+==&.zip";
/path/to/unicode/files/d你好.zip
/path/to/document\ with_escaped_space.zip

Should NOT match:
ftp://ftp.scene.org/pub/demos/artists/0xf/d.zipped      -- Wrong extension.
/path/to/document with_unescaped_space.zip              -- Has unescaped space.
document.zip                                            -- Missing /.
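
For anyone who wants to check the pattern outside BBEdit, here is a minimal sketch in Python. It assumes Python's re module behaves like BBEdit's grep for this particular pattern (lookbehind, \x20 and \b are supported by both); the names ZIP_STARTING_WITH_D and find_zip_names are made up for this example.

```python
import re

# Compact form of the pattern: after a slash, a name starting with 'd',
# made of non-space/non-slash characters or backslash-escaped spaces,
# ending in '.zip' at a word boundary.
ZIP_STARTING_WITH_D = re.compile(r"(?<=/)(d(?:[^\s/]|\\\x20)*?\.zip)\b")


def find_zip_names(line):
    """Return the captured file names found in one line of text."""
    return ZIP_STARTING_WITH_D.findall(line)


should_match = [
    "ftp://ftp.scene.org/pub/demos/artists/0xf/drunkchessboard.zip",
    "ftp://ftp.scene.org/pub/demos/artists/0xf/d.zip",
    '"ftp://ftp.scene.org/pub/demos/artists/0xf/d0xf+==&.zip";',
    "/path/to/unicode/files/d你好.zip",
    r"/path/to/document\ with_escaped_space.zip",
]

should_not_match = [
    "ftp://ftp.scene.org/pub/demos/artists/0xf/d.zipped",
    "/path/to/document with_unescaped_space.zip",
    "document.zip",
]

if __name__ == "__main__":
    for line in should_match + should_not_match:
        print(line, "->", find_zip_names(line))
```

To filter a large list, the same function can be applied line by line, keeping only the lines for which it returns a non-empty list.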

HTH

Jean Jourdain

On Sunday, February 6, 2022 at 10:39:09 AM UTC+1 Kaveh wrote:

> It's not clear to me what you want to do. Can you post a sample of the input
> lines and the output needed?
>
> On Sat, 5 Feb 2022 at 23:09, ejonesss <[email protected]> wrote:
>
>> I was wondering what grep pattern I would need to find all occurrences of
>> a file name that begins with a given letter, in URLs such as
>>
>> ftp://ftp.scene.org/pub/demos/artists/0xf/drunkchessboard.zip
>>
>>
>> For example, I want to find all lines that have a “.zip” file whose name
>> begins with “d”:
>>
>> drunkchessboard.zip
>>
>> I got the .zip-finding part OK; that is how I extracted all the zips from
>> a massive 600000-line list.
>>
>> Now the tricky part is detecting the “/d” part.
>>
>>
>>
>> -- 
>> This is the BBEdit Talk public discussion group. If you have a feature 
>> request or need technical support, please email "[email protected]" 
>> rather than posting here. Follow @bbedit on Twitter: <
>> https://twitter.com/bbedit>
>> --- 
>> You received this message because you are subscribed to the Google Groups 
>> "BBEdit Talk" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected].
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/bbedit/9029aa96-cd05-4724-8126-4ea34ef23e99n%40googlegroups.com
>>
>
>
> -- 
> Kaveh Bazargan PhD
> Director
> River Valley Technologies <http://rivervalley.io> ● Twitter 
> <https://twitter.com/rivervalley1000> ● LinkedIn 
> <https://www.linkedin.com/in/bazargankaveh/> ● ORCID 
> <https://orcid.org/0000-0002-1414-9098>
> *Accelerating the Communication of Research*
>

To view this discussion on the web visit 
https://groups.google.com/d/msgid/bbedit/141970c2-2cdc-4fa5-bf98-e91602e29291n%40googlegroups.com.
