Just got around to this again, I'll have a go with your suggestions this 
weekend. Thanks!

On Tuesday, April 23, 2019 at 7:23:56 PM UTC-7, Dave R wrote:
>
> Hi David… 
>
> The only way you can extract them is if you can identify the bold text, so 
> you could do it by finding the <em> and <strong> and probably other tags in 
> html.. and probably epub too, as I think that is basically html. 
>
> PDF’s i’m not sure of.. 
>
> A single regex won’t do it, you’ll end up with some kind of script or 
> BBEdit Text Factory. 
>
> if html 
>         look for content inside these tags 
> if epub 
>         … 
> if pdf 
>         .. 
> etc. 
>
> To build a regex the best place to start is by finding examples of each 
> thing you want to match. Then show us the examples and we can help with the 
> next steps! 
>
> Cheers, 
>
> Dave 
>
> > On 23-April-2019, at 4:02 PM, 'David J' via BBEdit Talk <
> [email protected] <javascript:>> wrote: 
> > 
> > I have a folder full of journal entries from over the years that are 
> various file types; .html, epub, pdf. I would like to extract all text that 
> is in bold from them. Is it possible to do with one regex for all the file 
> types in one go?  Thanks! 
> > 
> > -- 
> > This is the BBEdit Talk public discussion group. If you have a 
> > feature request or need technical support, please email 
> > "[email protected] <javascript:>" rather than posting to the group. 
> > Follow @bbedit on Twitter: <https://www.twitter.com/bbedit> 
> > --- 
> > You received this message because you are subscribed to the Google 
> Groups "BBEdit Talk" group. 
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email to [email protected] <javascript:>. 
> > To post to this group, send email to [email protected] 
> <javascript:>. 
> > Visit this group at https://groups.google.com/group/bbedit. 
>
>

-- 
This is the BBEdit Talk public discussion group. If you have a 
feature request or need technical support, please email
"[email protected]" rather than posting to the group.
Follow @bbedit on Twitter: <https://www.twitter.com/bbedit>
--- 
You received this message because you are subscribed to the Google Groups 
"BBEdit Talk" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/bbedit.

Reply via email to