Here's a place to start, using our friend BBEdit's powers.
I copied the sample you provided below into a new BBEdit doc,
and then chose the menu command Markup -> Utilities -> Translate
HTML to Text, with Remove Tags and Convert Paragraphs checked
and Convert HTML Entities unchecked. That yielded:
1092
00:40:25,710 --> 00:40:29,220
the kid has no idea what you mean
1093
00:40:27,119 --> 00:40:31,019
because you don't know what you mean but
1094
00:40:29,219 --> 00:40:33,149
you mean something like well don't be a
... etc
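If you ever want to do the same tag-stripping step outside BBEdit, here is a rough Python sketch. It is not what the menu command does internally, just a simple regex that assumes well-behaved markup like the font tags in your sample; the names TAG and strip_tags are made up for illustration. Entities are left alone, matching the unchecked Convert HTML Entities option.

```python
import re

# Assumption: tags are simple and never contain a literal ">" inside
# an attribute value, as in the <font color="..."> sample.
TAG = re.compile(r"<[^>]+>")

def strip_tags(text: str) -> str:
    """Remove anything that looks like an HTML tag; entities are untouched."""
    return TAG.sub("", text)

sample = 'the kid has<font color="#E5E5E5"> no idea what you mean</font>'
print(strip_tags(sample))
```

For messier HTML a real parser would be safer than a regex.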
Now your job is much easier to handle with pattern matching in
the Find/Replace dialog, especially if the numbers and times
above each line of dialogue are as consistent as in your sample:
maybe just delete any line starting with a digit.
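The "delete any line starting with a digit" idea can be sketched in Python like this, in case a script is handier than the dialog. The function name spoken_lines is hypothetical, and the sketch assumes the tags are already stripped and that no spoken line itself begins with a digit (one that did would be dropped too).

```python
import re

def spoken_lines(text: str) -> str:
    """Keep only non-blank lines that do not start with a digit."""
    kept = [
        line
        for line in text.splitlines()
        # re.match anchors at the start of the line, so this drops both
        # the counter lines (1092) and the timing lines (00:40:25,710 ...).
        if line.strip() and not re.match(r"\d", line)
    ]
    return "\n".join(kept)

sample = """1092
00:40:25,710 --> 00:40:29,220
the kid has no idea what you mean
1093
00:40:27,119 --> 00:40:31,019
because you don't know what you mean but
1094
00:40:29,219 --> 00:40:33,149
you mean something like well don't be a
"""

print(spoken_lines(sample))
```

If some dialogue lines might begin with a number, a stricter pattern that matches only a bare counter or a full timing line would be a safer choice.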
HTH
On 1/18/19 at 6:00 PM, [email protected] (Dj) wrote:
Original text is like below, and I'm trying to fetch only the
spoken/text elements so they sit next to each other without
gaps. Is this a multi-part operation, or is it possible with
one expression? Thanks!
1092
00:40:25,710 --> 00:40:29,220
the kid has<font color="#E5E5E5"> no idea what you mean</font>
--
- Bruce
_bruce__van_allen__santa_cruz__ca_
--
This is the BBEdit Talk public discussion group. If you have a
feature request or need technical support, please email
"[email protected]" rather than posting to the group.