Here's a place to start, using our friend BBEdit's powers.

I copied the sample you provided below into a new BBEdit document, then chose the menu command Markup -> Utilities -> Translate HTML to Text, with Remove Tags and Convert Paragraphs checked and Convert HTML Entities unchecked. That yielded:

1092
00:40:25,710 --> 00:40:29,220
the kid has no idea what you mean

1093
00:40:27,119 --> 00:40:31,019
because you don't know what you mean but

1094
00:40:29,219 --> 00:40:33,149
you mean something like well don't be a

... etc

Now your job is much easier to handle with pattern-matching in the Find/Replace dialog, especially if the numbers and times above each dialog line are as consistent as in your sample. The simplest approach may be to delete every line that starts with a digit, though that would also remove any spoken line that happens to begin with a number.
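If you would rather script the whole thing outside BBEdit, here is a minimal Python sketch of the same idea. It is only an illustration under some assumptions: the cue numbers are digits-only lines, the timing lines look exactly like the ones in your sample, and the only markup is simple tags such as <font>. The names spoken_lines, TAG, CUE_INDEX and CUE_TIMING are mine, not anything from BBEdit. Instead of deleting every line that starts with a digit, it drops only lines that look like a cue number or a timing line, so a spoken line that starts with a number followed by words is kept.

```python
import re

# Assumed patterns, based only on the sample in this thread.
TAG = re.compile(r"<[^>]+>")                      # simple tags like <font ...> and </font>
CUE_INDEX = re.compile(r"^\d+$")                  # a line holding only the cue number
CUE_TIMING = re.compile(
    r"^\d{2}:\d{2}:\d{2},\d{3} --> \d{2}:\d{2}:\d{2},\d{3}"
)                                                 # e.g. 00:40:25,710 --> 00:40:29,220


def spoken_lines(srt_text):
    """Return only the spoken lines, with tags removed and gaps dropped."""
    kept = []
    for raw in srt_text.splitlines():
        line = TAG.sub("", raw).strip()
        # Skip blank lines, cue numbers, and timing lines.
        if not line or CUE_INDEX.match(line) or CUE_TIMING.match(line):
            continue
        kept.append(line)
    return kept


sample = """1092
00:40:25,710 --> 00:40:29,220
the kid has<font color="#E5E5E5"> no idea what you mean</font>

1093
00:40:27,119 --> 00:40:31,019
because you don't know what you mean but
"""

print("\n".join(spoken_lines(sample)))
```

One limitation to keep in mind: a spoken line consisting of nothing but digits would be mistaken for a cue number and dropped.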

HTH




On 1/18/19 at 6:00 PM, [email protected] (Dj) wrote:
Original text is like below and I'm trying to fetch only the spoken/text elements so they sit next to each other without gaps. Is this a multi-part operation, or is it possible with one expression? Thanks!

1092
00:40:25,710 --> 00:40:29,220
the kid has<font color="#E5E5E5"> no idea what you mean</font>
--

  - Bruce

_bruce__van_allen__santa_cruz__ca_
