On 25 May 2013, at 16:13, Rob Lewis wrote:

Never tried any of BBEdit's fancier features. What would be the best way to take a file of HTML and extract just the code between the following tag
pairs:

<p itemprop="articleBody"> and </p>

There may be multiple blocks like this in the file.

It would be helpful to know if you are trying to remove all of the HTML markup or you are trying to get a plain-text rendering of the HTML file.

For the latter, the only recommendation that I would give would be to create a Text Filter or Script which would send the file through `lynx` which is going to do much better at parsing HTML and outputting plain text than any home-made solution.

TjL

--
This is the BBEdit Talk public discussion group. If you have a feature request or would like to report a problem, please email
"[email protected]" rather than posting to the group.
Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>

--- You received this message because you are subscribed to the Google Groups "BBEdit Talk" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].


Reply via email to