Ciao Mark
I'm late on this. I got really interested in this kind of extraction, which
I think there is demand for.
Two issues I can't figure out ...
1 - does "<$vars realchars="[^\s]+">" need to be that? Rather than its
inverse "<$vars realchars="\S+">"? (Where you would not need the variable
as no need for
"[...]"??)
2 - WHEN you have text BETWEEN tags, is there a way to dump it?
Only if you have time and interest!
Best wishes
TT
Mark S. wrote:
>
> Actually, the tool we have for regexp is also a bit lacking. There's no
> tool for directly lifting desired target text. The new splitregexp only
> splits, it doesn't
> return the text we want to find. Here's my version that does most
> literally what you ask for
>
> <$vars realchars="[^\s]+">
> <$list filter="[{test}splitregexp[\n]join[ ]splitregexp[<li>
> ]butfirst[1]splitregexp[</li>]butlast[1]regexp<realchars>]">
>
> </$list>
> </$vars>
>
> Input:
>
> More text here
> <li>line 3</li>
> <li>line 2</li>
> <li>line 1</li>
> More text there
>
> Output
>
>
> line 3 <https://tiddlywiki.com/#line%203>
> line 2 <https://tiddlywiki.com/#line%202>
> line 1 <https://tiddlywiki.com/#line%201>
>
>
>
> Good luck!
>
> On Thursday, August 22, 2019 at 2:21:34 AM UTC-7, TonyM wrote:
>>
>> Jeremy,
>>
>> You are aware I do not want so much to parse it as locate the content
>> between matching tags.
>>
>> Its intention is to access content delimited by html tags inside the text
>> content.
>>
>> Perhaps we could use it to retrieve items between the section div tags or
>> all instances of text between the li tags.
>>
>> Regards
>> Tony
>>
>>
--
You received this message because you are subscribed to the Google Groups
"TiddlyWiki" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/tiddlywiki/fb977184-2b39-4666-a3b7-4a1100f51afb%40googlegroups.com.