On Nov 10, 5:38 pm, "Edward K. Ream" <[email protected]> wrote:

> Here is the test case, boiled down to its essence from data.html::
>
>     <td><a href="1">Standards</a> <a href="2">Fees</a></td>

[snip]

> The problem is that there seems to be no way to "pull" the whitespace into 
> either "class" (element)

Not sure why I was so pessimistic.

It should be possible to extend the first <a> element so that it
contain the troublesome space.  More generally, every element could
contain everything up to the next (class-like) element.  That is,
every element that gets its own node will contain everything up to the
next element in the import_html_tags list.  Doing this should be
straightforward.

> One idea, that might work is to special case single-line elements.

The problem with this is that html lines can be very long.  I suppose
the code could special-case single-line elements that are shorter than
some cutoff, say 80 characters, but that adds more complexity.

So for now I'll just see if extending elements will work.

EKR

-- 
You received this message because you are subscribed to the Google Groups 
"leo-editor" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/leo-editor?hl=en.

Reply via email to