On Oct 28, 2:12 pm, "Edward K. Ream" <[email protected]> wrote:
> On Oct 28, 3:49 am, zpcspm <[email protected]> wrote:
>
> > Is there a leo plugin that would automatically solve this by being
> > able to import HTML code into a tree of HTML tags - one node per tag,
> > inner nodes for inner tags? Something like this:
>
> Leo's import command works with html files.  It will generate nodes
> for the tags mentioned in the setting:
>
>     @data import_xml_tags
>
> Edward

1. Why does this setting list each tag twice (uppercase and
lowercase)? I know that XML tags are case-sensitive, but HTML tags are
not. Having to enter each HTML tag in both cases is slightly
inconvenient.
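
As a stopgap, the doubled list could be generated instead of typed by hand. This is only a minimal sketch: it assumes the setting accepts one tag name per line, which I have not verified, and `both_cases` is a hypothetical helper, not part of Leo.

```python
def both_cases(tags):
    """Return each tag in lowercase and uppercase, in order, without duplicates."""
    seen = set()
    result = []
    for tag in tags:
        for variant in (tag.lower(), tag.upper()):
            if variant not in seen:
                seen.add(variant)
                result.append(variant)
    return result


# Print one tag per line, ready to paste into the setting's body
# (assumed format: one tag name per line).
print("\n".join(both_cases(["html", "body", "div", "table"])))
```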

2. Could leoSettings.leo include an alternative value for this setting
(commented out, so it won't interfere with the default one) that
contains all HTML tags? I can probably look this list up, but I'm
confused by the mismatch between the setting's name (it says XML) and
its default tags (HTML ones!). Maybe it would make sense to have a
separate setting for HTML tags? I understand it would add redundancy,
but HTML is a very common special case.

3. I've tried to import a file and got some warnings:

warning: intermixed blanks and tabs
warning: mismatch in leading whitespace
first mismatched line at line 95
original line:          <DIV class="webonly">

generated line:     <DIV class="webonly">

error: import command did not import @file data.html perfectly
first mismatched line: 272
 270 u'\t\t\t\n'
 270 u'    \t\t\t\n'
 271 u'\n'
 271 u'\n'
 272 u'\t\t<table width="619" border="0" cellspacing="0"
cellpadding="0">\t<tr valign="top">\t<td width="377">\t<!-- View First
part -->\n'
 272 u'            <table width="619" border="0" cellspacing="0"
cellpadding="0">\\\\-4<tr valign="top">\t<td width="377">\t<!-- View
First part -->\n'
 273 u'\n'
 273 u'\n'
 274 u'\n'
 274 u'\n'
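
To separate the leading-whitespace warnings from the real error, the two versions can be compared with and without their indentation. A minimal sketch follows; `first_mismatch` is a hypothetical helper and not Leo's actual check, and the sample lines are shortened from the log above.

```python
def first_mismatch(original, generated, ignore_leading_ws=False):
    """Return the 1-based number of the first differing line, or None if equal."""
    for number, (a, b) in enumerate(zip(original, generated), start=1):
        if ignore_leading_ws:
            # Drop leading blanks and tabs only; keep the rest of the line intact.
            a, b = a.lstrip(" \t"), b.lstrip(" \t")
        if a != b:
            return number
    if len(original) != len(generated):
        # One list is a prefix of the other: the first extra line is the mismatch.
        return min(len(original), len(generated)) + 1
    return None


original = ["\t\t\t\n", "\n", "\t\t<table>\t<tr>\n"]
generated = ["    \t\t\t\n", "\n", "            <table>\\\\-4<tr>\n"]

print(first_mismatch(original, generated))                          # strict comparison
print(first_mismatch(original, generated, ignore_leading_ws=True))  # indentation ignored
```

Even with indentation ignored, the third sample line still differs, because the tab inside the line was replaced by a backslash sequence. That suggests the mismatch at line 272 is not only a leading-whitespace issue.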

4. The import algorithm for HTML doesn't seem to be perfect. For
instance, I have a <div> node with the following body:

</TD></TR><TR><TD class="disclaimer"><div align="center">(as of the
previous business day)</div>

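This looks wrong: the body starts with closing </TD></TR> tags and
opens a new <TR><TD> that is never closed, so the node does not
correspond to a single <div> element.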
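
The imbalance in that body can be confirmed with a naive stack check. This is a sketch using Python 3's standard html.parser module, not the way Leo's importer works; `BalanceChecker` and `check_balance` are hypothetical names.

```python
from html.parser import HTMLParser


class BalanceChecker(HTMLParser):
    """Track open tags on a stack and record closing tags that have no partner.

    Naive on purpose: void elements such as <br> are not special-cased.
    """

    def __init__(self):
        super().__init__()
        self.stack = []             # tags opened and not yet closed
        self.unmatched_closes = []  # closing tags that match no open tag

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        if self.stack and self.stack[-1] == tag:
            self.stack.pop()
        else:
            self.unmatched_closes.append(tag)


def check_balance(fragment):
    """Return (unmatched closing tags, tags left open) for an HTML fragment."""
    checker = BalanceChecker()
    checker.feed(fragment)
    checker.close()
    return checker.unmatched_closes, checker.stack


body = ('</TD></TR><TR><TD class="disclaimer"><div align="center">'
        '(as of the previous business day)</div>')
print(check_balance(body))
```

For the body above this reports two closing tags without an opening partner and two tags left open, so the node cannot be a self-contained element.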
