https://bugs.documentfoundation.org/show_bug.cgi?id=98561

            Bug ID: 98561
           Summary: : Writer does not import certain information from a
                    webpage.
           Product: LibreOffice
           Version: 5.1.1.1 rc
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Writer
          Assignee: [email protected]
          Reporter: [email protected]

Writer does not copy certain information from a webpage.

Address of webpage:
http://www.electronicsandradios.com/page-used-icom-bc80.html


What was highlighted for copy as seen on web page:

ICOM BC-80 DROP IN CHARGER + ICOM 110 to 12 VOLT 1 Amp AC ADAPTER
for the ICOM IC-P2AT / IC-P2ET / IC-P3AT / IC-P4AET / IC-P4ET
for BATTERY PACKS: BP110 / BP111 / BP112 / BP113

This ICOM BC-80 drop in charger and AC power supply allows you to charge your
Icom IC-P2AT or similar while it is standing up in the cradle.

Works with the following Icom models: IC-P2AT / IC-P2ET / IC-P3AT / IC-P4AET/
IC-P4ET

Note that the above text was pasted into text edit and then pasted into writer,
in order to give the full text for this report.



What was copied from the web page and pasted into writer:
ICOM BC-
for the ICOM IC-
for BATTERY PACKS: BP110 / BP111 / BP112 / BP113
This ICOM BC-
Works with the following Icom models: IC-



This is the code that created the screen output:
     <table id="table_77" border="1" class="OBJ-8"
style="position:absolute;left:37px;top:371px;width:883px;height:170px;">
        <col style="width:505px;">
        <col style="width:378px;">
        <tr style="height:170px;">
          <td colspan="2" class="TC-1">
            <p class="Normal P-6"><span class="C-7">I<span class="C-8">COM
BC-<wbr>80 DROP IN CHARGER + ICOM 110 to 12 VOLT 1 Amp AC ADAPTER<br></span>for
the ICOM IC-<wbr>P2AT / IC-<wbr>P2ET / IC-<wbr>P3AT / IC-<wbr>P4AET /
IC-<wbr>P4ET<br>for BATTERY PACKS: BP110 / BP111 / BP112 / BP113</span></p>
            <p class="Normal P-7"><span class="C-7">This ICOM BC-<wbr>80 drop
in charger and AC power supply allows you to charge your Icom IC-<wbr>P2AT or
similar while it is standing up in the cradle.</span></p>
            <p class="Normal"><span class="C-8">Works with the following Icom
models:<span class="C-7"> IC-<wbr>P2AT / IC-<wbr>P2ET / IC-<wbr>P3AT /
IC-<wbr>P4AET/ IC-<wbr>P4ET<br><br></span></span></p>
          </td>
        </tr>
      </table>

The problem appears to be with the <wbr> tag. Was the html parser in writer
updated for this element in HTML5?


Definition and Usage
The <wbr> (Word Break Opportunity) tag specifies where in a text it would be ok
to add a line-break.
Tip: When a word is too long, or you are afraid that the browser will break
your lines at the wrong place, you can use the <wbr> element to add word break
opportunities.
Source: http://www.w3schools.com/TAgs/tag_wbr.asp


Another newly introduced HTML5 element is the <wbr> tag which is used to
specify a break point in a very long word or string. 
The main difference between <wbr> tag and <br> tag is that <br> element forces
to break a line. On the other hand, <wbr> element breaks a line at the
specified break points only if required.
http://www.html5tutorial4u.com/wbr-element.htm

-- 
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs

Reply via email to