Author: onealj
Date: Mon Jul 10 01:19:50 2017
New Revision: 1801403
URL: http://svn.apache.org/viewvc?rev=1801403&view=rev
Log:
Microsoft Publisher binary format is undocumented
Modified:
poi/site/publish/hpbf/file-format.html
poi/site/publish/hpbf/file-format.xml
poi/site/publish/hpbf/index.html
poi/site/src/documentation/content/xdocs/hpbf/file-format.xml
poi/site/src/documentation/content/xdocs/hpbf/index.xml
Modified: poi/site/publish/hpbf/file-format.html
URL:
http://svn.apache.org/viewvc/poi/site/publish/hpbf/file-format.html?rev=1801403&r1=1801402&r2=1801403&view=diff
==============================================================================
--- poi/site/publish/hpbf/file-format.html (original)
+++ poi/site/publish/hpbf/file-format.html Mon Jul 10 01:19:50 2017
@@ -76,23 +76,18 @@ if (VERSION > 3) {
<div class="menuBar">
<div class="menu">
<span class="menuLabel">Apache POI</span>
-
<div class="menuItem">
<a href="../index.html">Top</a>
</div>
-
</div>
<div class="menu">
<span class="menuLabel">HPBF</span>
-
<div class="menuItem">
<a href="index.html">Overview</a>
</div>
-
<div class="menuItem">
<a href="file-format.xml">File Format</a>
</div>
-
</div>
</div>
</div>
@@ -333,7 +328,7 @@ PL 62 1a 00 00 48 00 00 00 // PL fro
unsigned ints for each entry.</p>
<p>Type 12 holds hyperlinks, and is very much more complex.
- See <span
class="codefrag">org.apache.poi.hpbf.model.qcbits.QCPLCBit</span>
+ See <a
href="https://svn.apache.org/viewvc/poi/trunk/src/scratchpad/src/org/apache/poi/hpbf/model/qcbits/QCPLCBit.java?view=markup"><span
class="codefrag">org.apache.poi.hpbf.model.qcbits.QCPLCBit</span></a>
for our best guess as to how the contents match up.</p>
Modified: poi/site/publish/hpbf/file-format.xml
URL:
http://svn.apache.org/viewvc/poi/site/publish/hpbf/file-format.xml?rev=1801403&r1=1801402&r2=1801403&view=diff
==============================================================================
--- poi/site/publish/hpbf/file-format.xml (original)
+++ poi/site/publish/hpbf/file-format.xml Mon Jul 10 01:19:50 2017
@@ -187,7 +187,7 @@ PL 62 1a 00 00 48 00 00 00 // PL fro
<p>Type 8 has 7 2 byte unsigned ints, then a pair of 4 byte
unsigned ints for each entry.</p>
<p>Type 12 holds hyperlinks, and is very much more complex.
- See <code>org.apache.poi.hpbf.model.qcbits.QCPLCBit</code>
+ See <link
href="https://svn.apache.org/viewvc/poi/trunk/src/scratchpad/src/org/apache/poi/hpbf/model/qcbits/QCPLCBit.java?view=markup"><code>org.apache.poi.hpbf.model.qcbits.QCPLCBit</code></link>
for our best guess as to how the contents match up.</p>
</section>
</section>
Modified: poi/site/publish/hpbf/index.html
URL:
http://svn.apache.org/viewvc/poi/site/publish/hpbf/index.html?rev=1801403&r1=1801402&r2=1801403&view=diff
==============================================================================
--- poi/site/publish/hpbf/index.html (original)
+++ poi/site/publish/hpbf/index.html Mon Jul 10 01:19:50 2017
@@ -141,7 +141,7 @@ if (VERSION > 3) {
(now done), and be able to extract hyperlinks from within
the document (partly supported). Additional low level
code to process the file format may follow, if there
- is demand and developer interest warrant it.</p>
+ is demand and developer interest warrants it.</p>
<p>Text Extraction is available via the
<em>org.apache.poi.hpbf.extractor.PublisherTextExtractor</em>
@@ -153,6 +153,15 @@ if (VERSION > 3) {
<p>Our current understanding of the file format is documented
<a href="file-format.html">here</a>.</p>
+
+<p>As of 2017, we are unaware of a public format specification for
+ Microsoft Publisher .pub files. This format was not included in
+ the Microsoft Open Specifications Promise with the rest of the
+ Microsoft Office file formats.
+ As of <a
href="https://social.msdn.microsoft.com/Forums/en-US/63dc6c4e-d6b2-4873-97dd-139ddb304e24/what-about-publisher-file-format?forum=os_binaryfile">2009</a>
and <a
href="https://social.msdn.microsoft.com/Forums/en-US/a5f55c72-5378-4dc9-944a-9973a12bfaa7/reading-viso-vsdfiles-and-publisher-pubfiles-without-office?forum=os_binaryfile">2016</a>,
Microsoft had no plans to document the .pub file format.
+ If this changes in the future, perhaps we will see a spec
published
+ on the <a
href="https://msdn.microsoft.com/en-us/library/cc313105(v=office.12).aspx">Microsoft
Office File Format Open Specification Technical Documentation</a>.
+ </p>
<div class="frame note">
Modified: poi/site/src/documentation/content/xdocs/hpbf/file-format.xml
URL:
http://svn.apache.org/viewvc/poi/site/src/documentation/content/xdocs/hpbf/file-format.xml?rev=1801403&r1=1801402&r2=1801403&view=diff
==============================================================================
--- poi/site/src/documentation/content/xdocs/hpbf/file-format.xml (original)
+++ poi/site/src/documentation/content/xdocs/hpbf/file-format.xml Mon Jul 10
01:19:50 2017
@@ -189,7 +189,7 @@ PL 62 1a 00 00 48 00 00 00 // PL fro
<p>Type 8 has 7 2 byte unsigned ints, then a pair of 4 byte
unsigned ints for each entry.</p>
<p>Type 12 holds hyperlinks, and is very much more complex.
- See <code>org.apache.poi.hpbf.model.qcbits.QCPLCBit</code>
+ See <link
href="https://svn.apache.org/viewvc/poi/trunk/src/scratchpad/src/org/apache/poi/hpbf/model/qcbits/QCPLCBit.java?view=markup"><code>org.apache.poi.hpbf.model.qcbits.QCPLCBit</code></link>
for our best guess as to how the contents match up.</p>
</section>
</section>
Modified: poi/site/src/documentation/content/xdocs/hpbf/index.xml
URL:
http://svn.apache.org/viewvc/poi/site/src/documentation/content/xdocs/hpbf/index.xml?rev=1801403&r1=1801402&r2=1801403&view=diff
==============================================================================
--- poi/site/src/documentation/content/xdocs/hpbf/index.xml (original)
+++ poi/site/src/documentation/content/xdocs/hpbf/index.xml Mon Jul 10 01:19:50
2017
@@ -44,7 +44,7 @@
(now done), and be able to extract hyperlinks from within
the document (partly supported). Additional low level
code to process the file format may follow, if there
- is demand and developer interest warrant it.</p>
+ is demand and developer interest warrants it.</p>
<p>Text Extraction is available via the
<em>org.apache.poi.hpbf.extractor.PublisherTextExtractor</em>
class.</p>
@@ -53,6 +53,14 @@
the file, but by no means all of it.</p>
<p>Our current understanding of the file format is documented
<link href="file-format.html">here</link>.</p>
+ <p>As of 2017, we are unaware of a public format specification for
+ Microsoft Publisher .pub files. This format was not included in
+ the Microsoft Open Specifications Promise with the rest of the
+ Microsoft Office file formats.
+ As of <link
href="https://social.msdn.microsoft.com/Forums/en-US/63dc6c4e-d6b2-4873-97dd-139ddb304e24/what-about-publisher-file-format?forum=os_binaryfile">2009</link>
and <link
href="https://social.msdn.microsoft.com/Forums/en-US/a5f55c72-5378-4dc9-944a-9973a12bfaa7/reading-viso-vsdfiles-and-publisher-pubfiles-without-office?forum=os_binaryfile">2016</link>,
Microsoft had no plans to document the .pub file format.
+ If this changes in the future, perhaps we will see a spec
published
+ on the <link
href="https://msdn.microsoft.com/en-us/library/cc313105(v=office.12).aspx">Microsoft
Office File Format Open Specification Technical Documentation</link>.
+ </p>
<note>
This code currently lives the
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]