[Bug 59748] [PATCH] Remove use of obsolete Hashtable class

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59748

--- Comment #1 from Javen O'Neal  ---
Looks okay to me.

We should add key-value generic type when missing.
Public methods should return a new map or wrap the map with
Collections.unmodifiableMap.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



Jenkins build is back to normal : POI #1390

2016-06-23 Thread Apache Jenkins Server
See 


-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



Jenkins build is back to normal : POI-JDK8 #1082

2016-06-23 Thread Apache Jenkins Server
See 


-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



Re: [PATCH] Use of Hashtable in POI codebase

2016-06-23 Thread Nick Burch

On Thu, 23 Jun 2016, Axel Howind wrote:

Nick Burch  gagravarr.org> writes:

Patches to the list tend to get lost. Any chance you could either open a
bugzilla entry and attach it there, or open it as a new pull request on
github?


I have created https://bz.apache.org/bugzilla/show_bug.cgi?id=59748 with an
updated patch.


Thanks! Looks good at first glance. If someone has time to review it fully 
and apply, great! If not, I'll apply it in a few days, allowing for lazy 
consensus


Nick

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59748] [PATCH] Remove use of obsolete Hashtable class

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59748

Axel Howind  changed:

   What|Removed |Added

 CC||a...@dua3.com

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59748] New: [PATCH] Remove use of obsolete Hashtable class

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59748

Bug ID: 59748
   Summary: [PATCH] Remove use of obsolete Hashtable class
   Product: POI
   Version: 3.15-dev
  Hardware: All
OS: All
Status: NEW
  Severity: enhancement
  Priority: P2
 Component: POI Overall
  Assignee: dev@poi.apache.org
  Reporter: a...@dua3.com

Created attachment 33981
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33981=edit
Replace Hashtable by HashMap, LinkedHashMap

Current development version still uses Hashtable in a number of places. The
attached patch replaces instaniaions of Hashtable with HashMap. he only example
is ContentType.java where LinkedHashMap is used instead because a unit test
implicitly relies on the order entries are retrieved from a map.

See the discussion on poidev
http://thread.gmane.org/gmane.comp.jakarta.poi.devel/33069/focus=33070 for
details.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



Build failed in Jenkins: POI #1389

2016-06-23 Thread Apache Jenkins Server
See 

Changes:

[nick] Changelog update

[nick] #59746 XSSF support for files from certain alternate tools where the row 
XML is missing the row number

[nick] Disabled unit test for bug #59746, test file from Rob Owen

--
Started by an SCM change
[EnvInject] - Loading node environment variables.
Building remotely on ubuntu-6 (docker Ubuntu ubuntu yahoo-not-h2) in workspace 

Updating http://svn.apache.org/repos/asf/poi/trunk at revision 
'2016-06-23T20:34:20.885 +'
AUtest-data/spreadsheet/59746_NoRowNums.xlsx
U src/integrationtest/org/apache/poi/TestAllFiles.java
U 
src/ooxml/testcases/org/apache/poi/xssf/streaming/TestSXSSFWorkbook.java
U src/ooxml/testcases/org/apache/poi/xssf/usermodel/TestXSSFBugs.java
U src/ooxml/java/org/apache/poi/xssf/usermodel/XSSFRow.java
U src/ooxml/java/org/apache/poi/xssf/streaming/SXSSFWorkbook.java
Fetching 'https://svn.apache.org/repos/asf/poi/site/forrest.properties' at -1 
into '
At revision 1750016

Fetching 'https://svn.apache.org/repos/asf/poi/site/src/documentation' at -1 
into '
U src/documentation/content/xdocs/status.xml
At revision 1750016

At revision 1750016

No changes for http://svn.apache.org/repos/asf/poi/site/forrest.properties 
since the previous build
[EnvInject] - Executing scripts and injecting environment variables after the 
SCM step.
[EnvInject] - Injecting as environment variables the properties content 
LANG=en_US.UTF-8

[EnvInject] - Variables injected successfully.
[POI] $ /bin/bash -xe /tmp/hudson628047550877973195.sh
+ svn status
X   forrest.properties
X   src/documentation

Performing status on external item at 'src/documentation':
+ exit 0
[POI] $ /home/jenkins/tools/ant/latest/bin/ant -Dcoverage.enabled=true clean 
jenkins
#
# There is insufficient memory for the Java Runtime Environment to continue.
# Cannot create GC thread. Out of system resources.
# An error report file with more information is saved as:
# 
Build step 'Invoke Ant' marked build as failure
[FINDBUGS] Skipping publisher since build result is FAILURE
[WARNINGS] Skipping publisher since build result is FAILURE
Archiving artifacts
Compressed 66.31 MB of artifacts by 65.3% relative to #1388
Recording test results
ERROR: Step ?Publish JUnit test result report? failed: No test report files 
were found. Configuration error?


-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



Build failed in Jenkins: POI-JDK8 #1081

2016-06-23 Thread Apache Jenkins Server
See 

Changes:

[nick] Changelog update

[nick] #59746 XSSF support for files from certain alternate tools where the row 
XML is missing the row number

[nick] Disabled unit test for bug #59746, test file from Rob Owen

--
[...truncated 3552 lines...]
[junit] Reading spreadsheet/Themes.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/Themes2.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/Themes2.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/Themes2.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/TwoSheetsNoneHidden.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/TwoSheetsNoneHidden.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/TwoSheetsNoneHidden.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/TwoSheetsOneHidden.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/TwoSheetsOneHidden.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/TwoSheetsOneHidden.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/UncalcedRecord.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/UnionPtg.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WORKBOOK_in_capitals.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WeekNumFunctionTestCaseData.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WeekNumFunctionTestCaseData2013.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithChart.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithChart.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithChart.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithChartSheet.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithChartSheet.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithCheckBoxes.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithConditionalFormatting.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithConditionalFormatting.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithConditionalFormatting.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithDrawing.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithDrawing.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithEmbeddedObjects.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithEmbeded.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithEmbeded.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithExtendedStyles.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithFormattedGraphTitle.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithHyperlink.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithMoreVariousData.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithMoreVariousData.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithTable.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithTable.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithTextBox.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithTextBox.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithTextBox2.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithTextBox2.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithThreeCharts.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithThreeCharts.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithThreeCharts.xlsx with class 
org.apache.poi.stress.OPCFileHandler
[junit] Reading spreadsheet/WithTwoCharts.xls with class 
org.apache.poi.stress.HSSFFileHandler
[junit] Reading spreadsheet/WithTwoCharts.xlsx with class 
org.apache.poi.stress.XSSFFileHandler
[junit] Reading spreadsheet/WithTwoCharts.xlsx 

[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

Nick Burch  changed:

   What|Removed |Added

 Status|NEEDINFO|RESOLVED
 Resolution|--- |FIXED

--- Comment #7 from Nick Burch  ---
Hopefully fixed in r1749971 - I can now read back the values the same as shown
in OpenOffice for the file

Do let us know if there are other issues with these files due to them having
missed other parts of the XML out though!

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

--- Comment #6 from Rob  ---
Created attachment 33980
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33980=edit
Smaller file that does not work

Here is a smaller file. Software did not make it easy so it might be slightly
biger than you were hoping

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59738] Excel Files generated using XSSFWorkbook can't be opened using Ms-Excel or OpenOffice

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59738

--- Comment #9 from Adrodoc55  ---
(In reply to Dominik Stadler from comment #8)
> Where can the project be accessed? You can also zip it up and attach it here
> if you want.

Ups I forgot to post the link: https://github.com/Adrodoc55/poi-xalan-bug

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59747] xlsx file does not conform to bit patterns used by common file type detection software

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59747

--- Comment #2 from Mark Murphy  ---
Seems to me, those tools that rely on a specific file order within an archive
have a design flaw, that is, they rely on a specific file order within the
archive. Apparently Tika does not have that issue, but anything that does will
have an issue if Excel ever changes the order in which it writes files to the
xlsx archive. It apparently doesn't care what the order is, so there is no
guarantee the order will remain the same in future versions of the product.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59747] xlsx file does not conform to bit patterns used by common file type detection software

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59747

Nick Burch  changed:

   What|Removed |Added

 OS||All

--- Comment #1 from Nick Burch  ---
Apart from a handful of formats (eg those which require a mimetypes file that's
uncompressed as the first entry in the zip), reliably detecting container
formats can only be done by opening up the container itself

Apache Tika ships with a special detector for zip-based container formats for
this very reason!

(Tika also, on trunk, correctly detects POI-generated OOXML files as OOXML from
mime magic only)

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59747] New: xlsx file does not conform to bit patterns used by common file type detection software

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59747

Bug ID: 59747
   Summary: xlsx file does not conform to bit patterns used by
common file type detection software
   Product: POI
   Version: 3.14-FINAL
  Hardware: PC
Status: NEW
  Severity: normal
  Priority: P2
 Component: XSSF
  Assignee: dev@poi.apache.org
  Reporter: dominik.ma...@ecclesia.de

Hi,

I'm creating this bug due to a problem we've encountered with POI generated
xlsx files.

Apparently the order of zip entries in xlsx files is important for tools which
determine the file type be matching a byte pattern. See for example Apache Tika
(without deeper OOXML support library) and linux's file command.

The OOXML spec and Excel have no problem with POI files but tools relying on a
certain pattern have.

Here the output of unzip -l on a POI xlsx file:

Archive:  poi.xlsx
  Length Date   TimeName
    
  591  02.06.16 12:40   _rels/.rels
 1063  02.06.16 12:40   [Content_Types].xml
  183  02.06.16 12:40   docProps/app.xml
  437  02.06.16 12:40   docProps/core.xml
  137  02.06.16 12:40   xl/sharedStrings.xml
  818  02.06.16 12:40   xl/styles.xml
  349  02.06.16 12:40   xl/workbook.xml
  569  02.06.16 12:40   xl/_rels/workbook.xml.rels
  670  02.06.16 12:40   xl/worksheets/sheet1.xml
    ---
 4817   9 files

And for a native file:

Archive:  excel.xlsx
  Length Date   TimeName
    
 1032  01.01.80 00:00   [Content_Types].xml
  588  01.01.80 00:00   _rels/.rels
  557  01.01.80 00:00   xl/_rels/workbook.xml.rels
  906  01.01.80 00:00   xl/workbook.xml
 1542  01.01.80 00:00   xl/styles.xml
 6790  01.01.80 00:00   xl/theme/theme1.xml
 1306  01.01.80 00:00   xl/worksheets/sheet1.xml
  593  01.01.80 00:00   docProps/core.xml
  816  01.01.80 00:00   docProps/app.xml
    ---
14130   9 files

According to linux file and Tika they seem to expect [Content_Types].xml as the
first entry, skip the second and look for a "xl/" in the third entry.

Would it be possible to fix the order of the entries?

We've written a simple post processing tool which rewrites the zip file but
would be happy to have this in POI proper.

Thanks and contact me if I can help.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59738] Excel Files generated using XSSFWorkbook can't be opened using Ms-Excel or OpenOffice

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59738

--- Comment #8 from Dominik Stadler  ---
Where can the project be accessed? You can also zip it up and attach it here if
you want.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

--- Comment #5 from Rob  ---
Thanks for the rapid follow up. I will find a smaller file and attached it ASAP

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

Nick Burch  changed:

   What|Removed |Added

 Status|NEW |NEEDINFO
 OS||All

--- Comment #4 from Nick Burch  ---
The problem is that all the rows in your file neglect to say which one they
are!

In a normal file, the sheet xml should be like:

 

  
Destination
  
  
Dial Code
  


  
Abkhazia - Fixed
  
  
7840
  

Where each row indicates the row number. However, in your files, the row
numbers are missing:

  

  

  Checked

  
  

  Ion

  


  
1
  
  

  [M+H]+1

  


My guess is that Excel assumes that a row with no row number should be treated
as the next row, and if all have no row number start from row 1.


Any chance you could produce a much smaller problematic excel file from the
other software, perhaps with just a few 10s of lines, which we could use for
unit testing and a possible fix?

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59738] Excel Files generated using XSSFWorkbook can't be opened using Ms-Excel or OpenOffice

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59738

--- Comment #7 from Adrodoc55  ---
(In reply to Andreas Beeker from comment #5)
> First of all, this sounded too fantastic, that all versions since 3.11 can't
> generate usual .xlsx files - ... and actually, I've just tried the
> businessplan with LO/Excel/ExcelViewer, just to be sure ...
> 
> To investigate this we could either look what's wrong with the gradle build
> or maybe we can find the error, if you attach your resulting businessplan
> .xlsx of POI 3.10 and 3.14.
> 
> My guess is, that there are either duplicates jars in the classpath /
> dependencies or the wrong ooxml-schemas (ooxml-schemas-1.3.jar for POI 3.14
> or later, ooxml-schemas-1.1.jar for POI 3.7 up to POI 3.13).
> 
> Can you package your example application and list the dependencies?
> 
> (maybe we can spot something like a wrong version of
> xerces/saxon/xmlbeans/xmlbeans-xpath ..)

Thanks, you were right :)
I was able to track down the problem to my xalan library. I had
'xalan:xalan:2.4.0' in my classpath, because that is used in our application
server. Luckily I don't actually need that dependency and can just remove it.
I still created a git repository containing everything to create corrupted xlsx
files, if you want to investigate. All you have to do is open a command line
and run 'gradlew run' or './gradlew run' to create a new businessplan.xlsx.
A corupted one is already checked into the repo aswell.
Please tell me when you no longer need the repo so I can delete it.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

--- Comment #3 from Rob  ---
Created attachment 33979
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33979=edit
Full data saved by excel

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

--- Comment #2 from Rob  ---
Created attachment 33978
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33978=edit
Full data that will not open

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

--- Comment #1 from Rob  ---
Created attachment 33977
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=33977=edit
Suporting docs with xlsx files

I cannot attached the larger files as it goes over the allowed file size.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



[Bug 59746] New: Unable to correctly read data in XLSX file

2016-06-23 Thread bugzilla
https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

Bug ID: 59746
   Summary: Unable to correctly read data in XLSX file
   Product: POI
   Version: 3.14-FINAL
  Hardware: PC
Status: NEW
  Severity: normal
  Priority: P2
 Component: XSSF
  Assignee: dev@poi.apache.org
  Reporter: robert.o...@pfizer.com

I have an excel xlsx file that was exported by some mass spec software
(ThemoScientific I think) that looks to contain hierarchical or grouped data.
There are two examples of these files in the attached, one that is the full
export and one where only the top level data is exported. If you try to open
these files using POI, the file opens correctly, but the lastRowNum for the
only sheet is -1 (example code is also in the attachment). However, excel can
open the file and if you immediately hit save on the file (no changes needed)
and then try to open with POI, the file opens and the correct number of rows
are found. 

Looking at the XLM for the saved file, there are definitely changes that excel
has made, but I am not sure where POI is failing in parsing the original XML.
Not sure where the problem lies, but I have found a python package
(https://pypi.python.org/pypi/xlrd) that will correctly read the files that POI
is unable to read, so the XML does not appear to be completely nonstandard (or
only one that excel itself can read). 

This appears still be a problem with 3.15-beta1 as well.

-- 
You are receiving this mail because:
You are the assignee for the bug.

-
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org



Re: 3.15 beta 2 soon?

2016-06-23 Thread Dominik Stadler
Hi,

I think my scripts may do a bit more with the POI classes, like
copying/drawing/writing/shifting/formating/evaluating, ... this covers more
of the functionality and thus shows more things that can break after
changes.

Unfortunately I did not manage to get it to run on the VM yet to see how my
stuff looks like with the different corpus that is available there, but
that is still on the plan!

Dominik.

On Thu, Jun 23, 2016 at 4:08 AM, Allison, Timothy B. 
wrote:

> Your corpus has far more diversity than mine... :)
>
> I found only a very small handful of new exceptions aside from 59739, and
> those are probably caused by changes in Tika:
>
> java.lang.IllegalArgumentException
> at
> org.apache.poi.openxml4j.opc.PackagingURIHelper.resolvePartUri(PackagingURIHelper.java:427)
> at
> org.apache.poi.openxml4j.opc.PackageRelationship.getTargetURI(PackageRelationship.java:206)
> at
> o.a.t.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.addDrawingHyperLinks(XSSFExcelExtractorDecorator.java:174)
>
> There are a few handfuls of files with markedly less content...I'll
> diagnose tomorrow.
>
> -Original Message-
> From: Dominik Stadler [mailto:dominik.stad...@gmx.at]
> Sent: Wednesday, June 22, 2016 3:23 PM
> To: POI Developers List 
> Subject: Re: 3.15 beta 2 soon?
>
> Hi,
>
> I did another run in the meantime with an intermediate fix for the
> HWPF-Version, see http://people.apache.org/~centic/poi_regression/reports/
> ,
> the following pops up now a few times:
>
> java.lang.NullPointerException
> at
> o.a.p.xslf.usermodel.XSLFTextParagraph.getDefaultFontSize(XSLFTextParagraph.java:953)
> at
> o.a.p.sl.draw.DrawTextParagraph.getAttributedString(DrawTextParagraph.java:534)
> at
> o.a.p.sl.draw.DrawTextParagraph.breakText(DrawTextParagraph.java:235)
> at
> o.a.p.sl.draw.DrawTextShape.drawParagraphs(DrawTextShape.java:149)
> at
> o.a.p.sl.draw.DrawTextShape.getTextHeight(DrawTextShape.java:210)
> at o.a.p.sl.draw.DrawTextShape.drawContent(DrawTextShape.java:86)
> at o.a.p.sl.draw.DrawSimpleShape.draw(DrawSimpleShape.java:93)
> at o.a.p.sl.draw.DrawSheet.draw(DrawSheet.java:67)
> at o.a.p.sl.draw.DrawSlide.draw(DrawSlide.java:39)
> at o.a.p.xslf.usermodel.XSLFSlide.draw(XSLFSlide.java:301)
> at
> o.a.p.stress.SlideShowHandler.renderSlides(SlideShowHandler.java:120)
> at
> o.a.p.stress.SlideShowHandler.handleSlideShow(SlideShowHandler.java:43)
> at o.a.p.stress.XSLFFileHandler.handleFile(XSLFFileHandler.java:43)
>
> and a few others once or twice, although I am not sure about those, some
> documents cause OOMs and these can cause concurrently running tests to run
> into timeouts and run through in the following run...
>
> Dominik.
>
>
>