[ 
https://issues.apache.org/jira/browse/TIKA-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16669974#comment-16669974
 ] 

ASF GitHub Bot commented on TIKA-2766:
--------------------------------------

jtbdevelopment opened a new pull request #256: TIKA-2766 - be able to get raw 
values from excel
URL: https://github.com/apache/tika/pull/256
 
 
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> Be able to extract raw values from excel, not formatted
> -------------------------------------------------------
>
>                 Key: TIKA-2766
>                 URL: https://issues.apache.org/jira/browse/TIKA-2766
>             Project: Tika
>          Issue Type: Improvement
>          Components: core
>            Reporter: JTB Development
>            Priority: Minor
>
> By default, tika extracts excel values as formatted in the sheet.  It's a 
> fine default.
>  
> However, many times, I am asked to extract raw values as the nicely formatted 
> sheet for human eyes is losing precision.
>  
> In local instances, I've cloned the tika classes in order to do so, but it's 
> messy due to how the code is layered (i wind up extending/copying 3-4 classes 
> because chain of class construction). 
> I believe by adding a config option to the open office config class I can 
> implement same option much more cleanly. 
>  
> I plan to issue a pull request in few weeks (doing this contribute on the 
> side based on professional use)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to