[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14020025#comment-14020025 ]
Tim Allison commented on TIKA-1325: ----------------------------------- Looks like Fontbox/TTFDataStream isn't setting a timezone, so it defaults to "default" timezone. My nonstandard default is different from your standard default. :) {code} /** * Read an eight byte international date. * * @return An signed short. * @throws IOException If there is an error reading the data. */ public Calendar readInternationalDate() throws IOException { long secondsSince1904 = readLong(); GregorianCalendar cal = new GregorianCalendar(1904, 0, 1); long millisFor1904 = cal.getTimeInMillis(); millisFor1904 += (secondsSince1904 * 1000); cal.setTimeInMillis(millisFor1904); return cal; } {code} Should we comment out: {code} assertEquals("1904-01-01T00:00:00Z", metadata.get(Metadata.CREATION_DATE)); {code} > Move the font metadata definitions to properties > ------------------------------------------------ > > Key: TIKA-1325 > URL: https://issues.apache.org/jira/browse/TIKA-1325 > Project: Tika > Issue Type: Improvement > Components: metadata, parser > Affects Versions: 1.5, 1.6 > Reporter: Nick Burch > > As noticed while working on TIKA-1182, the AFM font parser has a bunch of > hard coded strings it uses as metadata keys, while the TTF font parser > doesn't have many > We should switch these to being proper Properties, with definitions from a > well known standard (+ compatibility fallbacks), and have both use largely > the same set -- This message was sent by Atlassian JIRA (v6.2#6252)