Author: tallison
Date: Thu Aug 8 14:50:15 2013
New Revision: 1511816
URL: http://svn.apache.org/r1511816
Log:
Tika 1139 update to 1129
Added:
tika/trunk/tika-core/src/test/resources/org/apache/tika/mime/test-tika-327.html
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-core/src
Author: tallison
Date: Thu Aug 8 18:09:39 2013
New Revision: 1511908
URL: http://svn.apache.org/r1511908
Log:
Tika 1124 not 1142...sorry
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testPDFEmbeddingAndEmbedded.docx
(with props)
Removed:
tika/trunk/tika-parsers
Author: tallison
Date: Thu Aug 15 01:59:26 2013
New Revision: 1514126
URL: http://svn.apache.org/r1514126
Log:
TIKA 1001 more flexible html meta-header encoding detector
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testHTMLNoisyMetaEncoding_1.html
tika/trunk/tika
Author: tallison
Date: Fri Aug 16 01:15:40 2013
New Revision: 1514551
URL: http://svn.apache.org/r1514551
Log:
TIKA-1153 upgrade PDFBox to 1.8.2
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewvc/tika
Author: tallison
Date: Thu Sep 19 13:55:28 2013
New Revision: 1524741
URL: http://svn.apache.org/r1524741
Log:
bumped poi to 3.10-beta2
Modified:
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/tika-parsers/pom.xml
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-parsers/pom.xml?rev
Author: tallison
Date: Thu Sep 26 15:25:19 2013
New Revision: 1526570
URL: http://svn.apache.org/r1526570
Log:
TIKA-792 fixed by POI-3.10-beta2; added test for missing ooxml bean
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testWORD_missing_ooxml_bean1.docx
Author: tallison
Date: Thu Sep 26 16:18:07 2013
New Revision: 1526593
URL: http://svn.apache.org/r1526593
Log:
commented out TIKA-792 test for now
Modified:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
Modified:
tika/trunk/tika-parsers
Author: tallison
Date: Fri Sep 27 18:55:31 2013
New Revision: 1527030
URL: http://svn.apache.org/r1527030
Log:
TIKA-1171 -- extra asterisks from master slide in PPT; added tests to TIKA-712
test files to show 1171 was fixed. Borrowed extraction code from POI
PowerPointExtractor
Modified
Author: tallison
Date: Fri Sep 27 19:38:03 2013
New Revision: 1527044
URL: http://svn.apache.org/r1527044
Log:
added 1130 to CHANGES.txt
Modified:
tika/trunk/CHANGES.txt
Modified: tika/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewvc/tika/trunk/CHANGES.txt?rev=1527044r1=1527043r2
Author: tallison
Date: Mon Dec 2 14:46:38 2013
New Revision: 1547037
URL: http://svn.apache.org/r1547037
Log:
TIKA-1200 upgrade pdfbox to 1.8.3
Modified:
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/tika-parsers/pom.xml
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-parsers
Author: tallison
Date: Fri Dec 13 13:20:43 2013
New Revision: 1550725
URL: http://svn.apache.org/r1550725
Log:
TIKA-973 reopened. Would prefer test docs unequivocally consistent with Apache
License 2.0. Deleted initial test docs from trunk and commented out test case.
Also added
Author: tallison
Date: Mon Jan 27 13:18:54 2014
New Revision: 1561665
URL: http://svn.apache.org/r1561665
Log:
TIKA-1226, removed println...doh.
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
Modified:
tika/trunk/tika-parsers/src/main/java/org
Author: tallison
Date: Mon Feb 3 20:11:10 2014
New Revision: 1564042
URL: http://svn.apache.org/r1564042
Log:
TIKA-1228: Look for attachments under Kids node if embeddedFiles.getNames()
returns null
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents
Author: tallison
Date: Tue Feb 11 12:08:26 2014
New Revision: 1567074
URL: http://svn.apache.org/r1567074
Log:
TIKA-1237 upgrade to poi-3.10-FINAL
Modified:
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/tika-parsers/pom.xml
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-parsers
Author: tallison
Date: Wed Feb 19 15:27:24 2014
New Revision: 1569788
URL: http://svn.apache.org/r1569788
Log:
got rid of brittle requirement for specific number of pdfs to be tested in
PDFParserTest
Modified:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/pdf
Author: tallison
Date: Thu Mar 6 16:52:19 2014
New Revision: 1574959
URL: http://svn.apache.org/r1574959
Log:
TIKA-1232: add fine-grained pdf version extraction
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testPDF_Version.10.x.pdf
(with props)
tika/trunk/tika
Author: tallison
Date: Fri Mar 7 01:27:41 2014
New Revision: 1575112
URL: http://svn.apache.org/r1575112
Log:
TIKA-1252 small clean up
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika
Author: tallison
Date: Fri Mar 7 01:57:26 2014
New Revision: 1575120
URL: http://svn.apache.org/r1575120
Log:
cleanup whitespace in OutlookPSTParser
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/mbox/OutlookPSTParser.java
Modified:
tika/trunk/tika-parsers/src
Author: tallison
Date: Fri Apr 11 01:48:48 2014
New Revision: 1586529
URL: http://svn.apache.org/r1586529
Log:
TIKA-1271: trivial refactoring of classes useful for testing embedded document
handling
Modified:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/TikaTest.java
tika
Author: tallison
Date: Wed Apr 16 18:04:20 2014
New Revision: 1588005
URL: http://svn.apache.org/r1588005
Log:
TIKA-1010 extract embedded documents from RTF
Added:
tika/trunk/tika-core/src/main/java/org/apache/tika/metadata/RTFMetadata.java
tika/trunk/tika-parsers/src/main/java/org
Author: tallison
Date: Mon May 12 14:46:16 2014
New Revision: 1593983
URL: http://svn.apache.org/r1593983
Log:
TIKA-1233: removed catch blocks after upgrade to PDFBOX-1.8.5; see PDFBOX-1803
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
Modified
Author: tallison
Date: Mon May 12 15:14:09 2014
New Revision: 1593996
URL: http://svn.apache.org/r1593996
Log:
TIKA-1231: added more null checks after underlying fix was made in PDFBox-1.8.5
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser
Author: tallison
Date: Thu May 15 15:50:18 2014
New Revision: 1594958
URL: http://svn.apache.org/r1594958
Log:
test doc actually added for r1594957 temporary bug fix until TIKA-1295 is
resolved
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testPDFTripleLangTitle.pdf
Author: tallison
Date: Thu May 15 14:36:24 2014
New Revision: 1594930
URL: http://svn.apache.org/r1594930
Log:
Ignore a test until TIKA-1298 is fixed
Modified:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java
Modified:
tika/trunk/tika-parsers/src/test
Author: tallison
Date: Fri May 23 17:11:28 2014
New Revision: 1597132
URL: http://svn.apache.org/r1597132
Log:
add license header to RTFObjDataParser and clean up whitespace in
RTFEmbObjHandler
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/rtf/RTFEmbObjHandler.java
Author: tallison
Date: Tue May 27 19:33:07 2014
New Revision: 1597856
URL: http://svn.apache.org/r1597856
Log:
TIKA-1294 add ability to turn off image extraction from PDFs
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-core/src/main/java/org/apache/tika/metadata
Author: tallison
Date: Thu May 29 14:37:25 2014
New Revision: 1598305
URL: http://svn.apache.org/r1598305
Log:
fix to TIKA-1294, uppercase enum
Modified:
tika/trunk/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
tika/trunk/tika-parsers/src/main/java/org/apache
Author: tallison
Date: Fri May 30 18:23:15 2014
New Revision: 1598693
URL: http://svn.apache.org/r1598693
Log:
TIKA-1305: make RTF list handling slightly more robust against corrupt list
metadata
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-parsers/src/main/java/org/apache/tika
Author: tallison
Date: Tue Jun 17 16:05:44 2014
New Revision: 1603208
URL: http://svn.apache.org/r1603208
Log:
TIKA-1341: fix double endDocument in PDFParser
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
tika/trunk/tika-parsers/src/test/java
Author: tallison
Date: Tue Jun 24 01:12:45 2014
New Revision: 1604989
URL: http://svn.apache.org/r1604989
Log:
TIKA-1352 upgrade to PDFBox 1.8.6
Modified:
tika/trunk/tika-parsers/pom.xml
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
tika/trunk/tika
Author: tallison
Date: Fri Aug 1 17:31:21 2014
New Revision: 1615174
URL: http://svn.apache.org/r1615174
Log:
TIKA-1380: staging an updated test file for the actual patch once POI
3.11-beta-1 is released
Modified:
tika/trunk/tika-parsers/src/test/resources/test-documents
Author: tallison
Date: Tue Aug 5 13:03:05 2014
New Revision: 1615923
URL: http://svn.apache.org/r1615923
Log:
TIKA-1275 upgrade Commons Compress to 1.8.1; updated CHANGES.txt, too
Modified:
tika/branches/1.6/CHANGES.txt
tika/branches/1.6/tika-parsers/pom.xml
Modified: tika/branches/1.6
Author: tallison
Date: Tue Aug 5 13:15:12 2014
New Revision: 1615926
URL: http://svn.apache.org/r1615926
Log:
TIKA-1275 upgrade commons compress to 1.8.1; updated CHANGES.txt, too
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/CHANGES.txt
URL
Author: tallison
Date: Tue Aug 5 18:17:39 2014
New Revision: 1615970
URL: http://svn.apache.org/r1615970
Log:
TIKA-1380; fix for null ole.getLabel()
Modified:
tika/branches/1.6/tika-app/src/test/java/org/apache/tika/cli/TikaCLITest.java
tika/branches/1.6/tika-parsers/src/main/java/org
Author: tallison
Date: Tue Aug 5 19:02:11 2014
New Revision: 1615980
URL: http://svn.apache.org/r1615980
Log:
TIKA-1380; fix cases where ole.getLabel() == null for ole attachments
Modified:
tika/trunk/tika-app/src/test/java/org/apache/tika/cli/TikaCLITest.java
tika/trunk/tika-parsers
Author: tallison
Date: Fri Sep 19 14:00:24 2014
New Revision: 1626221
URL: http://svn.apache.org/r1626221
Log:
TIKA-1418 add example for how to dump tika config; and add --config to CLI
Modified:
tika/trunk/tika-app/src/main/java/org/apache/tika/cli/TikaCLI.java
tika/trunk/tika-app/src
Author: tallison
Date: Fri Sep 19 14:10:20 2014
New Revision: 1626223
URL: http://svn.apache.org/r1626223
Log:
TIKA-1418 remove println...the horror.
Modified:
tika/trunk/tika-example/src/main/java/org/apache/tika/example/DumpTikaConfigExample.java
Modified:
tika/trunk/tika-example/src
Author: tallison
Date: Fri Sep 19 19:18:08 2014
New Revision: 1626300
URL: http://svn.apache.org/r1626300
Log:
TIKA-1329 add RecursiveParserWrapper
Added:
tika/trunk/tika-core/src/main/java/org/apache/tika/parser/RecursiveParserWrapper.java
tika/trunk/tika-core/src/main/java/org/apache
Author: tallison
Date: Wed Sep 24 12:58:56 2014
New Revision: 1627304
URL: http://svn.apache.org/r1627304
Log:
TIKA-1424: clear PDFont's resources after each document
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
Modified:
tika/trunk/tika
Author: tallison
Date: Wed Sep 24 13:10:10 2014
New Revision: 1627308
URL: http://svn.apache.org/r1627308
Log:
TIKA-1419: upgrade to PDFBox 1.8.7 and update CHANGES.txt for this and a few
recent changes
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-parsers/pom.xml
Modified: tika
Author: tallison
Date: Tue Sep 30 01:41:20 2014
New Revision: 1628350
URL: http://svn.apache.org/r1628350
Log:
TIKA-1433 : extract documents embedded within annotations in PDFs
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testPDFFileEmbInAnnotation.pdf
(with props
Author: tallison
Date: Wed Oct 1 14:35:46 2014
New Revision: 1628715
URL: http://svn.apache.org/r1628715
Log:
TIKA-1427, small clean up to ensure that inline image number tracks with
extracted file
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
Author: tallison
Date: Tue Oct 21 12:26:15 2014
New Revision: 1633357
URL: http://svn.apache.org/r1633357
Log:
clean up from TIKA-1311
Removed:
tika/trunk/tika-app/src/main/java/org/apache/tika/io/
Author: tallison
Date: Wed Oct 22 00:31:37 2014
New Revision: 1633499
URL: http://svn.apache.org/r1633499
Log:
TIKA-1451 add RecursiveParserWrapper output to CLI and GUI
Added:
tika/trunk/tika-app/src/test/resources/test-data/test_recursive_embedded.docx
(with props)
tika/trunk/tika
Author: tallison
Date: Thu Oct 23 15:45:20 2014
New Revision: 1633845
URL: http://svn.apache.org/r1633845
Log:
move pretty print metadata key sorter into standalone class
Modified:
tika/trunk/tika-serialization/src/main/java/org/apache/tika/metadata/serialization/JsonMetadataBase.java
Author: tallison
Date: Thu Oct 23 15:46:09 2014
New Revision: 1633846
URL: http://svn.apache.org/r1633846
Log:
move pretty print metadata key sorter into standalone class, with added
PrettyMetadataKeyComparator...argh
Added:
tika/trunk/tika-serialization/src/main/java/org/apache/tika
Author: tallison
Date: Mon Oct 27 17:00:03 2014
New Revision: 1634594
URL: http://svn.apache.org/r1634594
Log:
TIKA-1459 fix write limit bug in BasicContentHandlerFactory when creating a
BodyContentHandler
Added:
tika/trunk/tika-core/src/test/java/org/apache/tika/sax
Author: tallison
Date: Wed Oct 29 10:57:28 2014
New Revision: 1635097
URL: http://svn.apache.org/r1635097
Log:
cleanup tika-app pom, remove unnecessary gson dependency
Modified:
tika/trunk/tika-app/pom.xml
Modified: tika/trunk/tika-app/pom.xml
URL:
http://svn.apache.org/viewvc/tika/trunk
Author: tallison
Date: Mon Nov 10 14:15:22 2014
New Revision: 1637868
URL: http://svn.apache.org/r1637868
Log:
TIKA-1467: in PDFParser, move metadata set isEncrypted() to before decryption
step.
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
Author: tallison
Date: Fri Dec 19 02:07:04 2014
New Revision: 1646612
URL: http://svn.apache.org/r1646612
Log:
TIKA-1498: now actually add providers to cli...argh
Modified:
tika/trunk/tika-server/src/main/java/org/apache/tika/server/TikaServerCli.java
Modified:
tika/trunk/tika-server/src
Author: tallison
Date: Fri Dec 19 03:12:38 2014
New Revision: 1646616
URL: http://svn.apache.org/r1646616
Log:
TIKA-1497: add JSON and XMP output to tika-server's /meta
Added:
tika/trunk/tika-server/src/main/java/org/apache/tika/server/XMPMessageBodyWriter.java
Modified:
tika/trunk/tika
Author: tallison
Date: Fri Dec 19 03:13:56 2014
New Revision: 1646617
URL: http://svn.apache.org/r1646617
Log:
TIKA-1497: update changes.txt
Modified:
tika/trunk/CHANGES.txt
Modified: tika/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewvc/tika/trunk/CHANGES.txt?rev=1646617r1=1646616r2
Author: tallison
Date: Fri Feb 6 20:33:02 2015
New Revision: 1657952
URL: http://svn.apache.org/r1657952
Log:
TIKA-1542 substitute Apache friendly TTF test file for our current copyrighted
file, take 2. See PDFBOX-2383
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents
Author: tallison
Date: Fri Feb 6 02:26:22 2015
New Revision: 1657739
URL: http://svn.apache.org/r1657739
Log:
TIKA-1542 substitute Apache friendly TTF test file for our current copyrighted
file
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testTrueType2.ttf
Author: tallison
Date: Thu Jan 22 18:29:19 2015
New Revision: 1653994
URL: http://svn.apache.org/r1653994
Log:
TIKA-1526: initial fix for jvm bug that can affect users with a default Locale
of tr running on MACOSX or BSD. We still need to confirm that this fixes the
problem and/or add a unit
Author: tallison
Date: Fri Feb 13 02:03:39 2015
New Revision: 1659449
URL: http://svn.apache.org/r1659449
Log:
TIKA-1511 add parser for sqlite3
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testSqlite3b.db
(with props)
Modified:
tika/trunk/CHANGES.txt
tika/trunk
Author: tallison
Date: Fri Feb 13 01:00:31 2015
New Revision: 1659446
URL: http://svn.apache.org/r1659446
Log:
TIKA-1548 improve handling of encrypted pdfs when wrong password is offered
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
tika
Author: tallison
Date: Fri Feb 20 14:16:18 2015
New Revision: 1661129
URL: http://svn.apache.org/r1661129
Log:
TIKA-1553: add an EvilParser for testing purposes
Added:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/evil/
tika/trunk/tika-parsers/src/test/java/org/apache
Author: tallison
Date: Fri Jan 23 14:36:36 2015
New Revision: 1654225
URL: http://svn.apache.org/r1654225
Log:
TIKA-1529: step 1...get rid of toLowerCase in BasicContentHandlerFactoryTest
Modified:
tika/trunk/tika-core/src/test/java/org/apache/tika/sax/BasicContentHandlerFactoryTest.java
Author: tallison
Date: Wed Jan 28 19:04:39 2015
New Revision: 1655433
URL: http://svn.apache.org/r1655433
Log:
TIKA-1534: Upgrade to Commons Compress 1.9
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/CHANGES.txt
URL:
http://svn.apache.org/viewvc
Author: tallison
Date: Fri Feb 13 12:54:45 2015
New Revision: 1659547
URL: http://svn.apache.org/r1659547
Log:
TIKA-1511, third time is the charm...many apologies
Added:
tika/trunk/tika-core/src/main/java/org/apache/tika/metadata/Database.java
Added: tika/trunk/tika-core/src/main/java/org
Author: tallison
Date: Fri Feb 13 16:40:55 2015
New Revision: 1659598
URL: http://svn.apache.org/r1659598
Log:
TIKA-1511 try to revert to earlier version of sqlite-jdbc to avoid
unsatisfiedlikeerror on ubuntu
Modified:
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/tika-parsers
Author: tallison
Date: Wed Feb 11 12:59:03 2015
New Revision: 1658947
URL: http://svn.apache.org/r1658947
Log:
TIKA-1544 consecutive new lines not preserved in rtf
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents/testRTFNewlines.rtf
Modified:
tika/trunk/tika-parsers/src
Author: tallison
Date: Fri Feb 13 12:43:56 2015
New Revision: 1659545
URL: http://svn.apache.org/r1659545
Log:
TIKA-1511, with new files added...doh
Added:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/jdbc/
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser
Author: tallison
Date: Wed Jan 7 16:48:43 2015
New Revision: 1650117
URL: http://svn.apache.org/r1650117
Log:
TIKA-1445: add tests to TesseractOCRParserTest to ensure metadata is extracted
Modified:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/ocr
Author: tallison
Date: Fri Mar 6 14:41:07 2015
New Revision: 1664635
URL: http://svn.apache.org/r1664635
Log:
TIKA-1553 change EvilParser to MockParser and move to core
Added:
tika/trunk/tika-core/src/test/java/org/apache/tika/parser/mock/
tika/trunk/tika-core/src/test/java/org/apache
Author: tallison
Date: Mon Mar 30 13:29:11 2015
New Revision: 1670090
URL: http://svn.apache.org/r1670090
Log:
TIKA-1512 temporary workaround. Currently not including test docs or tests
that derive from govdocs1
Added:
tika/trunk/tika-parsers/src/test/resources/test-documents
Author: tallison
Date: Mon Mar 30 13:57:06 2015
New Revision: 1670095
URL: http://svn.apache.org/r1670095
Log:
TIKA-1584: fixed regression in Tika 1.7 that prevents processing of embedded
docs with /tika service
Modified:
tika/trunk/tika-server/src/main/java/org/apache/tika/server/resource
Author: tallison
Date: Mon Mar 30 19:43:38 2015
New Revision: 1670185
URL: http://svn.apache.org/r1670185
Log:
TIKA-1330, trivial fixes to avoid NPE with consumersManagerMaxMillis parameter
Modified:
tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/fs/builders
Author: tallison
Date: Tue Mar 31 01:58:04 2015
New Revision: 1670238
URL: http://svn.apache.org/r1670238
Log:
TIKA-1423: exclude pdfs and readme.txt files from tika-app and tika-server
jars. Anything else we can exclude?
Modified:
tika/trunk/tika-app/pom.xml
tika/trunk/tika-server
Author: tallison
Date: Tue Mar 31 01:54:40 2015
New Revision: 1670237
URL: http://svn.apache.org/r1670237
Log:
TIKA-1330: add integration tests to TikaCLITest
Modified:
tika/trunk/tika-app/src/main/java/org/apache/tika/cli/TikaCLI.java
tika/trunk/tika-app/src/test/java/org/apache/tika
Author: tallison
Date: Mon Mar 2 20:40:35 2015
New Revision: 1663424
URL: http://svn.apache.org/r1663424
Log:
TIKA-758 clean up after remembering PDFBOX-1130
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java
Modified:
tika/trunk/tika-parsers/src
Modified:
tika/trunk/tika-server/src/test/java/org/apache/tika/server/MetadataResourceTest.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-server/src/test/java/org/apache/tika/server/MetadataResourceTest.java?rev=1661200r1=1661199r2=1661200view=diff
Author: tallison
Date: Fri Feb 20 19:11:44 2015
New Revision: 1661193
URL: http://svn.apache.org/r1661193
Log:
TIKA-1323: allow tika-server to return stack traces from parse exceptions for
easier analysis of parser exceptions via tika-server.
Added:
tika/trunk/tika-server/src/main/java/org
Modified:
tika/trunk/tika-server/src/main/java/org/apache/tika/server/TikaResource.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-server/src/main/java/org/apache/tika/server/TikaResource.java?rev=1661200r1=1661199r2=1661200view=diff
Author: tallison
Date: Tue Mar 24 19:38:41 2015
New Revision: 1668967
URL: http://svn.apache.org/r1668967
Log:
TIKA-1531 upgrade to POI 3.12-beta1
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-bundle/pom.xml
tika/trunk/tika-parsers/pom.xml
Modified: tika/trunk/CHANGES.txt
URL
Added:
tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/BatchProcessDriverCLI.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/BatchProcessDriverCLI.java?rev=1668673view=auto
Added: tika/trunk/tika-batch/src/test/resources/tika-batch-config-broken.xml
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-batch/src/test/resources/tika-batch-config-broken.xml?rev=1668673view=auto
==
---
Added:
tika/trunk/tika-batch/src/test/java/org/apache/tika/batch/fs/FSBatchTestBase.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-batch/src/test/java/org/apache/tika/batch/fs/FSBatchTestBase.java?rev=1668673view=auto
Added:
tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/fs/RecursiveParserWrapperFSConsumer.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/fs/RecursiveParserWrapperFSConsumer.java?rev=1668673view=auto
Added:
tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/builders/BatchProcessBuilder.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-batch/src/main/java/org/apache/tika/batch/builders/BatchProcessBuilder.java?rev=1668673view=auto
Author: tallison
Date: Thu Apr 2 01:54:33 2015
New Revision: 1670807
URL: http://svn.apache.org/r1670807
Log:
TIKA-1323: flush writer when printing stack trace
Modified:
tika/trunk/tika-server/src/main/java/org/apache/tika/server/TikaServerParseExceptionMapper.java
Modified:
tika/trunk
Author: tallison
Date: Tue Apr 14 10:57:30 2015
New Revision: 1673406
URL: http://svn.apache.org/r1673406
Log:
TIKA-1605
Modified:
tika/trunk/tika-core/src/main/java/org/apache/tika/parser/RecursiveParserWrapper.java
tika/trunk/tika-core/src/test/java/org/apache/tika/io
Author: tallison
Date: Tue Apr 21 17:25:47 2015
New Revision: 1675159
URL: http://svn.apache.org/r1675159
Log:
TIKA-1611 -- allow RecursiveParserWrapper to catch exceptions caused by
embedded documents
Added:
tika/trunk/tika-core/src/main/java/org/apache/tika/utils/ExceptionUtils.java
Author: tallison
Date: Mon Apr 20 11:24:43 2015
New Revision: 1674800
URL: http://svn.apache.org/r1674800
Log:
TIKA-1511, move xerial dependency to 'provided'
Modified:
tika/trunk/CHANGES.txt
tika/trunk/tika-app/src/main/appended-resources/META-INF/LICENSE
tika/trunk/tika-parsers
Author: tallison
Date: Tue Apr 21 14:03:47 2015
New Revision: 1675121
URL: http://svn.apache.org/r1675121
Log:
TIKA-1501: Fix disabled OSGi related unit tests. Fixes from Bob Paulin.
Modified:
tika/trunk/tika-bundle/src/test/java/org/apache/tika/bundle/BundleIT.java
Modified:
tika/trunk
Modified:
tika/trunk/tika-server/src/test/java/org/apache/tika/server/StackTraceOffTest.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-server/src/test/java/org/apache/tika/server/StackTraceOffTest.java?rev=1679211r1=1679210r2=1679211view=diff
Modified:
tika/trunk/tika-example/src/main/java/org/apache/tika/example/DescribeMetadata.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-example/src/main/java/org/apache/tika/example/DescribeMetadata.java?rev=1679211r1=1679210r2=1679211view=diff
Modified:
tika/trunk/tika-example/src/main/java/org/apache/tika/example/SpringExample.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-example/src/main/java/org/apache/tika/example/SpringExample.java?rev=1679211r1=1679210r2=1679211view=diff
Modified:
tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/pkg/RarParserTest.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/pkg/RarParserTest.java?rev=1679211r1=1679210r2=1679211view=diff
Modified: tika/trunk/tika-batch/src/test/resources/log4j.properties
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-batch/src/test/resources/log4j.properties?rev=1679211r1=1679210r2=1679211view=diff
==
---
Author: tallison
Date: Mon Apr 6 13:18:59 2015
New Revision: 1671533
URL: http://svn.apache.org/r1671533
Log:
TIKA-1519 - don't allow potentially erroneous http-equiv Content-Type to
overwrite Content-Type in HtmlParser
Modified:
tika/trunk/tika-core/src/main/java/org/apache/tika/metadata
Author: tallison
Date: Mon Apr 6 15:54:59 2015
New Revision: 1671561
URL: http://svn.apache.org/r1671561
Log:
TIKA-1519 change underscore to dash
Modified:
tika/trunk/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
Modified:
tika/trunk/tika-core/src/main/java/org
Author: tallison
Date: Wed Apr 1 18:27:23 2015
New Revision: 1670749
URL: http://svn.apache.org/r1670749
Log:
TIKA-1330 clean up logging in tika-batch ant tika-app integration of tika-batch
Added:
tika/trunk/tika-app/src/main/resources/log4j_batch_process.properties
tika/trunk/tika-app
Author: tallison
Date: Wed Apr 1 18:52:08 2015
New Revision: 1670751
URL: http://svn.apache.org/r1670751
Log:
TIKA-1330 clean up logging in tika-batch ant tika-app integration of
tika-batch, take 2
Modified:
tika/trunk/tika-batch/src/main/java/org/apache/tika/batch
Author: tallison
Date: Fri Jun 5 01:44:56 2015
New Revision: 1683656
URL: http://svn.apache.org/r1683656
Log:
TIKA-1233 reopened
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika
Author: tallison
Date: Wed Jun 3 18:26:25 2015
New Revision: 1683409
URL: http://svn.apache.org/r1683409
Log:
TIKA-1646 small cleanup
Modified:
tika/trunk/tika-core/src/test/java/org/apache/tika/parser/mock/MockParser.java
Modified:
tika/trunk/tika-core/src/test/java/org/apache/tika
Author: tallison
Date: Wed Jun 24 19:49:50 2015
New Revision: 1687353
URL: http://svn.apache.org/r1687353
Log:
add test to ensure that the list reader for tika-batch properly creates
subdirectories
Added:
tika/trunk/tika-batch/src/test/resources/test-input/hierarchical/
tika/trunk/tika
Author: tallison
Date: Thu May 28 17:28:40 2015
New Revision: 1682287
URL: http://svn.apache.org/r1682287
Log:
TIKA-1315 -- basic list support for WordExtractor; still need to add in
override behavior once we add a class to ooxml via POI
Added:
tika/trunk/tika-parsers/src/main/java/org
Modified:
tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/txt/CharsetRecognizer.java
URL:
http://svn.apache.org/viewvc/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/txt/CharsetRecognizer.java?rev=1682489r1=1682488r2=1682489view=diff
1 - 100 of 6407 matches
Mail list logo