[Tika Wiki] Update of "TikaOCR" by ChrisMattmann

2014-09-29 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification. The "TikaOCR" page has been changed by ChrisMattmann: https://wiki.apache.org/tika/TikaOCR?action=diff&rev1=2&rev2=3 3. install leptonica with tiff support `brew install leptonica --w

[Tika Wiki] Update of "TikaOCR" by ChrisMattmann

2014-09-29 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification. The "TikaOCR" page has been changed by ChrisMattmann: https://wiki.apache.org/tika/TikaOCR?action=diff&rev1=1&rev2=2 = Mac Installation Instructions = - # If you are lucky `brew i

[Tika Wiki] Update of "TikaOCR" by ChrisMattmann

2014-09-29 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification. The "TikaOCR" page has been changed by ChrisMattmann: https://wiki.apache.org/tika/TikaOCR New page: With [[https://issues.apache.org/jira/browse/TIKA-93|TIKA-93]] you can now use the aw

[Tika Wiki] Update of "FrontPage" by ChrisMattmann

2014-09-29 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification. The "FrontPage" page has been changed by ChrisMattmann: https://wiki.apache.org/tika/FrontPage?action=diff&rev1=18&rev2=19 Comment: - add in Wiki page on Tika OCR * [[TikaJAXRS|Tika J

svn commit: r1628354 - in /tika/trunk: CHANGES.txt tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java

2014-09-29 Thread tallison
Author: tallison Date: Tue Sep 30 02:39:26 2014 New Revision: 1628354 URL: http://svn.apache.org/r1628354 Log: TIKA-1427: add markup for documents embedded in pdfs Modified: tika/trunk/CHANGES.txt tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java tika/t

svn commit: r1628350 - in /tika/trunk/tika-parsers/src: main/java/org/apache/tika/parser/pdf/PDF2XHTML.java test/java/org/apache/tika/parser/pdf/PDFParserTest.java test/resources/test-documents/testPD

2014-09-29 Thread tallison
Author: tallison Date: Tue Sep 30 01:41:20 2014 New Revision: 1628350 URL: http://svn.apache.org/r1628350 Log: TIKA-1433 : extract documents embedded within annotations in PDFs Added: tika/trunk/tika-parsers/src/test/resources/test-documents/testPDFFileEmbInAnnotation.pdf (with props) Mod

svn commit: r1628341 - /tika/trunk/tika-parsers/src/test/java/org/apache/tika/sax/PhoneExtractingContentHandlerTest.java

2014-09-29 Thread tpalsulich
Author: tpalsulich Date: Tue Sep 30 00:34:33 2014 New Revision: 1628341 URL: http://svn.apache.org/r1628341 Log: Use TikaTest.assertContains in PhoneExtractorContentHandlerTest. Modified: tika/trunk/tika-parsers/src/test/java/org/apache/tika/sax/PhoneExtractingContentHandlerTest.java Modifi

svn commit: r1628340 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/sax/ tika-example/src/main/java/org/apache/tika/example/ tika-example/src/test/java/org/apache/tika/example/ tika-example

2014-09-29 Thread tpalsulich
Author: tpalsulich Date: Tue Sep 30 00:17:58 2014 New Revision: 1628340 URL: http://svn.apache.org/r1628340 Log: TIKA-1420, move the PhoneExtractingContentHandler to tika-core. Tests in tika-parsers. Added: tika/trunk/tika-core/src/main/java/org/apache/tika/sax/CleanPhoneText.java - co