[ 
https://issues.apache.org/jira/browse/TIKA-1319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14019374#comment-14019374
 ] 

Chris A. Mattmann commented on TIKA-1319:
-----------------------------------------

OK integrated in the patch from review board into trunk, built and test:

{noformat}
[INFO] 
[INFO] --- maven-site-plugin:3.0:attach-descriptor (attach-descriptor) @ tika 
---
[INFO] 
[INFO] --- maven-install-plugin:2.3.1:install (default-install) @ tika ---
[INFO] Installing /Users/mattmann/tmp/tika/pom.xml to 
/Users/mattmann/.m2/repository/org/apache/tika/tika/1.6-SNAPSHOT/tika-1.6-SNAPSHOT.pom
[INFO] Installing /Users/mattmann/tmp/tika/target/tika-1.6-SNAPSHOT-site.xml to 
/Users/mattmann/.m2/repository/org/apache/tika/tika/1.6-SNAPSHOT/tika-1.6-SNAPSHOT-site.xml
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO] 
[INFO] Apache Tika parent ................................ SUCCESS [4.679s]
[INFO] Apache Tika core .................................. SUCCESS [42.186s]
[INFO] Apache Tika parsers ............................... SUCCESS [2:55.816s]
[INFO] Apache Tika XMP ................................... SUCCESS [10.876s]
[INFO] Apache Tika serialization ......................... SUCCESS [8.565s]
[INFO] Apache Tika application ........................... SUCCESS [45.837s]
[INFO] Apache Tika OSGi bundle ........................... SUCCESS [50.357s]
[INFO] Apache Tika server ................................ SUCCESS [51.133s]
[INFO] Apache Tika translate ............................. SUCCESS [6.250s]
[INFO] Apache Tika Java-7 Components ..................... SUCCESS [7.767s]
[INFO] Apache Tika ....................................... SUCCESS [0.055s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 6:45.795s
[INFO] Finished at: Thu Jun 05 13:56:12 PDT 2014
[INFO] Final Memory: 76M/247M
[INFO] ------------------------------------------------------------------------
{noformat}

All tests passed, going forward to commit now.


> Translation
> -----------
>
>                 Key: TIKA-1319
>                 URL: https://issues.apache.org/jira/browse/TIKA-1319
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Tyler Palsulich
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>
> I just opened up a review on reviews.apache.org -- 
> https://reviews.apache.org/r/22219/. I copied the description below. 
> This patch adds basic language translation functionality to Tika. Translation 
> is provided by a Microsoft API, but accessed through Apache 2 licensed 
> com.memetix.microsoft-translator-java-api 
> (https://code.google.com/p/microsoft-translator-java-api/ ). If a user wants 
> to use the translation feature, they have to add a client id and client 
> secret to the 
> tika-core/src/main/resources/org/apache/tika/language/translator.properties 
> file (see http://msdn.microsoft.com/en-us/library/hh454950.aspx ). I added 
> com.memetix as a dependency in tika-core. I put the Translator class in 
> org.apache.tika.language. There is no integration with the server or CLI, 
> yet. Further, only Strings are translated right now -- if you pass in a full 
> document with xml tags, the structure will be mangled. But, I think that 
> would be a cool feature -- translate the body, title, subtitle, etc, but not 
> the structural elements. 
> There is still more work to do, but I wanted some more eyes on this to make 
> sure I'm heading in the right direction and this is a desired feature. Let me 
> know what you think!
> There are two simple unit tests for now which translate "hello" to French 
> ("salut"). One for inputting the source and target languages, one for 
> inputing just the target language (and detecting the source language 
> automatically).



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to