[ 
https://issues.apache.org/jira/browse/TIKA-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683363#action_12683363
 ] 

Uwe Schindler edited comment on TIKA-200 at 3/19/09 1:41 AM:
-------------------------------------------------------------

For more advanced content-type parsing and support for compressed HTTP 
streams, have a look at 
http://panfmp.svn.sourceforge.net/viewvc/panfmp/main/trunk/src/de/pangaea/metadataportal/harvester/OAIHarvesterBase.java?view=markup
 line 177 ff.
That method creates a SAX InputSource from an HTTP URL with all properties 
set correctly, plus some extra features. An InputSource built from only a 
SystemID supports neither compression nor Retry-After. For the underlying 
parser to work correctly, the charset encoding should be set (if available 
from the HTTP response). This more complex example was needed for an OAI-PMH 
harvester, for effective metadata harvesting with compression, Last-Modified 
and so on.
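
To illustrate the idea described above, here is a minimal sketch of how such 
a method could look. It is not the panFMP code from the link; the class name 
HttpInputSourceSketch and the helper names charsetFromContentType, decode and 
open are made up for this example. It assumes an http/https URL and only 
reports Retry-After in the exception message instead of actually waiting and 
retrying.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.Locale;
import java.util.zip.GZIPInputStream;
import java.util.zip.InflaterInputStream;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of Tika or panFMP.
final class HttpInputSourceSketch {

    /** Returns the charset parameter of a Content-Type value, or null if absent. */
    public static String charsetFromContentType(String contentType) {
        if (contentType == null) return null;
        for (String part : contentType.split(";")) {
            String p = part.trim();
            if (p.toLowerCase(Locale.ROOT).startsWith("charset=")) {
                String cs = p.substring("charset=".length()).trim();
                // Strip optional surrounding quotes: charset="UTF-8"
                if (cs.length() >= 2 && cs.startsWith("\"") && cs.endsWith("\"")) {
                    cs = cs.substring(1, cs.length() - 1);
                }
                return cs.isEmpty() ? null : cs;
            }
        }
        return null;
    }

    /** Wraps the raw stream according to the Content-Encoding response header. */
    public static InputStream decode(InputStream in, String contentEncoding)
            throws IOException {
        if (contentEncoding == null) return in;
        String enc = contentEncoding.trim().toLowerCase(Locale.ROOT);
        if (enc.equals("gzip") || enc.equals("x-gzip")) return new GZIPInputStream(in);
        if (enc.equals("deflate")) return new InflaterInputStream(in);
        return in; // unknown or identity encoding: pass through unchanged
    }

    /** Opens an http/https URL and returns a fully configured InputSource. */
    public static InputSource open(URL url) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        // Tell the server that compressed responses are acceptable.
        conn.setRequestProperty("Accept-Encoding", "gzip, deflate");

        if (conn.getResponseCode() == HttpURLConnection.HTTP_UNAVAILABLE) {
            // A real harvester would sleep and retry; this sketch only reports it.
            throw new IOException("503 from " + url + ", Retry-After: "
                    + conn.getHeaderField("Retry-After"));
        }

        InputSource src = new InputSource(
                decode(conn.getInputStream(), conn.getContentEncoding()));
        src.setSystemId(url.toString()); // lets the parser resolve relative references
        String charset = charsetFromContentType(conn.getContentType());
        if (charset != null) {
            src.setEncoding(charset);    // charset from the HTTP response, if any
        }
        return src;
    }
}
```

The returned InputSource can be passed to any SAX parser, for example 
XMLReader.parse(InputSource). If the response carries no charset, the 
encoding is left unset so that the parser can detect it from the XML 
declaration.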

> Allow URL drag and drop in the Tika GUI
> ---------------------------------------
>
>                 Key: TIKA-200
>                 URL: https://issues.apache.org/jira/browse/TIKA-200
>             Project: Tika
>          Issue Type: New Feature
>          Components: gui
>            Reporter: Jukka Zitting
>            Priority: Minor
>             Fix For: 0.4
>
>         Attachments: TIKA-200.diff
>
>
> It would be nice if I could drag a URL from my browser to the Tika GUI window 
> to have the linked document downloaded and parsed by Tika.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
