[ https://issues.apache.org/jira/browse/TIKA-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683363#action_12683363 ]
Uwe Schindler edited comment on TIKA-200 at 3/19/09 1:41 AM: ------------------------------------------------------------- For a more advanced parsing of content type and also support of compressed HTTP streams, have a look at http://panfmp.svn.sourceforge.net/viewvc/panfmp/main/trunk/src/de/pangaea/metadataportal/harvester/OAIHarvesterBase.java?view=markup line 177 ff. This is a nice method that creates a SAX InputSource with all properties correctly set from an HTTP URL with some extra features. The InputSource with only a given SystemID does not support compression, retry-after. For the underlying parser to work correct, the charset encoding should be set (if available from the HTTP response). This more complex example was needed for an OAI-PMH harvester for effective metadata harvesting with compression, last-modified and so on. was (Author: thetaphi): For a more advanced parsing of content type and also support of compressed HTTP streams, have a look at http://panfmp.svn.sourceforge.net/viewvc/panfmp/main/trunk/src/de/pangaea/metadataportal/harvester/OAIHarvesterBase.java?view=markup line 177 ff. This is a nice method that creates a SAX InputSource with all properties correctly set from an HTTP urlswith some extra features, the InputSource with only a given SystemID does not support (compression, retry-after). For the underlying parser to work correct, the charset encoding should be set (if available from the HTTP response). This mpore complex example was needed for an OAI-PMH harvester for effective metadata harvesting with compression and so on. > Allow URL drag and drop in the Tika GUI > --------------------------------------- > > Key: TIKA-200 > URL: https://issues.apache.org/jira/browse/TIKA-200 > Project: Tika > Issue Type: New Feature > Components: gui > Reporter: Jukka Zitting > Priority: Minor > Fix For: 0.4 > > Attachments: TIKA-200.diff > > > It would be nice if I could drag a URL from my browser to the Tika GUI window > to have the linked document downloaded and parsed by Tika. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.