[
https://issues.apache.org/jira/browse/CONNECTORS-767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13749982#comment-13749982
]
Shinichiro Abe commented on CONNECTORS-767:
-------------------------------------------
I think we can get the file name from the last path segment of the URL, though
this is not always important.
Enterprise intranet sites contain attached and linked binary files (pdf, xls,
doc, etc.).
After crawling these files and posting them to Solr, I think a user may want to
search documents by file name rather than by terms in the title or content.
If we can get not only the file name but also the MIME type, we can do filter
searching and faceting on the Solr side.
This is why I'd like to get the file name (and MIME type).
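As a rough illustration of taking the file name from the last path segment of a URL, here is a minimal sketch using only the JDK. The class name FileNameFromUrl and the method extract are hypothetical and are not part of ManifoldCF. The sketch assumes that a URL ending in "/" or having no path has no usable file name.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper for illustration only; not part of ManifoldCF.
public class FileNameFromUrl {

    /**
     * Returns the last path segment of the given URL, or null when the URL
     * cannot be parsed, has no path, or ends with a slash.
     */
    public static String extract(String url) {
        try {
            // getPath() returns the percent-decoded path without query or fragment.
            String path = new URI(url).getPath();
            if (path == null || path.isEmpty() || path.endsWith("/")) {
                return null;
            }
            return path.substring(path.lastIndexOf('/') + 1);
        } catch (URISyntaxException e) {
            return null;
        }
    }
}
```

The result could then be passed to rd.setFileName(...) as in the issue description. Whether a null result should skip the call or fall back to some other value is left open here.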
> Can we get file name in Web Connector?
> --------------------------------------
>
> Key: CONNECTORS-767
> URL: https://issues.apache.org/jira/browse/CONNECTORS-767
> Project: ManifoldCF
> Issue Type: Bug
> Components: Web connector
> Affects Versions: ManifoldCF 1.3
> Reporter: Shinichiro Abe
> Priority: Trivial
>
> Although the file and sharedDrive connectors get the file name, the web
> connector currently does not. Can the web connector get a file name like below?
> RepositoryDocument rd = new RepositoryDocument();
> rd.setFileName(filename from URL);
> rd.setBinary(InputStream);