Hi Karl,
We tried to apply your patch from the ManifoldCF home directory,
/var/lib/apache-manifoldcf-1.7, as the root user on Ubuntu,
with the following command:
patch -p0 < CONNECTORS-1022.patch
but we got this error:
(Stripping trailing CRs from patch.)
can't find file to patch at input line 5
Perhaps you used the wrong -p or --strip option?
The text leading up to this was:
--------------------------
|Index: connectors/rss/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/rss/RSSConnector.java
|===================================================================
|--- connectors/rss/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/rss/RSSConnector.java (revision 1622851)
|+++ connectors/rss/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/rss/RSSConnector.java (revision 1622852)
What’s wrong with it?
Thanks,
Luca
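The "-p" hint in the error refers to how many leading path components patch
strips from the file names in the diff header before looking for them relative
to the current directory. A minimal Python sketch of that lookup follows. The
helper names strip_components and find_strip_level are made up for
illustration and are not part of patch or ManifoldCF; the code only mimics the
path handling, not patch itself.

```python
import os


def strip_components(path, n):
    """Mimic `patch -pN`: drop the first n slash-separated components."""
    return "/".join(path.split("/")[n:])


def find_strip_level(patch_path, root="."):
    """Return the smallest -p value whose stripped path is a file under root.

    Returns None when no strip level yields an existing file, i.e. the
    target of the patch is not present below `root` at all.
    """
    depth = patch_path.count("/")
    for n in range(depth + 1):
        candidate = os.path.join(root, strip_components(patch_path, n))
        if os.path.isfile(candidate):
            return n
    return None


if __name__ == "__main__":
    # Path copied from the diff header quoted in the error message.
    target = ("connectors/rss/connector/src/main/java/org/apache/manifoldcf/"
              "crawler/connectors/rss/RSSConnector.java")
    print(find_strip_level(target, root="."))
```

Run from the directory where patch was invoked, a result of 0 means -p0 is
right, another number means a different -p value is needed, and None means the
file is not under that directory at all. The last case would fit a directory
that holds a binary installation without the connector sources, but whether
that applies here is not confirmed in this thread.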
From: Basso Luca [mailto:[email protected]]
Sent: Monday, 8 September 2014 11:34
To: Karl Wright; [email protected]
Subject: R: User Mapping Connection
Hi Karl,
You were faster than light!
Thank you so much; we'll test your patch as soon as possible.
Luca
From: Karl Wright [mailto:[email protected]]
Sent: Saturday, 6 September 2014 14:34
To: Basso Luca; [email protected]
Subject: Re: User Mapping Connection
Hi Luca,
I've created the ticket CONNECTORS-1022. I ported the URL Mapping facility
from the RSS connector to the Web connector. There's a patch attached. Please
see if this works for you.
Thanks!
Karl
On Fri, Sep 5, 2014 at 7:28 AM, Karl Wright <[email protected]> wrote:
The next release is scheduled for December. The feature would become available
when complete on the dev_1x branch. I am very busy at this time so I doubt
that would be possible in less than a few weeks.
Karl
________________________________
From: Basso Luca
Sent: 9/5/2014 6:05 AM
To: [email protected]
Subject: R: User Mapping Connection
Hi Karl,
We really need such a URL mapping facility.
We also strongly believe that it could be very useful for the whole community,
so we are going to create a ticket for it.
Nevertheless, our main concern is the timeline: what is your estimate of
a possible release date for this facility?
Thanks,
Luca
From: Karl Wright [mailto:[email protected]]
Sent: Thursday, 4 September 2014 16:20
To: [email protected]
Subject: Re: User Mapping Connection
Hi Luca,
User name mapping is useful only in the context of document authorization.
From your description of the problem, that has nothing to do at all with the
actual problem you have, which, if I may restate, is to map the URL of a
crawled Web document from its original form to a new form, where there are no
trailing "/" characters.
To do what you are looking to do, you do not need any authority groups or
authorities. But you do need some facility for mapping URLs, either in the web
connector itself, or in the solr connector, or possibly in a transformation
connector. Unfortunately, no such support exists in ManifoldCF at this time;
something would need to be developed. You are free to create a ticket for this
work, which I think would best be modeled as a transformation connector.
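
For reference, the URL rewrite being asked for can be sketched in a few lines
of Python. This is only an illustration of the regular expression quoted in
the original question further down the thread, (.+)(/)$ with replacement
$(1); it is not ManifoldCF code, and the function name strip_trailing_slash
and the example.com URLs are made up. Python writes the group reference as \1
rather than $(1).

```python
import re

# Same pattern as in the original question: everything up to a final "/".
TRAILING_SLASH = re.compile(r"(.+)(/)$")


def strip_trailing_slash(url):
    """Return url without its last trailing "/", if it has one.

    URLs that do not end in "/" are returned unchanged. Only one slash
    is removed per call, because the greedy (.+) keeps any earlier ones.
    """
    return TRAILING_SLASH.sub(r"\1", url)


if __name__ == "__main__":
    for url in ("http://example.com/docs/", "http://example.com/page.html"):
        print(url, "->", strip_trailing_slash(url))
```

Note that a URL ending in several slashes loses only the last one with this
pattern, so removing all trailing "/" characters would need either repeated
application or a different expression.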
Thanks,
Karl
On Thu, Sep 4, 2014 at 10:09 AM, Basso Luca <[email protected]> wrote:
Hi Karl,
we are running ManifoldCF 1.6.1 with the Web Repository connector and the Solr
Output connector.
Our goal now is to remap crawled URLs using the “User Mapping Connection”
utility.
To achieve this result we tried the following steps:
1. Define a User Mapping Connection with these fields, in order to remove a
   trailing slash from the end of any URL:
   User name regular expression: (.+)(/)$
   User expression: $(1)
2. Define an Authority Group.
3. Define an Authority Connection, belonging to our Authority Group and
   containing our User Mapping Connection.
4. Connect our Repository Connection to the Authority Group.
Unfortunately this procedure is not working.
What is going wrong?
Thank you.
Best regards,
Luca