Hi Maurizio,

I will be available all day tomorrow (Wednesday) to help out as much as I can. 
If it's possible for you to look into this I can take whatever steps you need.

Many thanks,

Paul

-----Original Message-----
From: "Karl Wright" <[email protected]>
Sent: Tuesday, October 20, 2015 12:34pm
To: "[email protected]" <[email protected]>
Subject: Re: Manifold/Alfresco seeding and security

Hi Maurizio,

This is the third time we've seen this; can you use Paul's help to chase
down what the issue is?

Karl


On Tue, Oct 20, 2015 at 12:19 PM, Paul Farrell <[email protected]>
wrote:

> Hi,
>
> I am using Alfresco Community 5.0.
>
> Having taken that AMP file (version 0.7.1) and then installed it into
> Alfresco and restarted the services, the issue is still present.
>
> I suspect that this is probably more to do with the Manifold end than the
> Alfresco end. It seems it is Manifold that is automatically appending the
> “/api/node” string into the path whenever I use “/alfresco/service” as the
> Context in the repository connection configuration.
>
> If it is of interest, this is the output in the manifoldcf.log file when I
> use the repo connection config I mentioned earlier.
>
> DEBUG 2015-10-20 12:18:46,869 (qtp182259421-40) - Connection request:
> [route: {}->http://54.165.85.140:8080][total kept alive: 0; route
> allocated: 0 of 2; total allocated: 0 of 20]
> DEBUG 2015-10-20 12:18:46,869 (qtp182259421-40) - Connection leased: [id:
> 10][route: {}->http://54.165.85.140:8080][total kept alive: 0; route
> allocated: 1 of 2; total allocated: 1 of 20]
> DEBUG 2015-10-20 12:18:46,869 (qtp182259421-40) - Opening connection {}->
> http://54.165.85.140:8080
> DEBUG 2015-10-20 12:18:46,869 (qtp182259421-40) - Connecting to /
> 54.165.85.140:8080
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - Connection established
> 172.31.23.90:58712<->54.165.85.140:8080
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - Executing request GET
> /alfresco/service/api/node/auth/resolve/admin HTTP/1.1
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - Proxy auth state:
> UNCHALLENGED
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >> GET
> /alfresco/service/api/node/auth/resolve/admin HTTP/1.1
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> Accept: application/json
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> Authorization: Basic YWRtaW46RnVubmVsYmFjazE=
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> Host: 54.165.85.140:8080
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> Connection: Keep-Alive
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> User-Agent: Apache-HttpClient/4.3.5 (java 1.5)
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> Accept-Encoding: gzip,deflate
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >> "GET
> /alfresco/service/api/node/auth/resolve/admin HTTP/1.1[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "Accept: application/json[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "Authorization: Basic YWRtaW46RnVubmVsYmFjazE=[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "Host: 54.165.85.140:8080[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "Connection: Keep-Alive[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "User-Agent: Apache-HttpClient/4.3.5 (java 1.5)[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "Accept-Encoding: gzip,deflate[\r][\n]"
> DEBUG 2015-10-20 12:18:46,870 (qtp182259421-40) - http-outgoing-10 >>
> "[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "HTTP/1.1 404 Not Found[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Server: Apache-Coyote/1.1[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Cache-Control: no-cache[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Expires: Thu, 01 Jan 1970 00:00:00 GMT[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Pragma: no-cache[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Content-Type: text/html;charset=UTF-8[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Transfer-Encoding: chunked[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "Date: Tue, 20 Oct 2015 16:18:47 GMT[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "630[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "
> http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd";>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "<html xmlns="http://www.w3.org/1999/xhtml";>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
> <head>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>    <title>Web Script Status 404 - Not Found</title>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>    <link rel="stylesheet" href="/alfresco/css/webscripts.css"
> type="text/css" />[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
> </head>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
> <body>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>    <div>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       <table>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>             <td><img src="/alfresco/images/logo/AlfrescoLogo32.png"
> alt="Alfresco" /></td>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>             <td><span class="title">Web Script Status 404 - Not
> Found</span></td>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          </tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       </table>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       <br/>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       <table>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td>The Web Script <a
> href="%2Falfresco%2Fservice%2Fapi%2Fnode%2Fauth%2Fresolve%2Fadmin">/alfresco/service/api/node/auth/resolve/admin</a>
> has responded with a status of 404 - Not Found.</td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       </table>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       <br/>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       <table>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td><b>404 Description:</b></td><td> Requested resource is not
> available.</td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td> </td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td><b>Message:</b></td><td>Cannot find object for
> NodeIdReference[storeRef=auth://resolve,id=admin]</td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td><b>Server</b>:</td><td>Community v5.0.0 (r75118-b23)
> schema 8,001</td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td><b>Time</b>:</td><td>Oct 20, 2015 4:18:47 PM</td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td></td><td> </td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>          <tr><td><b>Diagnostics</b>:</td><td><a
> href="/alfresco/service/script/org/alfresco/cmis/item.get">Inspect Web
> Script (org/alfresco/cmis/item.get)</a></td></tr>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>       </table>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
>    </div>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 << "
> </body>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "</html>[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "[\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> "[\r][\n]"
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> HTTP/1.1 404 Not Found
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Server: Apache-Coyote/1.1
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Cache-Control: no-cache
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Expires: Thu, 01 Jan 1970 00:00:00 GMT
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Pragma: no-cache
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Content-Type: text/html;charset=UTF-8
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Transfer-Encoding: chunked
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10 <<
> Date: Tue, 20 Oct 2015 16:18:47 GMT
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - Connection can be kept
> alive indefinitely
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10:
> Shutdown connection
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - Connection discarded
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - http-outgoing-10: Close
> connection
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - Connection released:
> [id: 10][route: {}->http://54.165.85.140:8080][total kept alive: 0; route
> allocated: 0 of 2; total allocated: 0 of 20]
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - Connection manager is
> shutting down
> DEBUG 2015-10-20 12:18:46,883 (qtp182259421-40) - Connection manager shut
> down
>
> *Paul Farrell*
> Senior Search Consultant
>
> 109-123 Clifton Street, London EC2A 4LD
> *T* +44 (0) 207 183 6865 | funnelback.com <http://www.funnelback.com/>
>
> *UNITED KINGDOM* | AUSTRALIA | NEW ZEALAND | POLAND | UNITED STATES
>
> Connect with us: LinkedIn <http://www.linkedin.com/company/funnelback> -
> Twitter <https://twitter.com/funnelback>
>
> Funnelback UK Ltd is a limited liability company registered in England &
> Wales. Registered address: Zetland House 109-123, Clifton Street, London.
> EC2A 4LD. Company registration number: 07004264.
>
> On 20 Oct 2015, at 16:50, Maurizio Pillitu <[email protected]> wrote:
>
> Hi Paul,
>
> it looks like you're hitting
> https://github.com/maoo/alfresco-indexer/issues/3 ; which version of
> alfresco-indexer are you using? Can you try using
> http://search.maven.org/#artifactdetails%7Ccom.github.maoo.indexer%7Calfresco-indexer-webscripts%7C0.7.1%7Camp
>  (or
> the pre-built WAR file -
> http://search.maven.org/#artifactdetails%7Ccom.github.maoo.indexer%7Calfresco-indexer-webscripts-war%7C0.7.1%7Cwar
>  )
>
> HTH
>   mao
>
> On Tue, Oct 20, 2015 at 5:36 PM Paul Farrell <[email protected]>
> wrote:
>
>> Hi,
>>
>> Having had to go back to basics and re-install my Alfresco instance, I
>> can confirm that the AMP file for the alfresco indexer web scripts *does*
>> actually install without error. There must have been an issue with my
>> previous Alfresco instance.
>>
>> Having said that, the Alfresco WebScript connector fails. The failure is
>> down to the ‘Context’ setting (see below):
>>
>> <4a6db6238cff01e7ff77cdaf7e6ea050.png>
>>
>> When you attempt to save the configuration of the WebScript connector,
>> Manifold clearly tries to check the connection. It seems to do this by
>> making an API call (/auth/resolve/admin). The issue is with what Manifold
>> prepends to the start of that path.
>> If I leave the setting as above then Manifold reports   :
>>
>> <tr><td>The Web Script <a
>> href="%2Falfresco%2Fservice%2Fapi%2Fnode%2Fauth%2Fresolve%2Fadmin">/alfresco/service/api/node/auth/resolve/admin</a>
>> has responded with a status of 404 - Not Found.</td></tr>[\n]”
>>
>> In other words, it builds the full path as
>> “alfresco/service/api/node/auth/resolve/admin”.
>>
>> For my Alfresco Community 5.0 instance, I get to that same web script via
>> the URL “/alfresco/service/auth/resolve/admin” i.e. without the ‘/api/node’.
>>
>> Somewhere, Manifold is assuming that the ‘/api/node’ is a correct path
>> inclusion. In other words, there is nothing I can put into that box to
>> prevent it.
>>
>> Paul
>>
>> On 20 Oct 2015, at 12:56, Karl Wright <[email protected]> wrote:
>>
>> Hmm.  What file was missing?  Maurizio maintains the indexer plugin; I
>> feel certain he'd want to know.
>>
>> Karl
>>
>>
>> On Tue, Oct 20, 2015 at 7:53 AM, Paul Farrell <[email protected]>
>> wrote:
>>
>>> Hi guys,
>>>
>>> Just to let you know what’s going on - for informational purposes more
>>> than anything.
>>>
>>> I initially tried taking the AMP file provided in the MCF plugins
>>> directory (0.7.0) and tried to install it into Alfresco but got a message
>>> saying a file was missing.
>>>
>>> Instead, I cloned the repository on GitHub for the alfresco-indexer
>>> project and then built it on my local machine. This generated the AMP file
>>> (0.7.2).
>>>
>>> I was able to successfully install the AMP file onto my Alfresco
>>> instance.
>>>
>>> As it happens I now cannot log into Alfresco Share ('bad credentials or
>>> server not available' message) but that is something I can work on.
>>> Apparently the installation of some AMP files have been known to cause this
>>> issue.
>>>
>>> So, progress to a point!
>>>
>>> *Paul Farrell*
>>> Senior Search Consultant
>>>
>>> 109-123 Clifton Street, London EC2A 4LD
>>> *T* +44 (0) 207 183 6865 | funnelback.com <http://www.funnelback.com/>
>>>
>>> *UNITED KINGDOM* | AUSTRALIA | NEW ZEALAND | POLAND | UNITED STATES
>>>
>>> Connect with us: LinkedIn <http://www.linkedin.com/company/funnelback> -
>>>  Twitter <https://twitter.com/funnelback>
>>>
>>> Funnelback UK Ltd is a limited liability company registered in England &
>>> Wales. Registered address: Zetland House 109-123, Clifton Street, London.
>>> EC2A 4LD. Company registration number: 07004264.
>>>
>>> On 20 Oct 2015, at 12:36, Rafa Haro <[email protected]> wrote:
>>>
>>> Hi,
>>>
>>> At the Alfresco side, hope this helps:
>>>
>>> http://docs.alfresco.com/4.1/tasks/amp-install.html
>>>
>>> Cheers
>>>
>>>
>>>
>>>
>>>
>>> On Tue, Oct 20, 2015 at 1:13 PM, Karl Wright <[email protected]> wrote:
>>>
>>>> The AMP file is actually shipped as part of the binary MCF
>>>> distribution.  You can find it under "plugins".
>>>>
>>>> Karl
>>>>
>>>>
>>>> On Tue, Oct 20, 2015 at 6:42 AM, Paul Farrell <[email protected]>
>>>> wrote:
>>>>
>>>>> Hi all,
>>>>>
>>>>> Hopefully this will be my only request for information today.
>>>>> I’m afraid this is a bit of a newbie question but I have managed to
>>>>> get the Manifold UI to now show ‘Alfresco Webscripts’ as a connector. The
>>>>> only bit I am missing now is to install the AMP file in Afresco.
>>>>>
>>>>> I realise that this is slightly outside of the Manifold remit but I
>>>>> wondered if anyone can advise how I build the AMP file from the URL (
>>>>> https://github.com/maoo/alfresco-indexer)? I have cloned the
>>>>> repository to my local drive but, having never worked with Maven, am at a
>>>>> loss at how to generate the AMP file that I then need to install into
>>>>> Alfresco.
>>>>>
>>>>> Many thanks,
>>>>>
>>>>> On 19 Oct 2015, at 17:36, Karl Wright <[email protected]> wrote:
>>>>>
>>>>> The only way you can have such a reduced list of connectors is if
>>>>> somebody commented out many connectors in your connectors.xml, or removed
>>>>> them from the database table where they are registered by hand.
>>>>>
>>>>> Karl
>>>>>
>>>>>
>>>>> On Mon, Oct 19, 2015 at 12:33 PM, Paul Farrell <
>>>>> [email protected]> wrote:
>>>>>
>>>>>> After a good deal of time clicking around I came to the same
>>>>>> conclusion - that there is no way of telling from the UI!!
>>>>>>
>>>>>> Having dug a bit deeper I believe I may actually have the Alfresco
>>>>>> WebScript connectors installed. At least the 0.7.0 version. I notice in 
>>>>>> the
>>>>>> ‘lib’ directory that I have ‘alfresco-indexer-webscripts-0.7.0.amp.
>>>>>>
>>>>>> Looking in the ‘connectors.xml’ file I can also see the line :
>>>>>>
>>>>>> <repositoryconnector name="Alfresco Webscript"
>>>>>> class="org.apache.manifoldcf.crawler.connectors.alfrescowebscript.AlfrescoConnector”/>
>>>>>>
>>>>>> You can imagine my excitement!
>>>>>>
>>>>>> The only thing I am missing is the option in the UI. When I click to
>>>>>> create a new repo connection I get:  CMIS, Dropbox, Generic, GoogleDrive,
>>>>>> HDFS, Jira, Meridio, RSS, Sharepoint.
>>>>>>
>>>>>> Perhaps I am hoping for too much to hope that I can make a simple
>>>>>> change to enable this repo connection?
>>>>>>
>>>>>> Thanks for all the help everyone
>>>>>>
>>>>>>
>>>>>>
>>>>>> On 19 Oct 2015, at 17:26, Karl Wright <[email protected]> wrote:
>>>>>>
>>>>>> Hah; there's not a way to inquire in the UI, if that's what you
>>>>>> mean.  But if you see "Alfresco webscript" in the list of repository
>>>>>> connection types, you've got a version that supports that connector.
>>>>>>
>>>>>> Thanks,
>>>>>> Karl
>>>>>>
>>>>>>
>>>>>> On Mon, Oct 19, 2015 at 12:17 PM, Paul Farrell <
>>>>>> [email protected]> wrote:
>>>>>>
>>>>>>> Thanks Rafa.
>>>>>>>
>>>>>>> As an aside, is there an easy way to identify which version of
>>>>>>> ManifoldCF you are on?
>>>>>>>
>>>>>>> Cheers
>>>>>>>
>>>>>>> *Paul Farrell*
>>>>>>> Senior Search Consultant
>>>>>>>
>>>>>>> 109-123 Clifton Street, London EC2A 4LD
>>>>>>> *T* +44 (0) 207 183 6865 | funnelback.com
>>>>>>> <http://www.funnelback.com/>
>>>>>>>
>>>>>>> *UNITED KINGDOM* | AUSTRALIA | NEW ZEALAND | POLAND | UNITED STATES
>>>>>>>
>>>>>>> Connect with us: LinkedIn
>>>>>>> <http://www.linkedin.com/company/funnelback> - Twitter
>>>>>>> <https://twitter.com/funnelback>
>>>>>>>
>>>>>>> Funnelback UK Ltd is a limited liability company registered in
>>>>>>> England & Wales. Registered address: Zetland House 109-123, Clifton 
>>>>>>> Street,
>>>>>>> London. EC2A 4LD. Company registration number: 07004264.
>>>>>>>
>>>>>>> On 19 Oct 2015, at 16:54, Rafa Haro <[email protected]> wrote:
>>>>>>>
>>>>>>> Hi Paul,
>>>>>>>
>>>>>>> All you need to do is to install this webscript
>>>>>>> <https://github.com/maoo/alfresco-indexer> within your Alfresco
>>>>>>> instance. The connector itself is already part of the most recent 
>>>>>>> versions
>>>>>>> of ManifoldCF
>>>>>>>
>>>>>>> Cheers,
>>>>>>> Rafa
>>>>>>>
>>>>>>> On Mon, Oct 19, 2015 at 5:29 PM, Paul Farrell <
>>>>>>> [email protected]> wrote:
>>>>>>>
>>>>>>>> Ok, thanks again guys.
>>>>>>>>
>>>>>>>> The Webscript connector it is.
>>>>>>>>
>>>>>>>> I realise I am asking a lot here but are there any easy-to-follow
>>>>>>>> guidelines on how to get this Webscript connector installed?  I see 
>>>>>>>> there
>>>>>>>> is a GitHub page here (
>>>>>>>> https://github.com/maoo/alfresco-webscript-manifold-connector)
>>>>>>>> which discusses it (although it directs you to a repository of files).
>>>>>>>>
>>>>>>>> I am just keen to make sure that any steps I follow to try and get
>>>>>>>> this Webscript connector installed and working are updated, reliable 
>>>>>>>> steps.
>>>>>>>> I would hate to waste time with out of date information.
>>>>>>>>
>>>>>>>> Thanks all
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On 19 Oct 2015, at 16:23, Muhammed Olgun <[email protected]>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>> Hi Paul,
>>>>>>>>
>>>>>>>> I suggest that you should use Alfresco Webscript as Karl mentioned.
>>>>>>>> Web services is so slow compared to other services and I've also 
>>>>>>>> checked
>>>>>>>> that Alfresco CMIS web services does not return change token(may be 
>>>>>>>> there
>>>>>>>> is something that I don't know).
>>>>>>>>
>>>>>>>> By the way current version of CMIS connector is not aware of change
>>>>>>>> token. I would write a patch for you if alfresco supports change token
>>>>>>>> property.
>>>>>>>>
>>>>>>>> Thanks!
>>>>>>>> Muhammed
>>>>>>>> 19 Eki 2015 Pzt, saat 18:11 tarihinde Karl Wright <
>>>>>>>> [email protected]> şunu yazdı:
>>>>>>>>
>>>>>>>>> Hi Paul,
>>>>>>>>>
>>>>>>>>> The Alfresco Webscript connector is a wholly different connector
>>>>>>>>> that has no relation to the CMIS connector.  It requires an Alfresco
>>>>>>>>> webscript plugin be installed on your Alfresco server to work, though.
>>>>>>>>>
>>>>>>>>> Hope that helps.
>>>>>>>>>
>>>>>>>>> Karl
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Mon, Oct 19, 2015 at 10:32 AM, Paul Farrell <
>>>>>>>>> [email protected]> wrote:
>>>>>>>>>
>>>>>>>>>> Hi Muhammed/Karl,
>>>>>>>>>>
>>>>>>>>>> Firstly, thank-you so much for taking the time to reply. It is
>>>>>>>>>> very much appreciated.
>>>>>>>>>>
>>>>>>>>>> Currently I am using the AtomPub for my CMIS repository
>>>>>>>>>> connection. I have just read something which may shed a little light 
>>>>>>>>>> on
>>>>>>>>>> this. The post read that change tokens are not passed via AtomPub
>>>>>>>>>> connections (
>>>>>>>>>> https://forums.alfresco.com/forum/developer-discussions/alfresco-api/cmis-change-log-token-problem-using-opencmis-03282011-1758).
>>>>>>>>>> If true, this would explain why ManifoldCF may be unable to 
>>>>>>>>>> determine a
>>>>>>>>>> change in Alfresco.
>>>>>>>>>>
>>>>>>>>>> It looks like I have two possible options left open to me
>>>>>>>>>> (correct me if I’m wrong):
>>>>>>>>>>
>>>>>>>>>> 1. I look to use ‘Web Services’ instead of ‘AtomPub’ for the
>>>>>>>>>> connection mechanism
>>>>>>>>>> 2. I upgrade ManifoldCF so that I can use the ‘Web Scripts’
>>>>>>>>>> connector?  (or is this the same as the ‘Web Services’ connection 
>>>>>>>>>> mentioned
>>>>>>>>>> above?)
>>>>>>>>>>
>>>>>>>>>> Thanks again,
>>>>>>>>>>
>>>>>>>>>> Paul
>>>>>>>>>>
>>>>>>>>>> *Paul Farrell*
>>>>>>>>>> Senior Search Consultant
>>>>>>>>>>
>>>>>>>>>> 109-123 Clifton Street, London EC2A 4LD
>>>>>>>>>> *T* +44 (0) 207 183 6865 | funnelback.com
>>>>>>>>>> <http://www.funnelback.com/>
>>>>>>>>>>
>>>>>>>>>> *UNITED KINGDOM* | AUSTRALIA | NEW ZEALAND | POLAND | UNITED
>>>>>>>>>> STATES
>>>>>>>>>>
>>>>>>>>>> Connect with us: LinkedIn
>>>>>>>>>> <http://www.linkedin.com/company/funnelback> - Twitter
>>>>>>>>>> <https://twitter.com/funnelback>
>>>>>>>>>>
>>>>>>>>>> Funnelback UK Ltd is a limited liability company registered in
>>>>>>>>>> England & Wales. Registered address: Zetland House 109-123, Clifton 
>>>>>>>>>> Street,
>>>>>>>>>> London. EC2A 4LD. Company registration number: 07004264.
>>>>>>>>>>
>>>>>>>>>> On 19 Oct 2015, at 15:12, Muhammed Olgun <[email protected]>
>>>>>>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>> Hi Paul,
>>>>>>>>>>
>>>>>>>>>> Repositories should give information to ManifoldCF when they
>>>>>>>>>> updated. Current CMIS connector reindex document if the lastest 
>>>>>>>>>> version of
>>>>>>>>>> the document has changed, not updated.
>>>>>>>>>>
>>>>>>>>>> There is a change token property in CMIS specification and it
>>>>>>>>>> should change when document is updated so ManifoldCF can understand 
>>>>>>>>>> that
>>>>>>>>>> document is updated but implementing change token property is 
>>>>>>>>>> optional.
>>>>>>>>>> I've checked Alfresco's CMIS web site and seen that they didn't set 
>>>>>>>>>> the
>>>>>>>>>> change token.
>>>>>>>>>>
>>>>>>>>>> I think, there is nothing we can do at this point.
>>>>>>>>>>
>>>>>>>>>> 19 Eki 2015 Pzt, 15:59 tarihinde, Karl Wright <[email protected]>
>>>>>>>>>> şunu yazdı:
>>>>>>>>>>
>>>>>>>>>>> Hi Paul,
>>>>>>>>>>>
>>>>>>>>>>> This looks like a bug in the CMIS connector to me; usually the
>>>>>>>>>>> document version string the connector constructs should be adequate 
>>>>>>>>>>> to
>>>>>>>>>>> detect all changes.  Can you create a ticket?
>>>>>>>>>>> https://issues.apache.org/jira , project ManifoldCF.  Please
>>>>>>>>>>> include what version of MCF you are using here.  FWIW, this may be 
>>>>>>>>>>> in fact
>>>>>>>>>>> a bug in the Alfresco CMIS implementation, but we'll have to have 
>>>>>>>>>>> some back
>>>>>>>>>>> and forth before I can determine that for sure.
>>>>>>>>>>>
>>>>>>>>>>> In the meantime, have you considered using the Alfresco
>>>>>>>>>>> Webscript connector?  It's the preferred way to do Alfresco 
>>>>>>>>>>> indexing,
>>>>>>>>>>> although there have been issues reported having to do with running 
>>>>>>>>>>> it on
>>>>>>>>>>> some configurations of Alfresco.  I'm not entirely sure what the 
>>>>>>>>>>> problem is
>>>>>>>>>>> there; maybe a version dependency of some kind.
>>>>>>>>>>>
>>>>>>>>>>> Karl
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> On Mon, Oct 19, 2015 at 7:43 AM, Paul Farrell <
>>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> Hi Everyone,
>>>>>>>>>>>>
>>>>>>>>>>>> Hoping someone may be able to advise.
>>>>>>>>>>>>
>>>>>>>>>>>> I am currently using Manifold, together with a CMIS connector,
>>>>>>>>>>>> to retrieve and index content from an Alfresco repository.
>>>>>>>>>>>>
>>>>>>>>>>>> All is going well apart from, what I would call, the
>>>>>>>>>>>> ‘incremental crawl’.
>>>>>>>>>>>>
>>>>>>>>>>>> The main issue I am having is that the modification of a
>>>>>>>>>>>> document’s security settings, in Alfresco, is not being picked up 
>>>>>>>>>>>> in next
>>>>>>>>>>>> Manifold crawl. As an example I have a document ‘TestDoc1’ which 
>>>>>>>>>>>> has user A
>>>>>>>>>>>> and B as Consumers. I run a crawl in Manifold and it picks up the 
>>>>>>>>>>>> documents
>>>>>>>>>>>> fine.  The security is set as expected. I then remove ‘User A’ 
>>>>>>>>>>>> from the
>>>>>>>>>>>> security of that document and re-run the Manifold crawl. User A 
>>>>>>>>>>>> can still
>>>>>>>>>>>> see the document in the local search engine.
>>>>>>>>>>>>
>>>>>>>>>>>> It is as if Manifold is not treating the security update as a
>>>>>>>>>>>> ‘modification’ and is therefore not refreshing it. Note that if I 
>>>>>>>>>>>> go into
>>>>>>>>>>>> the Output Connections, edit and save the relevant output 
>>>>>>>>>>>> connection and
>>>>>>>>>>>> then click ‘Remove all associated documents’, the next time I 
>>>>>>>>>>>> crawl, the
>>>>>>>>>>>> changes are picked up. It is clear that Manifold is just not 
>>>>>>>>>>>> updating
>>>>>>>>>>>> whatever internal record it has for this item.
>>>>>>>>>>>>
>>>>>>>>>>>> Any ideas?
>>>>>>>>>>>>
>>>>>>>>>>>> Many thanks.
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>>
>>
>


Reply via email to