I've created ticket CONNECTORS-1104 to cover this work. But please add as many details as you are aware of. If it is an authenticated proxy, for instance, we currently could only support basic auth and NTLM, but not really kerberos. (There is a separate ticket for Kerberos support but that requires a new release of HttpClient to properly be implemented.)
Thanks, Karl On Mon, Nov 17, 2014 at 6:09 AM, Karl Wright <[email protected]> wrote: > Hi Jitu, > > Is this a Windows proxy? Is it an authenticated proxy? > > Karl > > On Mon, Nov 17, 2014 at 5:07 AM, Jitu <[email protected]> wrote: > >> Hi Karl, >> Thanks for continuous support. >> >> The customer has several office locations in geographically distributed >> regions. Their SharePoint server is centrally hosted in US but all other >> offices across globe use this SharePoint. And today when they connect using >> the browser, they set up their internet proxy(url & port) as each of these >> offices are behind local firewall. All outgoing internet/WAN requests must >> go through this internet proxy else it never connects to the internet. >> >> In nutshell, I think we need to append the HTTPProxy to the method below >> from >> org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.java >> that build the HTTPClient to connect to the SharePoint using SOAP call at >> line#264 to make sure the manifold can connect to the SharePoint server >> required to be connected using internet proxy >> >> HttpClientBuilder builder = HttpClients.custom() >> .setConnectionManager(connectionManager) >> .setMaxConnTotal(1) >> .disableAutomaticRetries() >> .setDefaultRequestConfig(requestBuilder.build()) >> .setDefaultSocketConfig(SocketConfig.custom() >> .setTcpNoDelay(true) >> .setSoTimeout(socketTimeout) >> .build()) >> .setDefaultCredentialsProvider(credentialsProvider); >> >> Thanks, >> Jitu >> >> On Tue, Oct 7, 2014 at 7:56 PM, Karl Wright <[email protected]> wrote: >> >>> Hi Jitu, >>> >>> There are tons of different ways of authenticating using HTTP. I need >>> to know which one is being used here. Basic auth? NTLM? Kerberos? etc. >>> >>> Karl >>> >>> >>> On Tue, Oct 7, 2014 at 10:24 AM, Jitu <[email protected]> wrote: >>> >>>> Hi Karl, >>>> Its basically web proxy server via http or secure web proxy >>>> server via https which takes server address, port and username, password in >>>> case if proxy server requires authentication as per attached screenshot. >>>> >>>> proxy source code will be something similar as shown in below link. >>>> >>>> https://hc.apache.org/httpcomponents-client-4.3.x/httpclient/examples/org/apache/http/examples/client/ClientProxyAuthentication.java >>>> >>>> Thanks, >>>> Jitu >>>> >>>> On Mon, Oct 6, 2014 at 2:56 PM, Karl Wright <[email protected]> wrote: >>>> >>>>> Hi Jitu, >>>>> >>>>> Heretofore, nobody has required proxy support in connecting to >>>>> SharePoint. However, it is relatively straightforward to add. But you >>>>> will need to determine what kind of proxy is being used before creating a >>>>> ticket for this work. Specifically, is this a Windows proxy, with >>>>> authentication? If it is, what kind of authentication is being used? >>>>> >>>>> Thanks, >>>>> Karl >>>>> >>>>> >>>>> On Mon, Oct 6, 2014 at 1:26 AM, Jitu <[email protected]> wrote: >>>>> >>>>>> Hi Karl, >>>>>> >>>>>> Thanks for your continuous support. we have a requirement where our >>>>>> client has setup manifoldcf in one server and sharepoint in another. Now >>>>>> to >>>>>> crawl sharepoint files manifoldcf needs proxy information because they >>>>>> have >>>>>> corporate level proxy settings. Please let us know if there is a way to >>>>>> set >>>>>> proxy while crawling. >>>>>> >>>>>> Thanks, >>>>>> Jitu >>>>>> >>>>> >>>>> >>>> >>> >> >
