adapidenied"/>
>
> it could be used to handle "session expired" situation, but should be
> checked on every API request...
>
>
> 2012/9/10 Karl Wright (JIRA)
>
>>
>> [
>> https://issues.apache.org/jira/browse/CONNECTORS-518?page=com.atlas
Usually in these situations Solr returns a 500 error. The Solr
Connector, at one point, used to retry indefinitely when such an error
came back, but I believe there were changes to this logic and now it
may well abort the job if this happens for more than a few hours
straight. This is because the
Piergiorgio and I got things working on his box, so +1 from me.
On Sun, Sep 9, 2012 at 12:52 PM, Karl Wright wrote:
> Please vote +1 if you think the SharePoint 2010 plugin is ready for release.
>
> Tag in the usual place
> (https://svn.apache.org/repos/asf/manifoldcf/integration/sha
Only one more vote needed!
Karl
On Mon, Sep 10, 2012 at 2:16 PM, Piergiorgio Lucidi
wrote:
> We have just finishing crawling my SharePoint 2010 instance correctly :)
>
> +1 from me.
>
> Piergiorgio
>
> 2012/9/10 Karl Wright
>
>> Piergiorgio and I got things wor
Hi Erlend,
I posted a warning about this a few days back. You need to rerun ant
make-core-deps.
On Tue, Sep 11, 2012 at 9:02 AM, Erlend Garåsen wrote:
> BUILD FAILED
> /Users/erlendfg/tmp/mcf_2012/build.xml:870: The following error occurred
> while executing this line:
> /Users/erlendfg/tmp/mcf
91 folder documents(ppt, doc, etc).
> 3) SharePoint List that has 798 items.
> 4) Document Library that contains 127 aspx files.
>
> Here is my +1 for this vote.
>
> Ahmet
>
> --- On Mon, 9/10/12, Karl Wright wrote:
>
>> From: Karl Wright
>> Subject: Re: [V
d, Sep 5, 2012 at 2:07 PM, Karl Wright wrote:
> I don't think there are any hard rules about what constitutes a 1.0
> release, except perhaps some subjective measure of completeness, and
> some measure of backwards compatibility support. For example, Lucene
> insures that every m
at the Versioning page [2], I don't see any problem.
>
> Piergiorgio
>
> [1] -
> http://maven.apache.org/ref/3.0.4/maven-artifact/xref/org/apache/maven/artifact/versioning/DefaultArtifactVersion.html
> [2] - http://docs.codehaus.org/display/MAVEN/Versioning
>
> 2012/9/13
We're not quite ready, but presuming we finish up with the
CONNECTORS-515 ticket on time, does anyone want to be the release
engineer for the ManifoldCF 1.0 release? The process is documented in
the wiki if you want to read about it.
Karl
Thanks!
Yes, the "how to build" page never was translated, so I think unless
Abe-san has much time on his hands it will not happen now.
The missing translations are covered in a ticket, I think, which is
assigned to Hitoshi Ozawa.
Karl
On Thu, Sep 20, 2012 at 9:43 AM, Erlend Garåsen wrote:
>
>
Please vote +1 to release ManifoldCF 1.0, RC0. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC0
Karl
the ImportConfiguration tool.
>
> Otherwise, everything seems to work as expected. I have tested:
> - Deployment on Resin 4 on Linux
> - ant test on OS X.
> - My new file encryption function.
>
> I'll wait with my vote until the above behaviour is explained.
>
wrote:
> On 21.09.12 14.07, Karl Wright wrote:
>
>> If you think the agents process is running, and yet you cannot delete
>> a job, you should see stuff in the manifoldcf log that would indicate
>> what the trouble likely is.
>
>
> solr-test02 mcf-1 $ /www/var/data
en wrote:
> On 21.09.12 14.47, Karl Wright wrote:
>>
>> A temporary error should not block a (non running) job from getting
>> cleaned up. The job can't be deleted until it is stopped, and no
>> outstanding documents are being worked on.
>>
>>
Thanks - have a good weekend.
I will also withhold my vote for this RC until this is ironed out.
What I'm worried about is that all of the startup/shutdown model
changes might have impacted ManifoldCF running on Resin, so I think it
is essential we wait until this is resolved before declaring any
to
delete jobs and connections when you upgrade. Nor is
unregistering/reregistering your connectors.
I'm going to spin a new RC with the NPE fix and a bunch of other
things - but I'm pretty certain the mystery is solved.
Thanks,
Karl
On Fri, Sep 21, 2012 at 10:17 AM, Karl Wright wrot
Withdrawn; spinning RC1 now.
Karl
On Fri, Sep 21, 2012 at 2:55 PM, Karl Wright wrote:
> Hi,
>
> I see what has happened here. You unregistered the connectors before
> you deleted the job. That basically meant that the job cleanup can't
> take place until the connecto
Please vote +1 to release ManifoldCF 1.0, RC1. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC1
Fixes since RC0:
CONNECTORS-532
CONNECTORS-533
CONNECTORS-
Examined the distribution for leakage of files that shouldn't be
there, ran ant rat-sources, and ran all tests.
+1 from me.
Karl
On Fri, Sep 21, 2012 at 3:48 PM, Karl Wright wrote:
> Please vote +1 to release ManifoldCF 1.0, RC1. The release artifact
> can be found a
Hi, please see below:
On Mon, Sep 24, 2012 at 9:59 AM, Erlend Garåsen wrote:
> On 21.09.12 20.55, Karl Wright wrote:
>
>> I see what has happened here. You unregistered the connectors before
>> you deleted the job. That basically meant that the job cleanup can't
olved this issue.
> I tested other functionalities and it seems all ok.
>
> So I think that if we build another RC, this time it could be ok :)
>
> Piergiorgio
>
> 2012/9/23 Karl Wright
>
>> Examined the distribution for leakage of files that shouldn't be
>>
Found an issue serious enough to block release. CONNECTORS-539.
Canceling the vote on RC1.
Karl
On Mon, Sep 24, 2012 at 10:24 AM, Karl Wright wrote:
> I think this is serious enough for a new RC.
>
> But first, let's try to be sure there aren't any other major issues...
Please vote +1 to release ManifoldCF 1.0, RC2. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC2
Fixes since RC1:
CONNECTORS-538
CONNECTORS-539
Karl
p 24, 2012 at 10:09 AM, Karl Wright wrote:
> Hi, please see below:
>
> On Mon, Sep 24, 2012 at 9:59 AM, Erlend Garåsen
> wrote:
>> On 21.09.12 20.55, Karl Wright wrote:
>>
>>> I see what has happened here. You unregistered the connectors before
>>> you
lassNotFoundException: org.apache.manifoldcf.apiservlet.APIServlet
> :
> :
> 6156 [main] WARN /mcf-api-service - unavailable
> javax.servlet.UnavailableException:
> org.apache.manifoldcf.apiservlet.APIServlet
> at org.eclipse.jetty.servlet.Holder.doStart(Holder.java:91)
>
The jar is missing from the proprietary war. I've not been running
that. So yes, I think this should be fixed, and the candidate
withdrawn.
Karl
On Tue, Sep 25, 2012 at 3:19 AM, Karl Wright wrote:
> Hmm, I don't see that here.
> Karl
>
>
> On Mon, Sep 24, 2012 a
Please vote +1 to release ManifoldCF 1.0, RC3. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC3
Fixes since RC2:
CONNECTORS-540
ary in the same way
> - checked signatures
>
> +1 from me
>
> Piergiorgio
>
> 2012/9/25 Karl Wright
>
>> Please vote +1 to release ManifoldCF 1.0, RC3. The release artifact
>> can be found at:
>>
>> http://people.apache.org/~kwright/apach
ar.gz) on OS X (64 bit Java 1.6). Ran ant test,
> ant doc (and viewed the build content with Firefox( and ant build.
> - Started MCF with Jetty from dist/example and dist/multiprocess-example
> (HSQLDB).
>
> Erlend
>
>
> On 25.09.12 12.17, Karl Wright wrote:
>>
>&
ll have constructed
> *three war files* ..."
>
> I can test this war file more properly if I get some basic information about
> it.
>
> Erlend
>
>
> On 26.09.12 12.47, Karl Wright wrote:
>>
>> Has anyone tried to deploy the combined war on any applicatio
I have asked someone with
> better server skills and Resin knowledge.
>
> Erlend
>
>
> On 26.09.12 15.02, Erlend Garåsen wrote:
>>
>> On 26.09.12 14.39, Karl Wright wrote:
>>>
>>> I didn't do documentation (or tests) because it is experimental
will find out why tomorrow after our Solr meeting.
>>
>> If I get a reply this evening, I will try to do a new test from home.
>>
>> Erlend
>>
>> On 26.09.12 17.55, Karl Wright wrote:
>>>
>>> Usually application servers unpack the war somewhere. U
Withdrawing candidate to fix CONNECTORS-544.
Karl
On Thu, Sep 27, 2012 at 1:02 PM, Karl Wright wrote:
> I wrote a test for it here. Unfortunately, it fails because the war
> does not include some key classes.
>
> I think even though this is an experimental feature, this is
> suff
Please vote +1 to release ManifoldCF 1.0, RC4. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC4
Fixes since RC3:
CONNECTORS-544
(also added example script
Ran all tests, tried the combined war, tried single-process example
and multiprocess example.
+1 (and I really hope this is the last RC for this release)
Karl
On Thu, Sep 27, 2012 at 5:30 PM, Karl Wright wrote:
> Please vote +1 to release ManifoldCF 1.0, RC4. The release artifact
>
Another serious build and example-related problem found: CONNECTORS-545.
Karl
On Thu, Sep 27, 2012 at 8:34 PM, Karl Wright wrote:
> Ran all tests, tried the combined war, tried single-process example
> and multiprocess example.
>
> +1 (and I really hope this is the last RC for
Please vote +1 to release ManifoldCF 1.0, RC5. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5
Fixes since RC4:
CONNECTORS-545
Fixes since RC3:
CONNECT
will end this job before I leave because I'm afraid that MCF will try to
> fetch these documents over and over again during this weekend.
>
> Erlend
>
>
> On 28.09.12 09.58, Karl Wright wrote:
>>
>> Please vote +1 to release ManifoldCF 1.0, RC5. The release ar
>
> I'm pretty sure they are related to each other.
>
> I will end this job before I leave because I'm afraid that MCF will try to
> fetch these documents over and over again during this weekend.
>
> Erlend
>
>
> On 28.09.12 09.58, Karl Wright wrote:
CONNECTORS-547 (index out of bounds)
CONNECTORS-548 (cannot build with maven)
Karl
On Fri, Sep 28, 2012 at 7:26 AM, Karl Wright wrote:
> "Meanwhile, the following is filling up my log:
> FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed:
> Strin
>> result code.
>>>
>>> Meanwhile, the following is filling up my log:
>>> FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: String
>>> index out of range: -1
>>> java.lang.StringIndexOutOfBoundsException: String index
Please vote +1 to release ManifoldCF 1.0, RC6. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC6
Fixes since RC5:
CONNECTORS-547
CONNECTORS-548 (documentat
Exercised it as I have before, and added postgresql and mysql tests.
+1
Karl
On Fri, Sep 28, 2012 at 9:38 AM, Karl Wright wrote:
> Please vote +1 to release ManifoldCF 1.0, RC6. The release artifact
> can be found at:
>
> http://people.apache.org/~kwright/apache-manifoldcf-1.0
Withdrawn because of CONNECTORS-549.
Karl
On Sun, Sep 30, 2012 at 8:07 AM, Piergiorgio Lucidi
wrote:
> -1 from me.
>
> I found an issue on the CMIS Connector (CONNECTORS-549), I'm committing the
> patch to fix the problem.
> We need another RC.
>
> Piergiorgio
Please vote +1 to release ManifoldCF 1.0, RC7. The release artifact
can be found at:
http://people.apache.org/~kwright/apache-manifoldcf-1.0
There is also an SVN tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC7
Fixes since RC6:
CONNECTORS-549
Fixes since RC5:
CONNECT
Ran all tests, just to be sure nothing got messed up. +1 from me.
Karl
On Sun, Sep 30, 2012 at 11:34 AM, Piergiorgio Lucidi
wrote:
> +1 from me:
>
> - checked signatures
> - crawling against a CMIS repository
> - tried to build using Maven and Ant
>
> Piergiorgio
>
No stack trace needed. If you read the rest of the mail, you will
note that I was able to reproduce the issue using the URL you had
provided. There have been two RC's since; we are on RC7 now.
Karl
On Tue, Oct 2, 2012 at 4:38 AM, Erlend Garåsen wrote:
> On 28.09.12 13.31, Erlend Garåsen wrote
>
> 'ant all' command yields:
>
> BUILD FAILED
> /manifoldcf/tags/release-1.0-RC7/build.xml:2529: A zip file cannot include
> itself
>
> Ahmet
>
>
> --- On Sun, 9/30/12, Karl Wright wrote:
>
>> From: Karl Wright
>> Subject: Re: [VO
Oh, and by the way, I used "ant image" to create the release, which
calls the "create-source-zip" target. So it's still working for me
just fine. Hmmm.
Karl
On Tue, Oct 2, 2012 at 2:41 PM, Karl Wright wrote:
> The ant build hasn't changed in this regard, but
Mine is:
C:\wip\mcf\trunk>which ant
c:\ant\apache-ant-1.8.4\bin/ant.bat
Are you running on Windows, or Linux?
Karl
On Tue, Oct 2, 2012 at 2:54 PM, Ahmet Arslan wrote:
>> This used to exclude the zip from itself. What version
>> of ant are you using?
>
> Apache Ant(TM) version 1.8.2 compiled on
Can you try changing the clause I noted earlier to
remove the leading "/" from "/apache-manifoldcf-*", and see if that
works?
I'd hesitate to spin a new kit here unless there's evidence there's a
viable fix.
Karl
On Tue, Oct 2, 2012 at 3:01 PM, Ahmet Arslan wrote:
>
>> Are you running on Windo
Ok - this is likely a windows/linux difference then.
I'll have to try it out on my linux system when I get home to be sure
I catch all the pertinent cases.
In any case, I'm not sure this is worth a respin, since the place it
fails is building a distribution zip. Does everyone agree?
Karl
On Tu
CONNECTORS-550 is the ticket.
Karl
On Tue, Oct 2, 2012 at 3:26 PM, Karl Wright wrote:
> Ok - this is likely a windows/linux difference then.
>
> I'll have to try it out on my linux system when I get home to be sure
> I catch all the pertinent cases.
>
> In any case, I
tpParser.parseAvailable(HttpParser.java:218)
> at
> org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:51)
> at
> org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:586)
> at
> org.eclipse.jetty.io.nio.SelectChanne
Hi Maciej,
It sounds like your loop condition must be somehow incorrect. You may
not receive the full number of documents specified by
getMaxDocumentRequest(), but rather a number less than that.
We have a number of connectors that use document batches > 1, e.g. the
LiveLink connector, so this i
To all,
Apache ManifoldCF 1.0 has been released. This release introduces
support for SharePoint 2010, a new LDAP authority, many bug fixes, a
new experimental deployment model (single-process combined war), and
full MySQL support. Details can be found at:
http://www.apache.org/dist/manifoldcf/a
Hi Maciej,
Did you intend to send this to the Solr/Lucene dev list? This really
isn't a ManifoldCF question.
I can help a little perhaps. You are correct that stemming and
normalization rules might well differ from language to language, but
it is worth noting that for at least normalization it
when hitting abort - nothing happens (job process remains in
> "aborting" state)...
>
> Problem is that it happens irregularly (sometime 10 documents,
> sometime 1600 and sometime all documents are indexed). Tried to check
> that locally but on first pass everything went ok
FWIW, getting thread dumps from the process running the agents process
when it is "hung" may (or may not) help determine the underlying
clause.
Karl
On Tue, Oct 9, 2012 at 9:21 AM, Karl Wright wrote:
> What is your deployment model? Is this a multiprocess deployment?
> What
d
> just disappeared...
>
> How to enable debug logs so I could see verbose output from core functions?
>
>
> 2012/10/9 Karl Wright :
>> FWIW, getting thread dumps from the process running the agents process
>> when it is "hung" may (or may not) help determ
What JVM are you using? Because frankly this cannot logically happen.
The only other possibility is that your code is somehow throwing
ManifoldCFExceptions of type ManifoldCFException.INTERRUPTED.
Karl
On Tue, Oct 9, 2012 at 10:20 AM, Maciej Liżewski
wrote:
> 2012/10/9 Karl Wri
ds are spawn...
>
>
>
>
> 2012/10/9 Maciej Liżewski :
>> 2012/10/9 Karl Wright :
>>> "- all worker threads are gone," ???
>>>
>>> Really??
>>
>> yes... really.. this is why I am also writing that this is strange...
>> this is lis
hat is possible :)
> what exactly ManifoldCFException.INTERRUPTED is doing that could cause
> such effects?
>
>
> 2012/10/9 Karl Wright :
>> What JVM are you using? Because frankly this cannot logically happen.
>>
>> The only other possibility i
ifoldcf.apache.org
>> Date: Wednesday, September 5, 2012, 1:50 PM
>> Hi Ahmet,
>>
>> a warm welcome from Rome!
>>
>> ^__^
>>
>> Piergiorgio
>>
>> 2012/9/5 Erlend Garåsen
>>
>> >
>> > Welcome to the MCF community, Ahmet!
>> >
>&
Hi folks,
Due to the potential severity of CONNECTORS-551, I think it might be a
good idea to release a ManifoldCF 1.0.1 release which contains the fix
for this ticket. Please can I have a show of "hands" as to whether
people agree that this is serious enough to warrant such a release.
Thanks!
K
Ok, it looks like there is consensus. I will prepare the
release-1.0-branch appropriately, and create a release candidate.
Karl
On Wed, Oct 10, 2012 at 5:28 AM, Erlend Garåsen wrote:
> +1
>
> Erlend
>
>
> On 09.10.12 22.53, Karl Wright wrote:
>>
>> Hi folks,
>&
Vote +1 to release Apache ManifoldCF 1.0.1, RC0. You can find a tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0.1-RC0
The artifact can be downloaded from:
http://people.apache.org/~kwright/apache-manifoldcf-1.0.1
This patch release fixes the critical bug CONNECTORS-551. I
Ran tests and checked documentation.
+1 from me.
Karl
On Sat, Oct 13, 2012 at 6:16 PM, Karl Wright wrote:
> Vote +1 to release Apache ManifoldCF 1.0.1, RC0. You can find a tag at:
>
> https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0.1-RC0
>
> The artifact can be
Looks great - hope it goes well! I wish I could come but I *just* got
back from Berlin yesterday, and I have responsibilities here thru Oct
31.
Karl
On Mon, Oct 15, 2012 at 4:17 AM, Piergiorgio Lucidi
wrote:
> Hi guys,
>
> I would like to share with you that at the next LinuxDay here in Rome I'
Looks like maven build is busted, see CONNECTORS-555.
Karl
On Sun, Oct 14, 2012 at 4:32 AM, Karl Wright wrote:
> Ran tests and checked documentation.
>
> +1 from me.
>
> Karl
>
> On Sat, Oct 13, 2012 at 6:16 PM, Karl Wright wrote:
>> Vote +1 to release Apache Mani
Sounds great! I can't wait to see it.
Karl
On Mon, Oct 15, 2012 at 6:31 AM, Erlend Garåsen wrote:
>
> Me and Karl had a short discussion about such a connector in Cambridge for
> some months ago. Now I have created the following ticket regarding an Email
> Connector:
> https://issues.apache.or
As far as I'm concerned, either coding style is OK. But having the
wrong number of spaces for indent is not. Nor is using tabs instead
of spaces.
The only other rule I think we should really enforce is for any
significant changes to the style of a class to be checked in or
patched independently
Vote +1 to release Apache ManifoldCF 1.0.1, RC1. You can find a tag at:
https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0.1-RC1
The artifact can be downloaded from:
http://people.apache.org/~kwright/apache-manifoldcf-1.0.1
This patch release fixes the critical bug CONNECTORS-551. I
Downloaded artifacts, looked for leakage of maven "target"
directories, and tried a maven build, which worked.
+1 from me.
Karl
On Mon, Oct 15, 2012 at 7:34 PM, Karl Wright wrote:
> Vote +1 to release Apache ManifoldCF 1.0.1, RC1. You can find a tag at:
>
> https://svn.a
Please join me in welcoming Maciej as our newest committer and PMC
member. Maciej brings with him knowledge of wikis, LDAP, and mail
protocols, among many other skills. Welcome, Maciej!
Karl
Hi Maciej,
First advice is to post questions of this kind to
dev@manifoldcf.apache.org. This functions in part as a repository of
general knowledge, and it is searchable, so in the future others can
maybe refer to answers there.
Please see below for detailed answers.
On Wed, Oct 17, 2012 at 4:3
changes are only in
> this single connector?
>
You always need to create a whole copy of trunk. svn is very
efficient about copies so this is not a problem.
Karl
> 2012/10/17 Karl Wright
>
>> Hi Maciej,
>>
>> First advice is to post questions of this kind to
>
gt;> - executed integration tests
>> - checked signatures
>> - crawled a CMIS repository
>>
>> +1 from me
>>
>> PJ
>>
>> 2012/10/16 Karl Wright
>>
>> > Downloaded artifacts, looked for leakage of maven
>> "target"
>>
This bug-fix release corrects one critical functional problem, which
was detected after 1.0 was released, plus fixes the broken maven
build. Plus we've included reworked how-to-build-and-deploy
documentation.
Please let me know of any problems!
Karl
te ;)
>
> Piergiorgio
>
> [1] -
> http://www.open4dev.com/journal/2012/10/28/apache-manifoldcf-at-linuxday-slides.html
>
> 2012/10/15 Karl Wright
>
>> Looks great - hope it goes well! I wish I could come but I *just* got
>> back from Berlin yesterday, and I have responsibili
s.
> At the end of the next week, after the Alfresco DevCon in Berlin, I can
> work on this task for updating the website ;)
>
> Piergiorgio
>
> [1] -
> http://www.open4dev.com/journal/2012/10/28/apache-manifoldcf-at-linuxday-slides.html
>
> 2012/10/15 Karl Wright
>
>
Hi Ahmet and/or Piergiorgio,
If you still have access to your SharePoint instances, I've got a
favor to ask. Can you check out
https://svn.apache.org/repos/asf/manifoldcf/branches/CONNECTORS-120,
run "ant make-core-deps", and "ant build", and then try the SharePoint
connector? I've converted ove
ache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
> at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
> at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknow
Ok, can you try it now? I think it is fixed.
Karl
On Sat, Nov 10, 2012 at 1:20 PM, Karl Wright wrote:
> Thanks - it looks like I will need to have it go through a temporary
> local file in order to work right. I'll make that change and let you
> know.
>
> Karl
>
>
eam.(FileInputStream.java:116)
> at
> org.apache.manifoldcf.crawler.connectors.sharepoint.CommonsHTTPSender$FileBackedInputStream.(CommonsHTTPSender.java:488)
> at
> org.apache.manifoldcf.crawler.connectors.sharepoint.CommonsHTTPSender$ExecuteMethodThread.run(CommonsHTTP
Great!
I've pulled this code up into trunk. Please, if anyone notices any
problems, please let me know.
Karl
On Sat, Nov 10, 2012 at 10:10 PM, Ahmet Arslan wrote:
> Hi Karl,
>
> Its all working for me now :)
>
> Thanks,
> Ahmet
>
> --- On Sun, 11/11/12, Karl W
Hi all,
The branch https://svn.apache.org/repos/asf/manifoldcf/branches/CONNECTORS-120
contains an RSS connector that has been updated to use httpcomponents
4.2.2. I'd love for people who are in a position to do significant
RSS crawling to try it out before I pull it into trunk. Any takers?
Kar
-1
> ERROR 2012-11-17 23:02:27,329 (Worker thread '30') -
> Exception tossed: Repeated service interruptions - failure
> getting document version
> org.apache.manifoldcf.core.interfaces.ManifoldCFException:
> Repeated service interruptions - failure getting document
> ver
o other servers...
Karl
On Sun, Nov 18, 2012 at 3:07 AM, Karl Wright wrote:
> Odd. The problem is obviously the port of -1. But the code does not
> attach a specific port to the URL in that case.
>
> I will try your example exactly when I have access to internet again.
>
> Karl
&
The CONNECTORS-120 branch now also has a httpcomponents version of the
wiki connector implemented. I think Maciej might be interested in
trying that one out.
Karl
On Sun, Nov 18, 2012 at 1:04 PM, Karl Wright wrote:
> Hi Ahmet,
>
> I tried your example, but it looked like it worked
ww.hurriyet.com.tr/robots.txt exists.
>>
>> Ahmet
>>
>> --- On Sun, 11/18/12, Karl Wright wrote:
>>
>> > From: Karl Wright
>> > Subject: Re: Anyone out there using RSS connector, who wants to help?
>> > To: "Ahmet Arslan" , "
I've ported the web connector, finally, to httpcomponents 4.2.2. This
was a lot of work. The areas I anticipate there will be problems will
be in exception handling and in session login. If anyone has
session-protected sites they typically crawl, it would be great if you
could try this code out!
e.
Thanks!
Karl
On Tue, Nov 20, 2012 at 7:11 AM, Karl Wright wrote:
> Thanks for the update!
>
> I'm working on the web connector now. That's going to require a bit more
> work.
>
> Karl
>
> On Tue, Nov 20, 2012 at 7:09 AM, Maciej Liżewski
> wrote:
>&g
Hello all committers,
This is a reminder that our next release is scheduled for 12/31/2012.
There are a number of fairly major open tickets out there that I think
are almost ready to be included in the release. I am especially
thinking of the ldap authority enhancements, and the simple JDBC
autho
Hi folks,
I've created a ManifoldCF 1.2 release in JIRA and triaged some tickets
I intend to work on for that release. I've also closed/resolved a
fair number of tickets that were hanging around marked "fix in
ManifoldCF next". You may want to do the same...
Karl
ManifoldCF scales based on how well the underlying database handles
two kinds of queries - direct access to a row via an index, and
reading from an index in ordered fashion. Both of these go up as
log(n) assuming b-trees.
I have personally done web crawls on the order of 5 million actual
content
The derby database support is not apparently able to discover database
indexes properly at this time, and that is causing derby crawls to
fail. I will be looking at this in detail this morning. Until then,
tests don't work, they hang...
Karl
Ok, this is now fixed.
Karl
On Wed, Dec 12, 2012 at 5:22 AM, Karl Wright wrote:
> The derby database support is not apparently able to discover database
> indexes properly at this time, and that is causing derby crawls to
> fail. I will be looking at this in detail this morning. U
DBDrop is in fact used internally by the tests we run. But you have
to do the uninstall sequence in the correct order, otherwise, as you
say, you are left with table dependencies.
The correct order is this:
org.apache.manifoldcf.crawler.UnRegisterAll
org.apache.manifoldcf.authorities.UnRegisterA
Hmm, somehow you lost a connector jar out of the connector-lib or
connector-lib-proprietary area. Deleting the jars before you clean up
the database is not going to work. ;-)
Karl
On Tue, Dec 18, 2012 at 10:26 AM, Erlend Garåsen
wrote:
>
> Yes, I know the order is important. I tried to follow t
401 - 500 of 9402 matches
Mail list logo