Thanks for looking, Gert. The problem seems to be that when indexing from
scratch, after creating an empty index and restarting solr, the resource is
indexed but only using its DC, so no searches on its dcterms work. It only
indexes on DCTERMS when the resource is indexed via pid.
Alistair
--
mov eax,1
mov ebx,0
int 80h
From: Gert Schmeltz Pedersen <gerts...@gmail.com>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Monday, 19 August 2013 12:35
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
Thank you, Alistair. Sorry that I replied to your previous email, the later one
was placed in my spam folder by my email client, and I only noticed it this
morning. I include that email in the history below for completeness.
These two log extracts:
  - the initial ingest from all foxml files for one specific resource that shows
    the behaviour - initial–ingest-fgs.log
  - the log from indexing that resource from pid - index–from-pid.log
both show the generated index document, and they are identical. As I said in
the previous reply, I thought that the problem was that one of them would lack
the dcterms fields. Since they are identical, which is as it should be, I think
I need your explanation again about what the problem is.
Gert
On 19/08/2013, at 10.26, Alistair Young wrote:
the later email should contain that info Gert. Just wondering if the initial
indexing gets hold of a cached version of the resource (before the DCTERMS
namespace was renamed to be the same as the XSLT).
thanks,
Alistair
--
mov eax,1
mov ebx,0
int 80h
From: Gert Schmeltz Pedersen <gerts...@gmail.com>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Friday, 16 August 2013 09:25
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
Thank you, Alistair. Your log lines show me part of one index document
generation, which contains the dct (dcterms) fields. However, if I understand
you right, the problem is that you get an index document without the dct
fields, when you call the updateIndex in one of the other ways (fromPid,
fromFoxmlFiles, triggered by ingest/update). So, in order to compare the
successful case and the problematic case, I need to see also the log lines from
the problematic index document generation, and I need to see all the log lines
of both operations, that is, from the INFO log line starting the operation to
the INFO log line at the end of the operation.
Gert
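The comparison Gert asks for can be partly automated once the generated index documents have been copied out of the two logs. The following is only a sketch, assuming the documents are in the usual Solr `<add><doc><field name="...">` update format; the field names and sample documents are made up for illustration and are not taken from the attached logs.

```python
import xml.etree.ElementTree as ET

# Hypothetical index documents, standing in for the ones copied from the logs.
DOC_FROM_PID = """<add><doc>
  <field name="PID">demo:1</field>
  <field name="dc.title">Example</field>
  <field name="dct.title">Example</field>
</doc></add>"""

DOC_FROM_SCRATCH = """<add><doc>
  <field name="PID">demo:1</field>
  <field name="dc.title">Example</field>
</doc></add>"""


def field_names(solr_add_xml):
    """Return the set of field names present in a Solr add document."""
    root = ET.fromstring(solr_add_xml)
    return {field.get("name") for field in root.iter("field")}


def missing_fields(reference_xml, candidate_xml):
    """List fields that are in the reference document but not in the candidate."""
    return sorted(field_names(reference_xml) - field_names(candidate_xml))


if __name__ == "__main__":
    print(missing_fields(DOC_FROM_PID, DOC_FROM_SCRATCH))
```

An empty result would mean both operations produced the same set of fields, in which case the difference has to be looked for further downstream, for example in the solr log.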
On 15/08/2013, at 10.58, Alistair Young wrote:
I've dug deeper into the logs, Gert, and attached are:
  - the initial ingest from all foxml files for one specific resource that shows
    the behaviour - initial–ingest-fgs.log
  - the log from indexing that resource from pid - index–from-pid.log
  - the corresponding solr log extracts for both indexing jobs - solr–extract.log
There appear to be two ingests initially for that resource into solr?
DEBUG - 2013-08-15 09:31:47.790
DEBUG - 2013-08-15 09:34:29.375
they're from the same indexing job, not run separately
thanks,
Alistair
--
On 14/08/2013, at 17.52, Alistair Young wrote:
Hope this is of some use…
Alistair
-----------------
mov eax,1
mov ebx,0
int 80
From: Gert Schmeltz Pedersen <gerts...@gmail.com>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Wednesday, 14 August 2013 16:15
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
Let me see the log lines.
Gert
On 14/08/2013, at 17.04, Alistair Young wrote:
Having said that, I have to force index via pid to get it to index the updated
resources.
Renamed all dcterms datastreams namespaces to be same as xslt in the
objectStore. Rebuilt the resourceIndex and database. Blew away the solr index.
Index from scratch and it won't index dcterms on the resources that had their
dcterms renamed. Have to force update on them by pid and then it indexes their
dcterms.
Any idea why an initial index would fail while an individual index via pid
would work?
Alistair
-----------------
mov eax,1
mov ebx,0
int 80
From: Alistair Young <alistair.yo...@uhi.ac.uk>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Wednesday, 14 August 2013 09:53
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
Thanks for that Gert. Finally tracked it down to different representations of
the same namespace in the repo. Some objects have their dcterms namespace
declared as per the xslt, others don't. I feel some metadata munging coming on.
Thanks for confirming the single xslt though. At the end of this hoping to
produce a tutorial on fedora/gsearch/solr. This seems to be the last thorn in
the bush.
Cheers,
Alistair
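To illustrate why "different representations of the same namespace" can stop the dcterms fields being indexed: XSLT and XPath match elements by namespace URI, not by prefix, so a URI that differs by even one character does not match. The sketch below assumes the xslt declares dcterms as http://purl.org/dc/terms/ and uses two made-up records; the `record` wrapper element and the mismatched URI (missing the trailing slash) are hypothetical examples, not the actual values from this repository.

```python
import xml.etree.ElementTree as ET

# Namespace URI the indexing xslt is assumed to declare for dcterms.
XSLT_DCTERMS_NS = "http://purl.org/dc/terms/"

# Hypothetical record whose namespace matches the xslt.
MATCHING = """<record xmlns:dcterms="http://purl.org/dc/terms/">
  <dcterms:title>Example</dcterms:title>
</record>"""

# Hypothetical record with the same prefix but a different URI (no trailing slash).
MISMATCHED = """<record xmlns:dcterms="http://purl.org/dc/terms">
  <dcterms:title>Example</dcterms:title>
</record>"""


def dcterms_titles(xml_text, ns_uri=XSLT_DCTERMS_NS):
    """Return title values visible to a namespace-aware match on ns_uri."""
    root = ET.fromstring(xml_text)
    return [el.text for el in root.iter("{%s}title" % ns_uri)]


def used_namespaces(xml_text):
    """Return the namespace URIs actually used by elements in the document."""
    root = ET.fromstring(xml_text)
    uris = set()
    for el in root.iter():
        if el.tag.startswith("{"):
            uris.add(el.tag[1:].split("}", 1)[0])
    return uris


if __name__ == "__main__":
    print(dcterms_titles(MATCHING))
    print(dcterms_titles(MISMATCHED))
    print(used_namespaces(MISMATCHED))
```

Running something like `used_namespaces` over the stored datastreams is one way to list which objects would need their namespace rewritten.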
From: Gert Schmeltz Pedersen <gerts...@gmail.com>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Tuesday, 13 August 2013 20:51
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
No separate xslt, gsearch uses the same indexing xslt, whether triggered by an
ingest/update of the Fedora object, or by an updateIndex fromPid operation, or
by an updateIndex fromFoxmlFiles operation. If you get different indexing
documents in these three cases, then the way to find out why, is to compare the
log lines in debug mode. I may help, if you send me those log lines.
Gert
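For reference, the updateIndex operations mentioned above can also be triggered over HTTP. The helper below only builds the request URL; it is a sketch that assumes the gsearch REST interface takes `operation`, `action`, `value`, `repositoryName` and `indexName` query parameters, and the host, port and webapp path in the example are placeholders to adjust for the local installation.

```python
from urllib.parse import urlencode


def update_index_url(base_url, action, value="",
                     repository_name=None, index_name=None):
    """Build a gsearch updateIndex URL (parameter names are assumptions)."""
    params = {"operation": "updateIndex", "action": action}
    if value:
        params["value"] = value
    if repository_name:
        params["repositoryName"] = repository_name
    if index_name:
        params["indexName"] = index_name
    # urlencode percent-encodes reserved characters such as the ':' in a pid.
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    # Placeholder base URL and pid, for illustration only.
    base = "http://localhost:8080/fedoragsearch/rest"
    print(update_index_url(base, "fromPid", "demo:1"))
    print(update_index_url(base, "fromFoxmlFiles"))
```

Requesting the resulting URL with debug logging enabled should produce the log lines needed to compare the fromPid and fromFoxmlFiles cases.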
On 13/08/2013, at 18.47, Alistair Young wrote:
Nup, not that. Even uploading the DCTERMS first, it doesn't index them. Is
there a separate xslt it uses for updating a resource? The symptoms are
identical to when the main xslt was lacking the dcterms namespace.
Alistair
-------------------
Alistair Young
Àrd Innleadair air Bathair-bog
UHI@Sabhal Mòr Ostaig
From: Alistair Young <alistair.yo...@uhi.ac.uk>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Tuesday, 13 August 2013 16:44
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
I think the problem is down to the multi part commit technique being used when
uploading to Fedora:
1 - new object pid is generated by a POST
2 - the DC datastream is then PUT
3 - the DCTERMS datastream is then PUT
4 - the content datastream is then POSTed
so I suspect 3 (or 4) is not getting to solr; I'm assuming gsearch indexes on 2
but not on 3. All the datastreams are present in the foxml files when doing a
new index from scratch so they work.
Is it possible to turn off auto indexing on upload and instead force it via a
REST call to gsearch once the complete resource is uploaded?
Alistair
--
mov eax,1
mov ebx,0
int 80h
From: Alistair Young <alistair.yo...@uhi.ac.uk>
Reply-To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Date: Tuesday, 13 August 2013 16:10
To: "Support and info exchange list for Fedora users."
<fedora-commons-users@lists.sourceforge.net>
Subject: [fcrepo-user] gsearch not indexing dcterms on upload
I think I must be missing a file mod or something. Indexing from scratch from
the foxml files works fine, and dcterms in all resources are indexed, as the
namespaces are in the xslt. When uploading a new resource to fedora it gets
indexed but only on dc, not dcterms. Does gsearch use a different xslt when
indexing from an upload rather than a clean start?
Alistair
--
mov eax,1
mov ebx,0
int 80h
------------------------------------------------------------------------------
Get 100% visibility into Java/.NET code with AppDynamics Lite!
It's a free troubleshooting tool designed for production.
Get down to code-level detail for bottlenecks, with <2% overhead.
Download for free and get started troubleshooting in minutes.
http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk
_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
<fglogs.tar.gz>