But we just saw from your logs that there were dcterm fields both on initial
ingest and via pid. So gsearch generates the expected index document, and I do
not think that solr would treat the same index document differently at two
different times. There must be something else going on.
Gert
On 19/08/2013, at 14.02, Alistair Young wrote:
> thanks for looking Gert. The problem seems to be, when indexing from scratch,
> after creating an empty index and restarting solr, the resource is indexed
> but only using its DC. So no searches on its dcterms work. It only indexes on
> DCTERMS when the resource is indexed via pid.
>
> Alistair
>
> --
> mov eax,1
> mov ebx,0
> int 80h
>
> From: Gert Schmeltz Pedersen <gerts...@gmail.com>
> Reply-To: "Support and info exchange list for Fedora users."
> <fedora-commons-users@lists.sourceforge.net>
> Date: Monday, 19 August 2013 12:35
> To: "Support and info exchange list for Fedora users."
> <fedora-commons-users@lists.sourceforge.net>
> Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
>
> Thank you, Alistair. Sorry that I replied to your previous email, the later
> one was placed in my spam folder by my email client, and I only noticed it
> this morning. I include that email in the history below for completeness.
>
> These two log extracts
>
>
>> the initial ingest from all foxml files for one specific resource that shows
>> the behaviour - initial–ingest-fgs.log
>>
>> the log from indexing that resource from pid - index–from-pid.log
>
>
> both show the generated index document, and they are identical. As I said in
> the previous reply, I thought that the problem was that one of them would
> lack the dcterms fields. Since they are identical, which is as it should be,
> I think I need your explanation again about what the problem is.
>
> Gert
>
>
>
> On 19/08/2013, at 10.26, Alistair Young wrote:
>
>> the later email should contain that info Gert. Just wondering if the initial
>> indexing gets hold of a cached version of the resource (before the DCTERMS
>> namespace was renamed to be the same as the XSLT).
>>
>> thanks,
>>
>> Alistair
>>
>> --
>> mov eax,1
>> mov ebx,0
>> int 80h
>>
>> From: Gert Schmeltz Pedersen <gerts...@gmail.com>
>> Reply-To: "Support and info exchange list for Fedora users."
>> <fedora-commons-users@lists.sourceforge.net>
>> Date: Friday, 16 August 2013 09:25
>> To: "Support and info exchange list for Fedora users."
>> <fedora-commons-users@lists.sourceforge.net>
>> Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
>>
>> Thank you, Alistair. Your log lines show me part of one index document
>> generation, which contains the dct (dcterms) fields. However, if I
>> understand you right, the problem is that you get an index document without
>> the dct fields, when you call the updateIndex in one of the other ways
>> (fromPid, fromFoxmlFiles, triggered by ingest/update). So, in order to
>> compare the successful case and the problematic case, I need to see also the
>> log lines from the problematic index document generation, and I need to see
>> all the log lines of both operations, that is, from the INFO log line
>> starting the operation to the INFO log line at the end of the operation.
>>
>> Gert
>
>
> On 15/08/2013, at 10.58, Alistair Young wrote:
>
>> I've dug deeper into the logs Gert and attached are:
>>
>> the initial ingest from all foxml files for one specific resource that shows
>> the behaviour - initial–ingest-fgs.log
>>
>> the log from indexing that resource from pid - index–from-pid.log
>>
>> and the corresponding solr log extracts for both indexing job -
>> solr–extract.log
>>
>> There appear to be two ingests initially for that resource into solr?
>> DEBUG - 2013-08-15 09:31:47.790
>> DEBUG - 2013-08-15 09:34:29.375
>> they're from the same indexing job, not run separately
>>
>> thanks,
>>
>> Alistair
>>
>> --
>>
>>
>> On 14/08/2013, at 17.52, Alistair Young wrote:
>>
>>> Hope this is of some use…
>>>
>>> Alistair
>>>
>>>
>>> -----------------
>>> mov eax,1
>>> mov ebx,0
>>> int 80
>>>
>>> From: Gert Schmeltz Pedersen <gerts...@gmail.com>
>>> Reply-To: "Support and info exchange list for Fedora users."
>>> <fedora-commons-users@lists.sourceforge.net>
>>> Date: Wednesday, 14 August 2013 16:15
>>> To: "Support and info exchange list for Fedora users."
>>> <fedora-commons-users@lists.sourceforge.net>
>>> Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
>>>
>>> Let me see the log lines.
>>>
>>> Gert
>>>
>>>
>>> On 14/08/2013, at 17.04, Alistair Young wrote:
>>>
>>>> Having said that, I have to force index via pid to get it to index the
>>>> updated resources.
>>>>
>>>> Renamed all dcterms datastreams namespaces to be same as xslt in the
>>>> objectStore. Rebuilt the resourceIndex and database. Blew away the solr
>>>> index. Index from scratch and it won't index dcterms on the resources that
>>>> had their dcterms renamed. Have to force update on them by pid and then it
>>>> indexes their dcterms.
>>>>
>>>> Any idea why an initial index would fail while an individual index via pid
>>>> would work?
>>>>
>>>> Alistair
>>>>
>>>> -----------------
>>>> mov eax,1
>>>> mov ebx,0
>>>> int 80
>>>>
>>>> From: Alistair Young <alistair.yo...@uhi.ac.uk>
>>>> Reply-To: "Support and info exchange list for Fedora users."
>>>> <fedora-commons-users@lists.sourceforge.net>
>>>> Date: Wednesday, 14 August 2013 09:53
>>>> To: "Support and info exchange list for Fedora users."
>>>> <fedora-commons-users@lists.sourceforge.net>
>>>> Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
>>>>
>>>> Thanks for that Gert. Finally tracked it down to different representations
>>>> of the same namespace in the repo. Some objects have their dcterms
>>>> namespace declared as per the xslt, others don't. I feel some metadata
>>>> munging coming on. Thanks for confirming the single xslt though. At the
>>>> end of this hoping to produce a tutorial on fedora/gsearch/solr. This
>>>> seems to be the last thorn in the bush.
>>>>
>>>> Cheers,
>>>>
>>>> Alistair
>>>>
>>>>
>>>> From: Gert Schmeltz Pedersen <gerts...@gmail.com>
>>>> Reply-To: "Support and info exchange list for Fedora users."
>>>> <fedora-commons-users@lists.sourceforge.net>
>>>> Date: Tuesday, 13 August 2013 20:51
>>>> To: "Support and info exchange list for Fedora users."
>>>> <fedora-commons-users@lists.sourceforge.net>
>>>> Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
>>>>
>>>> No separate xslt, gsearch uses the same indexing xslt, whether triggered
>>>> by an ingest/update of the Fedora object, or by an updateIndex fromPid
>>>> operation, or by an updateIndex fromFoxmlFiles operation. If you get
>>>> different indexing documents in these three cases, then the way to find
>>>> out why, is to compare the log lines in debug mode. I may help, if you
>>>> send me those log lines.
>>>>
>>>> Gert
>>>>
>>>>
>>>> On 13/08/2013, at 18.47, Alistair Young wrote:
>>>>
>>>>> Nup , not that. Even uploading the DCTERMS first it doesn't index them.
>>>>> Is there a separate xslt it uses for updating a resource? The symptoms
>>>>> are identical to when the main xslt was lacking the dcterms namespace.
>>>>>
>>>>> Alistair
>>>>>
>>>>> -------------------
>>>>> Alistair Young
>>>>> Àrd Innleadair air Bathair-bog
>>>>> UHI@Sabhal Mòr Ostaig
>>>>>
>>>>> From: Alistair Young <alistair.yo...@uhi.ac.uk>
>>>>> Reply-To: "Support and info exchange list for Fedora users."
>>>>> <fedora-commons-users@lists.sourceforge.net>
>>>>> Date: Tuesday, 13 August 2013 16:44
>>>>> To: "Support and info exchange list for Fedora users."
>>>>> <fedora-commons-users@lists.sourceforge.net>
>>>>> Subject: Re: [fcrepo-user] gsearch not indexing dcterms on upload
>>>>>
>>>>> I think the problem is down to the multi part commit technique being used
>>>>> when uploading to Fedora:
>>>>>
>>>>> 1 – new object pid is generated by a POST
>>>>> 2 – the DC datastream is then PUT
>>>>> 3 - the DCTERMS datastream is then PUT
>>>>> 4 - the content datastream is then POST
>>>>>
>>>>> so I suspect 3 (or 4) is not getting to solr. gsearch indexes on 2 but
>>>>> not on 3 I'm assuming. All the datastreams are present in the foxml files
>>>>> when doing a new index from scratch so they work.
>>>>>
>>>>> Is it possible to turn off auto indexing on upload and instead force it
>>>>> via a REST call to gsearch once the complete resource is uploaded?
>>>>>
>>>>> Alistair
>>>>>
>>>>> --
>>>>> mov eax,1
>>>>> mov ebx,0
>>>>> int 80h
>>>>>
>>>>> From: Alistair Young <alistair.yo...@uhi.ac.uk>
>>>>> Reply-To: "Support and info exchange list for Fedora users."
>>>>> <fedora-commons-users@lists.sourceforge.net>
>>>>> Date: Tuesday, 13 August 2013 16:10
>>>>> To: "Support and info exchange list for Fedora users."
>>>>> <fedora-commons-users@lists.sourceforge.net>
>>>>> Subject: [fcrepo-user] gsearch not indexing dcterms on upload
>>>>>
>>>>> I think I must be missing a file mod or something. Indexing from scratch,
>>>>> using from foxml files works fine and dcterms in all resources are
>>>>> indexed as the namespaces are in the xslt. When uploading a new resource
>>>>> to fedora it gets indexed but only on dc, not dcterms. Does gsearch use a
>>>>> different xslt when indexing from an upload rather than a clean start?
>>>>>
>>>>> Alistair
>>>>>
>>>>> --
>>>>> mov eax,1
>>>>> mov ebx,0
>>>>> int 80h
>>>>> ------------------------------------------------------------------------------
>>>>> Get 100% visibility into Java/.NET code with AppDynamics Lite!
>>>>> It's a free troubleshooting tool designed for production.
>>>>> Get down to code-level detail for bottlenecks, with <2% overhead.
>>>>> Download for free and get started troubleshooting in minutes.
>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________
>>>>> Fedora-commons-users mailing list
>>>>> Fedora-commons-users@lists.sourceforge.net
>>>>> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>>>>
>>>> ------------------------------------------------------------------------------
>>>> Get 100% visibility into Java/.NET code with AppDynamics Lite!
>>>> It's a free troubleshooting tool designed for production.
>>>> Get down to code-level detail for bottlenecks, with <2% overhead.
>>>> Download for free and get started troubleshooting in minutes.
>>>> http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________
>>>> Fedora-commons-users mailing list
>>>> Fedora-commons-users@lists.sourceforge.net
>>>> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>>>
>>> <fglogs.tar.gz>------------------------------------------------------------------------------
>>> Get 100% visibility into Java/.NET code with AppDynamics Lite!
>>> It's a free troubleshooting tool designed for production.
>>> Get down to code-level detail for bottlenecks, with <2% overhead.
>>> Download for free and get started troubleshooting in minutes.
>>> http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________
>>> Fedora-commons-users mailing list
>>> Fedora-commons-users@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>>
>> ------------------------------------------------------------------------------
>> Get 100% visibility into Java/.NET code with AppDynamics Lite!
>> It's a free troubleshooting tool designed for production.
>> Get down to code-level detail for bottlenecks, with <2% overhead.
>> Download for free and get started troubleshooting in minutes.
>> http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________
>> Fedora-commons-users mailing list
>> Fedora-commons-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>
> ------------------------------------------------------------------------------
> Get 100% visibility into Java/.NET code with AppDynamics Lite!
> It's a free troubleshooting tool designed for production.
> Get down to code-level detail for bottlenecks, with <2% overhead.
> Download for free and get started troubleshooting in minutes.
> http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk_______________________________________________
> Fedora-commons-users mailing list
> Fedora-commons-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
------------------------------------------------------------------------------
Get 100% visibility into Java/.NET code with AppDynamics Lite!
It's a free troubleshooting tool designed for production.
Get down to code-level detail for bottlenecks, with <2% overhead.
Download for free and get started troubleshooting in minutes.
http://pubads.g.doubleclick.net/gampad/clk?id=48897031&iu=/4140/ostg.clktrk
_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users