When you write the query, you have complete control over the column
names.
We can certainly always attach "literal." to the front of each
metadata parameter, which I'm happy to do, but case mapping etc.
should, I think, remain under the user's control.
So, your query should look something like this:
SELECT DDOCAUTHOR as ddocauthor, ... FROM ...
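A slightly fuller sketch, if it helps (DOCUMENTS and DINDATE are only
placeholders for your own table and columns, and the "..." stands for
the id/url/content columns you already have in your working query):

SELECT DDOCAUTHOR AS ddocauthor,
       DINDATE AS dcheckindate,
       ...
FROM DOCUMENTS
WHERE ...

Note that some databases (Oracle, for instance) fold unquoted aliases
to upper case, so you may need to quote the alias ("ddocauthor") if
the case matters on your end.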
Does this work for you?
Karl
-----Original Message-----
From: ext [email protected] [mailto:[email protected]]
Sent: Tuesday, June 15, 2010 7:05 AM
To: [email protected]
Subject: RE: Other document data.
Hi,
Yes, the metadata is sent to Solr, but it is being sent like this:
webapp=/solr path=/update/extract
params={fmap.content=text&DDOCAUTHOR=sysadmin&uprefix=attr_
  &literal.id=/idc/groups/public/documents/sunil_layout/mfe001251.xml
  &lowernames=true&captureAttr=true}
As you can see, DDOCAUTHOR, which is a custom metadata field, is sent
in the params part.
But to add custom data as metadata in Solr, we should pass it in a
literal. parameter along with the file.
So there will be a small change in the code: instead of sending the
column name in the params, we should send literal.<<columnname>>
(column names in lower case).
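In other words, the request above should end up looking something
like this (only the metadata parameter changes; everything else stays
as it is):

params={fmap.content=text&literal.ddocauthor=sysadmin&uprefix=attr_
  &literal.id=/idc/groups/public/documents/sunil_layout/mfe001251.xml
  &lowernames=true&captureAttr=true}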
Thanks & Regards,
Rohan G Patil
Cognizant Programmer Analyst Trainee, Bangalore || Mob # +91 9535577001
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]]
Sent: Tuesday, June 15, 2010 4:03 PM
To: [email protected]
Subject: RE: Other document data.
That's hard to determine without looking at the solr logs. I am not
familiar with the log options available, but unless I'm mistaken the
default configuration should be dumping every request to standard out.
Karl
________________________________________
From: ext [email protected] [[email protected]]
Sent: Tuesday, June 15, 2010 5:23 AM
To: [email protected]
Subject: RE: Other document data.
Hi,
Yes, I get it. Thanks for the clarification.
I was doing a similar thing before and it used to run; now it
doesn't, so I got confused.
Is there any way to check whether the metadata is actually sent to
Solr? I am experiencing some problem there and I can't seem to
figure out where it is going wrong.
Thanks & Regards,
Rohan G Patil
Cognizant Programmer Analyst Trainee, Bangalore || Mob # +91 9535577001
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]]
Sent: Tuesday, June 15, 2010 2:13 PM
To: [email protected]
Subject: RE: Other document data.
LCF is an incremental crawler. The version query is used to
determine whether data needs to be refetched and reindexed. If it
returns the same thing each time the document is examined, the data
query will not be run the second time. I therefore suggest one of
the following:
(1) Supply no version query at all. That signals to the connector
that there is no version information and the data must be reindexed
on every job run.
(2) Supply a version query that properly reflects changes to the
data. For instance, if there's a timestamp in each record, you can
use that by itself ONLY if any metadata changes also are associated
with a change in that timestamp. If not, you will need to glom the
metadata into the version string as well as the timestamp. Is this
understood?
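As a sketch only, for case (2): assuming an Oracle-style database, a
hypothetical DOCUMENTS table with a DLASTMODIFIED timestamp and a
DDOCAUTHOR metadata column, and the connector's usual $(IDCOLUMN) /
$(VERSIONCOLUMN) / $(IDLIST) substitutions (match these to whatever
your working queries already use):

SELECT DID AS $(IDCOLUMN),
       TO_CHAR(DLASTMODIFIED, 'YYYY-MM-DD HH24:MI:SS')
         || '/' || DDOCAUTHOR AS $(VERSIONCOLUMN)
FROM DOCUMENTS
WHERE DID IN $(IDLIST)

The only requirement is that the version string change whenever the
content or any metadata you care about changes.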
If you want to FORCE a reindex, there is a link in the crawler-ui
for the output connection which allows you to force reindexing of
all data associated with that connection.
If this still doesn't seem to describe what you are seeing, please
clarify further.
Thanks,
Karl
________________________________________
From: ext [email protected] [[email protected]]
Sent: Tuesday, June 15, 2010 12:51 AM
To: [email protected]
Subject: RE: Other document data.
Hi,
When we specify the metadata content, it runs fine the first time,
but the second time it doesn't run the data query at all. What could
be the problem?
Thanks & Regards,
Rohan G Patil
Cognizant Programmer Analyst Trainee, Bangalore || Mob # +91 9535577001
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]]
Sent: Sunday, June 13, 2010 7:04 AM
To: [email protected]
Subject: RE: Other document data.
No. The data query (the same one that returns the blob info) can
now include additional columns. These columns will be sent to Solr
as metadata fields.
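A sketch only (the table and column names are placeholders, and the
$(...) aliases assume the connector's usual substitution names, so
keep whatever your working data query already uses):

SELECT DID AS $(IDCOLUMN),
       DURL AS $(URLCOLUMN),
       DCONTENT AS $(DATACOLUMN),
       DDOCAUTHOR,
       DINDATE
FROM DOCUMENTS
WHERE DID IN $(IDLIST)

Here DDOCAUTHOR and DINDATE stand for the extra columns that would be
passed along to Solr as metadata.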
Karl
________________________________________
From: ext [email protected] [[email protected]]
Sent: Friday, June 11, 2010 2:28 AM
To: [email protected]
Subject: RE: Other document data.
Hi,
I see that the issue is resolved.
Now, is there a new query wherein we can specify the metadata fields?
Thanks & Regards,
Rohan G Patil
Cognizant Programmer Analyst Trainee, Bangalore || Mob # +91 9535577001
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]]
Sent: Thursday, June 10, 2010 4:12 PM
To: [email protected]
Subject: RE: Other document data.
It is not possible to properly glom other fields onto a BLOB unless
you know that the blob's contents are always encoded text. So I
suggest you create a JIRA enhancement request in the Lucene
Connector Framework project to describe this enhancement (adding
metadata support to the JDBC connector).
The URL is: http://issues.apache.org/jira
You may need to create an account if you don't already have one.
Let me know if you have any difficulties.
Thanks,
Karl
-----Original Message-----
From: ext [email protected] [mailto:[email protected]]
Sent: Thursday, June 10, 2010 6:39 AM
To: [email protected]
Subject: RE: Other document data.
Hi,
Using solution 1 was not a bad idea, but the problem is that the
content is stored as a BLOB in the database, and gluing other fields
onto a BLOB is not possible (is it?).
Regarding 2: Yes, I guess I can do that modification; anyway, it all
depends on how we show it to the user.
Thanks & Regards,
Rohan G Patil
Cognizant Programmer Analyst Trainee, Bangalore || Mob # +91 9535577001
[email protected]
-----Original Message-----
From: [email protected] [mailto:[email protected]]
Sent: Thursday, June 10, 2010 3:19 PM
To: [email protected]
Subject: RE: Other document data.
(1) The JDBC connector is relatively primitive and does not currently
have any support for "document metadata". You can, of course, glom
together multiple fields into the content field with it, but that's
pretty crude.
(2) The LCF convention for identifying documents uniquely in the
target index is to use the URL of the document. All documents
indexed with LCF have such a URL, and it is likely to be both useful
and unique. This URL is how LCF requests deletion of the document
from the index, if necessary, and also how it overwrites the
document. So it maps pretty precisely to literal.id for the basic
Solr setup.
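For illustration only (the URL here is made up): a document whose
data query yields the URL http://example.com/docs/mfe001251.xml would
be posted with literal.id=http://example.com/docs/mfe001251.xml, and
any later delete or overwrite of that document would use the same id.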
Now, it may be that this is too tied to the example, and that the
Solr connector should have a configuration setting to allow the name
of the id field to be changed; that sounds like a reasonable
modification that would not be too difficult to do. Is this
something you are looking for?
Karl
________________________________________
From: ext [email protected] [[email protected]]
Sent: Thursday, June 10, 2010 4:52 AM
To: [email protected]
Subject: Other document data.
I am using a JDBC connection to search for the documents in the
database.
The issue is that some document data (check-in date etc.) is present
in other columns. How do I send this data to Solr so that it gets
indexed?
Why is the URL of the file taken as the ID in Solr?
Thanks & Regards,
Rohan G Patil
Cognizant Programmer Analyst Trainee, Bangalore || Mob # +91 9535577001
[email protected]