dataimport, both splitBy and dateTimeFormat
I'm trying out the dataimport capability. I have a column that is a series of dates separated by spaces like so: 1996-00-00 1996-04-00 And I'm trying to import it like so: field column=r_event_date splitBy= dateTimeFormat=-MM-dd / However this fails and the stack trace suggests it is first trying to apply the dateTimeFormat before splitBy. I think this is a bug... dataimport should apply DateFormatTransformer and NumberFormatTransformer last. ~ David Smiley -- View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: dataimport, both splitBy and dateTimeFormat
Hi David, I think you meant RegexTransformer instead of NumberFormatTransformer. Anyhow, the order in which the transformers are applied is the same as the order in which you specify them. So make sure your entity has transformers=RegexTransformer,DateFormatTransformer. On Thu, Oct 16, 2008 at 6:14 PM, David Smiley @MITRE.org [EMAIL PROTECTED]wrote: I'm trying out the dataimport capability. I have a column that is a series of dates separated by spaces like so: 1996-00-00 1996-04-00 And I'm trying to import it like so: field column=r_event_date splitBy= dateTimeFormat=-MM-dd / However this fails and the stack trace suggests it is first trying to apply the dateTimeFormat before splitBy. I think this is a bug... dataimport should apply DateFormatTransformer and NumberFormatTransformer last. ~ David Smiley -- View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Shalin Shekhar Mangar.
Re: dataimport, both splitBy and dateTimeFormat
The wiki didn't mention I can specify multiple transformers. BTW, it's transformer (singular), not transformers. I did mean both NFT and DFT because I was speaking of the general case, not just mine in particular. I thought that the built-in transformers were always in-effect and so I expected NFT,DFT to occur last. Sorry if I wasn't clear. Thanks for your help; it worked. ~ David Shalin Shekhar Mangar wrote: Hi David, I think you meant RegexTransformer instead of NumberFormatTransformer. Anyhow, the order in which the transformers are applied is the same as the order in which you specify them. So make sure your entity has transformers=RegexTransformer,DateFormatTransformer. On Thu, Oct 16, 2008 at 6:14 PM, David Smiley @MITRE.org [EMAIL PROTECTED]wrote: I'm trying out the dataimport capability. I have a column that is a series of dates separated by spaces like so: 1996-00-00 1996-04-00 And I'm trying to import it like so: field column=r_event_date splitBy= dateTimeFormat=-MM-dd / However this fails and the stack trace suggests it is first trying to apply the dateTimeFormat before splitBy. I think this is a bug... dataimport should apply DateFormatTransformer and NumberFormatTransformer last. ~ David Smiley -- View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Shalin Shekhar Mangar. -- View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20016178.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: dataimport, both splitBy and dateTimeFormat
Thanks David, I have updated the wiki documentation http://wiki.apache.org/solr/DataImportHandler#transformer The default transformers do not have any special privilege it is like any normal user provided transformer.We just identified some commonly found usecases and added transformers for that. Applying a transformer is not very 'cheap' it has to do extra checks to know whether to apply or not. On Fri, Oct 17, 2008 at 12:26 AM, David Smiley @MITRE.org [EMAIL PROTECTED] wrote: The wiki didn't mention I can specify multiple transformers. BTW, it's transformer (singular), not transformers. I did mean both NFT and DFT because I was speaking of the general case, not just mine in particular. I thought that the built-in transformers were always in-effect and so I expected NFT,DFT to occur last. Sorry if I wasn't clear. Thanks for your help; it worked. ~ David Shalin Shekhar Mangar wrote: Hi David, I think you meant RegexTransformer instead of NumberFormatTransformer. Anyhow, the order in which the transformers are applied is the same as the order in which you specify them. So make sure your entity has transformers=RegexTransformer,DateFormatTransformer. On Thu, Oct 16, 2008 at 6:14 PM, David Smiley @MITRE.org [EMAIL PROTECTED]wrote: I'm trying out the dataimport capability. I have a column that is a series of dates separated by spaces like so: 1996-00-00 1996-04-00 And I'm trying to import it like so: field column=r_event_date splitBy= dateTimeFormat=-MM-dd / However this fails and the stack trace suggests it is first trying to apply the dateTimeFormat before splitBy. I think this is a bug... dataimport should apply DateFormatTransformer and NumberFormatTransformer last. ~ David Smiley -- View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20013006.html Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Shalin Shekhar Mangar. -- View this message in context: http://www.nabble.com/dataimport%2C-both-splitBy-and-dateTimeFormat-tp20013006p20016178.html Sent from the Solr - User mailing list archive at Nabble.com. -- --Noble Paul