I think the usual practice is to use BLOB types to store data that is not a 
character stream, so your case is probably pretty rare.  If casting solves the 
issue, then why not?  I think people use casts all the time to solve these 
types of compatibility issues.  Then again, if ClobTransformer were changed to 
handle BLOBs also, I do not see the harm.  But I would think it would be a much 
more common case that users would be putting binary-format documents in BLOBs, 
then feeding them to Tika or something to extract the text.

James Dyer
Ingram Content Group
(615) 213-4311


-----Original Message-----
From: geeky2 [mailto:gee...@hotmail.com] 
Sent: Friday, May 17, 2013 1:34 PM
To: dev@lucene.apache.org
Subject: have developer question about ClobTransformer and DIH

hello,

this is my first post to this forum - 

if this question is not correct for this forum (or has been addressed in
another jira) - just let me know ;)

environment: 
   solr 3.5
   informix 11.x
   centos
   
Problem statement: ClobTransformer
(./solr/contrib/dataimporthandler/src/java/org/apache/solr/handler/dataimport/ClobTransformer.java)
stopped working when two columns in the table were converted from CLOB to
Text.

More Detail:
recently i ran into an issue while attempting to use the DIH against an
informix table.  the DIH and ClobTransformer were working well with two (2)
fields that were defined as CLOB.  to resolve another informix-specific
issue - the two fields were changed to Text fields (another type of informix
blob).

after the change - another full import was done and it was discovered that
these two fields were being returned with the classic hex address that
denotes a binary field in the schema.

after quite a bit of experimentation and discussion with the DBAs, i cast
the two columns as clob.

example:

    cast(att.attr_val AS clob) as attr_val,
    cast(rsr.rsr_val AS clob) as rsr_val,

after doing this - the issue was resolved.

Questions:

1) is this a known issue?
2) is this the prescribed remedy for this type of situation - using this
version of solr (3.5)?
3) can i get more detail on why the ClobTransformer does not work with other
blob-like fields?

finally - i looked at the code for ClobTransformer (and Transformer) and was
wondering if it is possible to change or add another class that would handle
this use case "out of the box".
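
On that last point, here is a minimal sketch of what such a helper might look
like. It assumes the JDBC driver hands the Text columns back as byte[] or
java.sql.Blob (which would explain the hex-address output, since that is the
default toString() of a Java array) and that the bytes are in a known charset.
LobText and toText are hypothetical names and are not part of Solr. A custom
DIH transformer could call something like this from its transformRow method
for each configured column.

```java
import java.io.IOException;
import java.io.Reader;
import java.nio.charset.Charset;
import java.sql.Blob;
import java.sql.Clob;
import java.sql.SQLException;

/** Hypothetical helper (not part of Solr): turns LOB-like JDBC values into text. */
final class LobText {
    private LobText() {}

    /**
     * Returns a String for Clob, Blob and byte[] values.
     * Any other value (including null) is returned unchanged.
     */
    static Object toText(Object value, Charset charset) {
        try {
            if (value instanceof Clob) {
                return readAll(((Clob) value).getCharacterStream());
            }
            if (value instanceof Blob) {
                Blob blob = (Blob) value;
                long length = blob.length();
                if (length == 0) {
                    return "";
                }
                // Blob positions are 1-based; assumes the value fits in memory.
                return new String(blob.getBytes(1, (int) length), charset);
            }
            if (value instanceof byte[]) {
                return new String((byte[]) value, charset);
            }
            return value;
        } catch (SQLException | IOException e) {
            throw new IllegalStateException("Could not read LOB value", e);
        }
    }

    /** Drains the reader into a String and closes it. */
    private static String readAll(Reader reader) throws IOException {
        try (Reader in = reader) {
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[1024];
            int n;
            while ((n = in.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
            return sb.toString();
        }
    }
}
```

The charset is passed in explicitly because binary columns carry no encoding
information of their own. Decoding them with the wrong charset would index
garbage silently, which is arguably worse than the hex address.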

thx
mark

--
View this message in context: 
http://lucene.472066.n3.nabble.com/have-developer-question-about-ClobTransformer-and-DIH-tp4064256.html
Sent from the Lucene - Java Developer mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



