Progress on loading stuff into babel <http://babel.gpcnetwork.org/> continues...

3 down, but...

Joe,

I'm having trouble with weird quotes in the c_basecode field of the Marshfield 
CSV data:


/usr/bin/psql --host localhost --dbname i2b2 --user i2b2metadata --no-password 
-c copy mcrf_terms(C_HLEVEL, C_FULLNAME, C_NAME, C_SYNONYM_CD, 
C_VISUALATTRIBUTES, C_TOTALNUM, C_BASECODE, C_METADATAXML, C_FACTTABLECOLUMN, 
C_TABLENAME, C_COLUMNNAME, C_COLUMNDATATYPE, C_OPERATOR, C_DIMCODE, C_COMMENT, 
C_TOOLTIP, UPDATE_DATE, DOWNLOAD_DATE, IMPORT_DATE, SOURCESYSTEM_CD, 
VALUETYPE_CD) from stdin with (format csv, header true, null 'NULL')

ERROR:  value too long for type character varying(50)
CONTEXT:  COPY mcrf_terms, line 42163, column c_basecode: ") Miscell. 
(Med.Supl.;Non-Drugs),,concept_cd,concept_dimension,concept_path,T,=,\i2b2\Medications\*N..."

The offending line is:

5,\i2b2\Medications\*Not Available\DEVICES\Syringe with Needle (Disp)\Syringe 
with Needle (Disp) (Syringe 3cc/21Gx1-1/2\,Med:29443,N,LA ,0,) Miscell. 
(Med.Supl.;Non-Drugs)",,concept_cd,concept_dimension,concept_path,T,=,\i2b2\Medications\*Not
 Available\DEVICES\Syringe with Needle (Disp)\Syringe with Needle (Disp) 
(Syringe 3cc/21Gx1-1/2\,,Med:29443,2013-10-16 00:00:00.000,2013-10-16 
00:00:00.000,2013-10-16 00:00:00.000,RDW,
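
The c_basecode value on that line ends in a double quote that is never opened, 
so the CSV parser appears to read past the field boundary. A rough way to look 
for other lines like it before the next load attempt is to flag every line with 
an odd number of double-quote characters. This is only a sketch: the function 
name and the sample rows are made up for illustration, and it will also flag 
the pieces of a legitimately quoted field that contains an embedded newline.

```python
import io


def find_unbalanced_quote_lines(lines, quote='"'):
    """Yield (line_number, text) for each line with an odd number of quotes.

    Heuristic only: a quoted field that spans several physical lines is
    valid CSV but will be reported here too.
    """
    for number, text in enumerate(lines, start=1):
        if text.count(quote) % 2 == 1:
            yield number, text.rstrip("\n")


# Made-up sample rows; only the last one has a stray closing quote.
sample = io.StringIO(
    'C_HLEVEL,C_NAME,C_BASECODE\n'
    '5,"Aspirin, 81 mg",Med:1\n'
    '5,Syringe,) Miscell. (Med.Supl.;Non-Drugs)"\n'
)

for number, text in find_unbalanced_quote_lines(sample):
    print(number, text)
```

Run against the real file, this would take an open file object in place of the 
sample; line numbers count the header as line 1, which matches how COPY reports 
them when "header true" is set.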


I'm working on encoding issues in the UTSW data.

And it looks like the WISC data is tab-separated, not CSV...
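
A quick check for which separator a file uses is to count candidate delimiters 
in its header line. The sketch below assumes the header itself contains no 
quoted commas or tabs; guess_delimiter is a hypothetical helper name, not part 
of any existing loader.

```python
import csv
import io


def guess_delimiter(header_line, candidates=(",", "\t")):
    """Return the candidate delimiter that occurs most often in the header.

    On a tie the first candidate wins, so a header with neither character
    is treated as comma-separated.
    """
    return max(candidates, key=header_line.count)


# Made-up tab-separated sample using a few of the column names above.
tsv_sample = "C_HLEVEL\tC_FULLNAME\tC_NAME\n5\t\\i2b2\\Medications\\\tAspirin\n"

delimiter = guess_delimiter(tsv_sample.splitlines()[0])
rows = list(csv.reader(io.StringIO(tsv_sample), delimiter=delimiter))
print(repr(delimiter), rows[0])
```

If the file does turn out to be tab-separated, the COPY options can name the 
separator explicitly, e.g. delimiter E'\t' alongside format csv.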

--
Dan
