Re: [R-SIG-Mac] ODBC driver with postgresql

Marc Schwartz Thu, 17 Dec 2009 07:23:03 -0800

Tim,

Coming to this with a bit of a delay, but one of the things that youmay wish to try is to set 'rows_at_time = 1' in your calls tosqlQuery(). See the help file for the function.

Prior to RODBC version 1.3-0, this defaulted to 1. However, beginningwith 1.3-0, the default changed to 100.

I had problems with the new default value using RODBC connecting to anOracle server. These were typically queries returning 0 rows, where Iknew that there should be more than that and typically in thehundreds. The problem manifested itself when the view names were > 11or 12 characters. This had suggested the possibility of a pointer goneawry, stepping on and corrupting the query result. However, I couldnever confirm that and Prof. Ripley could not replicate the problem onhis end.

I think that lacking further investigation, it may be representativeof a bug in the ODBC driver itself.

Setting 'rows_at_time' to 1 has resolved the problem for me and youshould see if that resolves anything in your situation.

BTW, the error message that you get below after loading RODBC on Linuxsuggests that you need to re-install it. It appears to have been builtusing a prior version of R, before dynamically generated help fileswere introduced in recent versions.


HTH,

Marc Schwartz

On Dec 15, 2009, at 2:36 PM, Tim Coote wrote:

Well my fix is clearly inadequate as I'm getting malloc errors whenI re-read the information :-(
On 15 Dec 2009, at 17:49, Tim Coote wrote:
I now have a simplistic fix for this error. I have no idea how tocreate test or submit a patch. Indeed, i've got zero familiaritywith the testing model.
Here's the skinny. The RODBC.c driver code does not handleSQL_FLOATs (ie DATA_TYPE 6 in my original posting) returned by theodbc driver, so they fall through to the default encodingmechanism, which is character based (I think). As far as I can see,all that's needed is to include SQL_FLOAT to all of the SQL_DOUBLEcases in three switch statements. (one in cachenbind, two inRODBCFetchRows.)
I'd guess that the reason that I've found this bug on the mac isthat my linux box (Fedora Core 10) uses an older version of RODBC(rpm: R-RODBC-1.2-2.fc10.i386) and the bug's crept in between thetwo versions. However, I've not diff'd the sources. It could justbe that no one else uses RODBC + PostgreSQL.
What's the process for submitting a suitable patch? or is the abovedescription enough to be picked up by the package maintainers?
Tim
On 2 Dec 2009, at 22:49, Tim Coote wrote:
For some reason I've got a problem with double precision numbersin postgres coming across into R as integers (they're justtruncated).
I'm using:
R 2.10.0 GUI 1.30 Leopard build 32-bit (5511)
R-RODBC version 1.3
I've tried both the packaged postgresql odbc driver for the macand rebuilt from source with the same results(psqlodbc-08.04.0100), against postgres 8.4 on my mac and against8.3 on Fedora Linux.
the odbc driver works as expected with python + pyodbc, and with R2.10, RODBC (R-RODBC-1.2-2.fc10.i386, R-2.10.0-2.fc10.i386).
Here's some noddy results (I create a table called x with with onedouble precision column) - I'm ignoring the warning, which seemsto be a distro error for fc10:
From R on Linux:
> library (RODBC)
Warning message:
package 'RODBC' was built under R version 2.8.1 and help may notwork correctly
> c=odbcConnect ("PostgreSQL",  uid="yy", pwd="xxx")
> sqlQuery (c, "select * from x")
    x
1  1.234
2 43.989
> sqlColumns(c, "x")
TABLE_QUALIFIER TABLE_OWNER TABLE_NAME COLUMN_NAME DATA_TYPETYPE_NAME1 cases public x x 6float8PRECISION LENGTH SCALE RADIX NULLABLE REMARKS COLUMN_DEFSQL_DATA_TYPE1 15 8 0 10 1<NA> 6SQL_DATETIME_SUB CHAR_OCTET_LENGTH ORDINAL_POSITION IS_NULLABLEDISPLAY_SIZE1 NA NA 1<NA> 22
FIELD_TYPE AUTO_INCREMENT PHYSICAL NUMBER TABLE OID
1        701              0               1     16517


However, on the mac:
> c=odbcConnect ("Hg", uid="yy", pwd="xxx")
> sqlQuery(c, "select * from x")
x
1  1
2 43
> sqlColumns (c, "x")
TABLE_QUALIFIER TABLE_OWNER TABLE_NAME COLUMN_NAME DATA_TYPETYPE_NAME PRECISION LENGTH SCALE RADIX NULLABLE REMARKS1 cases public x x 6float8 15 8 0 10 1COLUMN_DEF SQL_DATA_TYPE SQL_DATETIME_SUB CHAR_OCTET_LENGTHORDINAL_POSITION IS_NULLABLE DISPLAY_SIZE FIELD_TYPE1 <NA> 6 NANA 1 <NA> 22 701
AUTO_INCREMENT PHYSICAL NUMBER TABLE OID BASE TYPEID
1              0               1     16517           0
Uid and pwd obfuscated. As far as I can see, the driver on the macis correctly identifying the type of the column 'x' as a float8,but then truncating before the decimal point. I have seensomewhere that Oracle drivers used to suffer from this issue, whenthey truncated type 6 variables.
python looks ok on the mac:
>>> import pyodbc
>>> c=pyodbc.connect (dsn="Hg")
>>> cu=c.cursor().execute ("select * from x")
>>> print cu.fetchall()
[(1.234, ), (43.988999999999997, )]
I cannot find a pyodbc equivalent to sqlColumns. As far as I cansee, R and python on my mac are using a common stack, and R seemsto work ok with the fedora odbc drivers. Is it likely that I'vestill got bugs in my odbc set up or is there an issue with R +ODBC? I'm not sure where else to look for what's going wrong.
Tim


_______________________________________________
R-SIG-Mac mailing list
R-SIG-Mac@stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/r-sig-mac

Re: [R-SIG-Mac] ODBC driver with postgresql

Reply via email to