Thank you Blaser:
This is the exact solution I came up with, but when comparing 8M rows, even on
an 8 GB machine, one runs out of memory. To run this effectively, I have to
break the DF into smaller DFs, loop through them and then do a massive
rmerge at the end. That's what takes 8+ hours to

Hi Rainer:
Thanks for the reply. Posting the large dataset is a task. There are 8M
rows between the two of them and the first discrepancy in the data doesn't
happen until at least the 40,000th row on each dataframe. The examples I
posted are a pretty good abstraction of the root of the issue.

Thank you Arun (and everyone else) -- this is in the right direction.
I'll post the code that worked shortly, in case anyone else is
curious.
-----Original Message-----
From: arun [mailto:smartpink...@yahoo.com]
Sent: Thursday, May 02, 2013 7:09 AM
To: Adeel Amin
Cc: R help
Subject: