Re: Comparing strings

John W. Kennedy Mon, 11 Apr 2005 12:08:38 -0700

Craig Cardimon wrote:

I am working with huge ASCII text files and large text fields.


Define "huge" and "large"

As needs and wants have changed, I will be reprocessing data we have already gone through to see if more records can be extracted.

I will need to compare strings to ensure that records I am inserting into our SQL Sever 2000 database are not duplicates of records already there.
I have come up with two ways:
(1) use string length (number of characters a string holds):
$length = length($name);
If both strings are identical in length, I have a duplicate and would cease processing.


How do you know that?

(2) compare strings (or 200-characer substrings thereof) directly:

if ( $str1 eq $str2 )
{
   ....
}


If the test-for-identical-length thing is actually valid, fill an array
with 1 for each length that is in use.

Or, if all the keys will fit into RAM, fill a hash with the
already-loaded keys and test each new record for presence in the hash.

If not, the best thing may simply be to make the field a UNIQUE key (if
the SQL Server of the Beast will allow it) and /try/ to insert the
record. If you can't, don't.

--
John W. Kennedy
"The poor have sometimes objected to being governed badly; the rich have
always objected to being governed at all."
  -- G. K. Chesterton.  "The Man Who Was Thursday"


_______________________________________________
ActivePerl mailing list
[email protected]
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs

Re: Comparing strings

Reply via email to