I had come across the Tokenizer class in the lang package, and I agree that it offers much more control and flexibility. Here however, from a developer's perspective, I had meant to discuss the functionality of the StringUtils.split() method, regardless of such functionality available elsewhere.
Ash
-----Original Message----- From: Inger, Matthew [mailto:[EMAIL PROTECTED]
try the new "Tokenizer" class. You can get it from CVS.
-----Original Message----- From: Ash .. [mailto:[EMAIL PROTECTED] Subject: [lang] StringUtils.split() functionality wrt separator repeats
The StringUtils.split(), suggest as an alternative to StringTokenizer, ignores adjacent separators. However, I feel that it would be better if they were not ignored, and empty strings be returned when such a case occurs. Or, there be an overloaded option which lets the user specify whether to ignore separator repeats or not.
As I see it, it is more often that a separator repeat occurs due to the absence of data in a specified series of, say, cells. Taking the typical case of a CSV file line read, finding two commas would mean a cell is empty,
not that the resultant array should have one element less. It must rather have the expected number of elements with the repeat case entailing an empty
String. Comments.
Ash
---------------------------------------------
Run, rabbit run. Dig that hole, forget the sun, And when at last the work is done Don't sit down it's time to dig another one.
_________________________________________________________________ On the move? Get Hotmail on your mobile phone http://www.msn.co.uk/msnmobile
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
