Just a request, the senderbase hosts cache is using
the "-" character as a separator, so, the textual part of
the entries is something like

tld-orgname-domain

now, the above imVHo may cause confusion and for sure
makes more complex to split the entries in case one wants
to extract the infos from the file, since, in case you have an
entry like

us-cheshire-medical-dartmouth-hitchcock-hospital---da-cheshire-med.com

it will be rather difficult correctly reparsing the line and getting
out the correct orgname and domain name; so, I'm asking if
it may be possible to use a different char to separate the various
columns... for example a pipe "|" or any other char which won't
for sure be used inside the data; in such a case the above entry
may look like

us|cheshire-medical-dartmouth-hitchcock-hospital---da|cheshire-med.com

and reparsing the entry, separating the components would be
rather easy; I'm asking this since I'm grepping and parsing the
cache in an attempt to build a list of "good" entries, that is of
entries which one won't usually block; while writing the parsing
code I faced the above issue so... I'm here ... begging :)



------------------------------------------------------------------------------
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test

Reply via email to