Here is one way of doing it. I would suggest that you read in the
data with readLines and then combine into one single string so that
you can use substring on it. Since you did not provide provide
commented, minimal, self-contained, reproducible code, I will take a
guess at that your data looks
How long is it taking? Can you send me the code that you are using.
Another technique is to recode you characters into numbers and store
them as integers. You can then sample the values and reconstruct the
output. Here is a faster way:
# create some test data -- might be read in the readLines
2 matches
Mail list logo