Hi, I was wondering if there is a way to scan a table based on the timestamps. For example, is there a way to set a range based on the timestamp portion of the key?
I know that standard practice is to add a timestamp as part of the row id, but in this particular case I probably cannot use that technique. The reason I can't use it is that I need to find the most recent data in a preexisting Accumulo instance. Not all of the information was stored with timestamps as appended to the row id. I can't go back and change the data, I just have to work with what is there. So, given a large amount of preexisting data without time information in the row id, column family or column qualifier, how would you scan for the most recent data? Specifically, is there any way to scan/sort by the timestamp portion of the key. I did not see any way to make a Range with times. I also really do not want to run a job over all the data to make a new copy of the table that is sorted. I have a lot of data here and such a replication would take a very long time. Thanks, Jon
