You guessed correctly as to why we implemented this: getting exact line counts
on very large files is a time-consuming process. You can still get an exact
line count using the Line/Word/Character count tool in the Text Manipulation
section.
If you're interested in the way it currently works, the first 1 MB of a large
file is read, and the line count is approximated from that sample on the
assumption that line lengths don't vary dramatically throughout the file.
Raising the limit would slow down metadata setting, but for a personal Galaxy
instance you could certainly increase that number, or disable the estimation
entirely.
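To make the approach concrete, here is a minimal sketch of that kind of estimate. It is not the actual Galaxy code: the function name `estimate_line_count` and the `sample_bytes` parameter are made up for illustration, and the scaling rule (newlines in the sample multiplied by file size over sample size) is my assumption about how such an extrapolation could be done.

```python
import os


def estimate_line_count(path, sample_bytes=1024 * 1024):
    """Return (line_count, is_exact) for the file at `path`.

    Hypothetical helper, not Galaxy's implementation. Only the first
    `sample_bytes` bytes are read; larger files get an extrapolated count.
    """
    file_size = os.path.getsize(path)
    with open(path, "rb") as handle:
        sample = handle.read(sample_bytes)

    newlines = sample.count(b"\n")

    if len(sample) >= file_size:
        # The whole file fit in the sample, so the count is exact.
        # Count a final line that lacks a trailing newline.
        if sample and not sample.endswith(b"\n"):
            newlines += 1
        return newlines, True

    # Assume the rest of the file has the same average line length
    # as the sample and scale the count up to the full file size.
    estimate = round(newlines * file_size / len(sample))
    return max(1, estimate), False
```

With this sketch, a file whose early lines are longer than average would be undercounted, and one whose early lines are shorter would be overcounted, which is consistent with the kind of discrepancy reported below. Passing a larger `sample_bytes` trades speed for accuracy, and a value at least as large as the file gives an exact count.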
On Jul 31, 2011, at 11:11 AM, Chaolin Zhang wrote:
> I noticed that the current version of galaxy shows approximate number of
> lines for history items, when it is relatively big. I guess this is due to a
> consideration of performance, but it is quite annoying, because the exact
> line numbers provide a very easy way for users to get simple statistics.
> Sometimes the approximation can be really off. For instance, for one file
> with > 1 M lines, it shows ~850,000 lines. Any thought?
> Please keep all replies on the list by using "reply all"
> in your mail client. To manage your subscriptions to this
> and other Galaxy lists, please use the interface at: