One of the firstthings any data analyst learns while working with tones of
documents(html, pdf texts, ...) is that there are always edge cases which
arenot fully syntactically reducible, that you must eyeball, you can'tsafely
deal with them with code.
Once you have allURLs of edge cases, the best safe method (using neither
browserjavascript-based "addons", nor plain javascript) is simplyopening each
page at a time and remove the browser cache ideally eachtime. You would go like:
_URLs="<textfile with lines of URLs>"
# it restart firefoxeverytime
xargs -n1 ./firefox-new-tab < "${_URLs}"
but in this way youstart an instance every time. I need to:
1) open a number oftabs at once;
2) once I close thelast one ff should clear the cache or shut down and re start
It shouldn't be sohard, but I haven't figured it out yet.
lbrtchx