Today I tried three Linux text browsers (lynx, w3m, and links) and
found that none of them could fully capture the links on the Google
results page after running a search query at www.google.com.
For example, Google would give as the header line:
We have found our results
then the URL line:
http://www.foundourresults.com/our/data/.../is/here
and some explanation below that.
Notice the ellipsis (...) in the URL line. All three browsers
faithfully record the URL line with the ellipsis, so the captured URL
cannot be resolved when used for a new lookup. However, Google embeds
the correct URL in the header line, which is where the browser actually
goes next when that result is clicked.
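
To illustrate the idea of reading the URL from the header line's link
rather than from the displayed URL line, here is a minimal sketch. It
assumes the results page has been saved as HTML (for example with a
browser's -source option) and that each header line is an ordinary <a>
tag. The /url?q=... redirect form handled by unwrap() is an assumption
about how the links may be wrapped, and the sample markup and
example.com address are made up for the demonstration.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse, parse_qs


class HrefCollector(HTMLParser):
    """Collect the href attribute of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def unwrap(href):
    """Return the q= target of a /url?q=... redirect link (assumed form),
    or the href unchanged if it is not of that form."""
    parsed = urlparse(href)
    if parsed.path == "/url":
        target = parse_qs(parsed.query).get("q")
        if target:
            return target[0]
    return href


def full_urls(html):
    """Return the fully specified URL behind each link in the HTML text."""
    collector = HrefCollector()
    collector.feed(html)
    return [unwrap(h) for h in collector.hrefs]


# Made-up sample markup, not real search output.
sample = (
    '<a href="/url?q=http://www.example.com/our/data/full/path/is/here&amp;sa=U">'
    "We have found our results</a>"
)
print(full_urls(sample))
```

The point of the sketch is only that the href carries the complete
address even when the visible URL line is abbreviated.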
I noticed that Startpage doesn't use an ellipsis in the URL line, so it
is simple to capture the URLs to a file using the -dump option and
later parse them for further lookups, and they all work.
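
For what it's worth, the parsing step can be sketched roughly as
follows. It assumes the dump ends with a numbered list of references,
which is how lynx -dump lays it out; other browsers may format their
dumps differently. The function name urls_from_dump and the sample text
are made up for illustration.

```python
import re

# A reference line looks like "   1. http://host/path" (lynx -dump style).
REF_LINE = re.compile(r"^\s*\d+\.\s+(https?://\S+)\s*$")


def urls_from_dump(text):
    """Return the unique URLs from numbered reference lines, in order."""
    urls = []
    for line in text.splitlines():
        match = REF_LINE.match(line)
        if match and match.group(1) not in urls:
            urls.append(match.group(1))
    return urls


# Made-up sample of a dump file's contents.
sample_dump = """\
   [1]Example result one

References

   1. http://www.example.com/first/page
   2. http://www.example.org/second/page
   3. http://www.example.com/first/page
"""
print(urls_from_dump(sample_dump))
```

In practice the text would be read from the saved dump file instead of
a string.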
How would one go about recovering the full URLs from the Google search
results so that a text browser captures the fully specified URL for
each result?
Randall
_______________________________________________
PLUG mailing list
[email protected]
http://lists.pdxlinux.org/mailman/listinfo/plug