Today I tried three Linux text browsers (lynx, w3m and links) and found that none of them could fully capture the links on the Google results page after running a search query at www.google.com.

For example, Google would give this as the header line:

We have found our results

then the URL line:

http://www.foundourresults.com/our/data/.../is/here

and some explanation below that...

But notice the ellipsis (...) in the URL line. All three browsers faithfully record the URL line with the ellipsis in it, so the captured URLs cannot be resolved when used for a new lookup. Google does, however, embed the correct URL in the link on the header line, which is where the browser actually goes when you click on that result.
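To illustrate the point about the header line carrying the real target, here is a minimal Python sketch that reads the href attributes out of saved result HTML instead of the displayed URL text. It assumes the page has already been saved as HTML, and it assumes result links may be wrapped in a "/url?q=..." style redirect; that wrapper, the names LinkCollector, unwrap and full_urls, and the example.com address are illustrative assumptions, not something confirmed by the page itself.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse, parse_qs


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def unwrap(href):
    """Return the target of a '/url?q=...' redirect (assumed format), else href unchanged."""
    parsed = urlparse(href)
    if parsed.path == "/url":
        target = parse_qs(parsed.query).get("q")
        if target:
            return target[0]
    return href


def full_urls(html):
    """Return the absolute http(s) link targets found in the HTML, in page order."""
    collector = LinkCollector()
    collector.feed(html)
    urls = [unwrap(h) for h in collector.hrefs]
    return [u for u in urls if u.startswith(("http://", "https://"))]


if __name__ == "__main__":
    # Hypothetical snippet standing in for one search result.
    sample = (
        '<a href="/url?q=http://www.example.com/our/data/full/path/is/here&amp;sa=U">'
        "We have found our results</a>"
    )
    for url in full_urls(sample):
        print(url)
```

Because this works from the link target rather than the visible text, an ellipsis in the displayed URL line would not matter, provided the saved HTML really does carry the full address in the href.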

I noticed that Startpage doesn't use an ellipsis in the URL line, so it is simple to capture the URLs to a file with the -dump option and later parse them for further lookups, and they all work.

How would one go about recovering the full URLs from the Google search results, so that a text browser captures the fully specified URL?
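The dump-and-parse step could look something like the following Python sketch. It assumes the dump is a plain text file in which URLs appear as whitespace-delimited http:// or https:// strings; the function name urls_from_dump and the file name results.txt are hypothetical. It also separates out any URL that contains an ellipsis, since those are the ones that will not resolve.

```python
import re
import sys

# Anything starting with http:// or https:// up to the next whitespace or quote.
URL_RE = re.compile(r"https?://[^\s<>\"']+")


def urls_from_dump(text):
    """Split the URLs found in dump text into (complete, truncated) lists.

    A URL counts as truncated if it contains '...'. Duplicates are dropped
    and first-seen order is kept.
    """
    complete, truncated = [], []
    seen = set()
    for match in URL_RE.finditer(text):
        url = match.group(0).rstrip(".,;)")  # trim trailing punctuation
        if url in seen:
            continue
        seen.add(url)
        if "..." in url:
            truncated.append(url)
        else:
            complete.append(url)
    return complete, truncated


if __name__ == "__main__":
    # Usage (file name is an example): python parse_dump.py results.txt
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        good, bad = urls_from_dump(handle.read())
    for url in good:
        print(url)
    for url in bad:
        print("truncated:", url, file=sys.stderr)
```

Complete URLs go to standard output so they can be fed to the next lookup, while truncated ones are reported on standard error rather than silently passed along.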

Randall


_______________________________________________
PLUG mailing list
[email protected]
http://lists.pdxlinux.org/mailman/listinfo/plug
