Re: wget and meta name=robots content=noindex,nofollow

2002-07-06 Thread Cédric Rosa
://www.robotstxt.org/wc/norobots-rfc.txt. [...] Bye, Cedric. - Original Message - From: Hack Kampbjørn [EMAIL PROTECTED] To: Cédric Rosa [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Sent: Saturday, July 06, 2002 8:21 PM Subject: Re: wget and meta name=robots content=noindex,nofollow Cédric Rosa wrote

Re: user-agent string for IE

2002-07-01 Thread Cédric Rosa
Hello Jakub, try: --user-agent=Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0) (with quotes) Regards, Cedric Rosa. At 10:35 01/07/2002 +0100, you wrote: Thanks for reply and sorry for my long response. I was trying to use: --user-agent=Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)

Re: Honesly, wget as a webcrawler?

2002-06-24 Thread Cédric Rosa
Hello Jason, I'm working on a search engine project and I'm thinking about to use wget as webcrawler. I've just tested fwget and I'm not sure it is better than wget because it was not updated since one year ... I'm interested about any idea on this subject. Cedric. At 10:55 24/06/2002

wget and meta name=robots content=noindex,nofollow

2002-06-24 Thread Cédric Rosa
Hello, Is-it normal that wget saves web pages which contain meta name=robots content=noindex ? Or does wget considerate that it is not a search engine and respects only the follow/nofollow rules ? Or is-it a bug ? :) Thanks. Cedric.

Fwd: Bug with wget ? I need help.

2002-06-21 Thread Cédric Rosa
this problem ? Date: Fri, 21 Jun 2002 16:37:02 +0200 To: [EMAIL PROTECTED] From: Cédric Rosa [EMAIL PROTECTED] Subject: Bug with wget ? I need help. Hello, First, scuse my english but I'm french. When I try with wget (v 1.8.1) to download an url which is behind a router, the software wait for ever even

Re: Bug with wget ? I need help.

2002-06-21 Thread Cédric Rosa
thanks for your help :) I'm installing version 1.9 to check. I think this update may solve my problem. Cedric Rosa. - Original Message - From: Hack Kampbjørn [EMAIL PROTECTED] To: Cédric Rosa [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Sent: Friday, June 21, 2002 7:27 PM Subject: Re: Bug