a SED need

2005-12-27 Thread Jack Stone
I have some HTML files with hundreds of URLs that I need to modify using a 
search/replace string. I assume that SED(1) is the right tool to use, but 
every syntax I've tried has not worked.


Here is what I'm trying to do:
Change full URLs to relative paths, in other words, chop off the
http://www.example.com/; portion:


From this:

lia href=http://www.example.com/model/many.html;
To this:
lia href=model/many.html

I think it is the slashes and quotes that are giving me fits as I'm very 
much a novice on SED(1) syntax.


Would appreciate any tips on how to do the above so I can search and replace 
all of the hundreds of URLs.


Many thanks and Happy New Year!

Regards,
Jack

_
Don’t just search. Find. Check out the new MSN Search! 
http://search.msn.click-url.com/go/onm00200636ave/direct/01/


___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: a SED need

2005-12-27 Thread Dmitry Sidorov
On Tue, 2005-12-27 at 09:18 -0600, Jack Stone wrote:
 I have some HTML files with hundreds of URLs that I need to modify using a 
 search/replace string. I assume that SED(1) is the right tool to use, but 
 every syntax I've tried has not worked.
 
 Here is what I'm trying to do:
 Change full URLs to relative paths, in other words, chop off the
 http://www.example.com/; portion:
 
 From this:
 lia href=http://www.example.com/model/many.html;
 To this:
 lia href=model/many.html
 
 I think it is the slashes and quotes that are giving me fits as I'm very 
 much a novice on SED(1) syntax.
 
 Would appreciate any tips on how to do the above so I can search and replace 
 all of the hundreds of URLs.
 
 Many thanks and Happy New Year!
 
 Regards,
 Jack
 
 _
 Dont just search. Find. Check out the new MSN Search! 
 http://search.msn.click-url.com/go/onm00200636ave/direct/01/
 
 ___
 freebsd-questions@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to [EMAIL PROTECTED]

Try this:
sed s/http:\\/\\/www.example.com// your_file

-- 
Dmitry Sidorov
PEM QA Engineer
SWsoft, Inc.
E-mail: [EMAIL PROTECTED]
ICQ UIN: 864582

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: a SED need

2005-12-27 Thread David Kelly
On Tue, Dec 27, 2005 at 09:18:56AM -0600, Jack Stone wrote:
 I have some HTML files with hundreds of URLs that I need to modify using a 
 search/replace string. I assume that SED(1) is the right tool to use, but 
 every syntax I've tried has not worked.
 
 Here is what I'm trying to do:
 Change full URLs to relative paths, in other words, chop off the
 http://www.example.com/; portion:
 
 From this:
 lia href=http://www.example.com/model/many.html;
 To this:
 lia href=model/many.html
 
 I think it is the slashes and quotes that are giving me fits as I'm very 
 much a novice on SED(1) syntax.

Am sure sed is the right high power production tool for getting the job
done but I get such things done easier in awk. Am sure many say the same
about perl. Sed, awk, perl, is the evolutionary order.

Save this as something like example.awk and chmod +x to make it
executable for easy reuse. Or you could awk -f example.exe input 
output

By saving to a file you bypass the need to escape characters from the
shell (which will be different depending on csh vs. sh) and yet again
from the RE parser. The escapes below are to make sure the literal
character is used for regular expression rather than a possible RE
interpretation.

Contains two patterns to match. The first matches the thing you are
looking to change. The match regular expression is repeated in gsub()
where its replaced with the plain text you desire. Print causes the
line to be outputed, and next ends the processing of that input line
so the next pattern isn't tried. Therefore the next match-all pattern
prints everything the first skipped.

#!/usr/bin/awk -f

/a href=\http:\/\/www.example.com\// {
gsub(/a href=\http:\/\/www.example.com\//, a href=\)
print
next
}

{ print }


-- 
David Kelly N4HHE, [EMAIL PROTECTED]

Whom computers would destroy, they must first drive mad.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: a SED need

2005-12-27 Thread Mike Jeays
On Tue, 2005-12-27 at 09:18 -0600, Jack Stone wrote:
 I have some HTML files with hundreds of URLs that I need to modify using a 
 search/replace string. I assume that SED(1) is the right tool to use, but 
 every syntax I've tried has not worked.
 
 Here is what I'm trying to do:
 Change full URLs to relative paths, in other words, chop off the
 http://www.example.com/; portion:
 
 From this:
 lia href=http://www.example.com/model/many.html;
 To this:
 lia href=model/many.html
 
 I think it is the slashes and quotes that are giving me fits as I'm very 
 much a novice on SED(1) syntax.
 
 Would appreciate any tips on how to do the above so I can search and replace 
 all of the hundreds of URLs.
 
 Many thanks and Happy New Year!
 
 Regards,
 Jack
 
 _
 Dont just search. Find. Check out the new MSN Search! 
 http://search.msn.click-url.com/go/onm00200636ave/direct/01/
 
 ___
 freebsd-questions@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to [EMAIL PROTECTED]

sed will allow other characters than '/' as its delimiter, which makes
it much easier to get escape sequences right, or avoid them altogether.

sed -e 's=/http=/https=g' is an example

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]