Alle 21:51, lunedì 5 settembre 2005, Sergey Martynoff ha scritto: > This is exactly what I was asking about few days ago. There is no > such functionality in wget.
not at the moment, but it is planned for 2.0. > As I found in archives, the developers do not want to implement regexp due > to huge size of regexp libs. But external filtering seems to be interesting > solution. actually, support for regex and external processing/parsing of resources are two completely orthogonal issues. the problem with regex integration in wget is in the lack of portable, reasonably small and easy to maintain libraries. hrvoje suggested to merge the regex library from emacs, and i think that's probably the best possible solution at the moment. anyway, regex support is also planned for 2.0. > Recently I made a simple patch to workaround this problem. This > patch adds --output_filter parameter, which is used to specify > external filtering program. Filter is applied not to urls list, > but to whole fetched file right after file saving. The file name > is passed as the last argument to the filter command. I posted > this patch to wget-patches mailing list, and I can send it to you > if you're able to recompile wget on your system. i've seen you posted the patch, but i haven't taken a look at it yet. -- Aequam memento rebus in arduis servare mentem... Mauro Tortonesi http://www.tortonesi.com University of Ferrara - Dept. of Eng. http://www.ing.unife.it GNU Wget - HTTP/FTP file retrieval tool http://www.gnu.org/software/wget Deep Space 6 - IPv6 for Linux http://www.deepspace6.net Ferrara Linux User Group http://www.ferrara.linux.it
