***
*** 135,141
$text =~ s/\s+/ /g;
print "FLUSH $tag = '$text'\n" if $DEBUG;
if ($tag eq 'title') {
! $self->{'header'}->header(Title => $text);
How about just changing this into:
$self->{'header'}->push_header(Title => $text);
Will that work for you?
Rega
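For readers comparing the two calls: in HTTP::Headers, header() replaces any existing value of a field, while push_header() appends another value. A minimal sketch of that difference, with made-up title strings (not taken from the patch above):

```perl
use strict;
use warnings;
use HTTP::Headers;

my $h = HTTP::Headers->new;

# header() overwrites: the second call replaces the first value.
$h->header(Title => 'First title');
$h->header(Title => 'Second title');
my @after_header = $h->header('Title');

# push_header() appends: the field now carries two values.
$h->push_header(Title => 'Third title');
my @after_push = $h->header('Title');

print scalar(@after_header), " value(s) after header()\n";
print scalar(@after_push),   " value(s) after push_header()\n";
```

Whether keeping every title is what a caller wants depends on the document being parsed; the sketch only shows what each method does to the header object.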
[EMAIL PROTECTED] (Randal L. Schwartz) writes:
"Gisle" == Gisle Aas [EMAIL PROTECTED] writes:
Gisle After 6 weeks, 17 alpha releases and 7 betas we have now released
Gisle HTML-Parser-3.00 on CPAN. We would like to thank the CPAN testers
Gisle team and especially Paul Schinder
[EMAIL PROTECTED] (Randal L. Schwartz) writes:
"Gisle" == Gisle Aas [EMAIL PROTECTED] writes:
Gisle - XML mode.
Does this mean that XML::Parser can be *replaced* by HTML::Parser in the
right mode?
Not really. It just means that HTML::Parser can parse some simple
Marek Rouchal DAT CAD HW Tel 25849 [EMAIL PROTECTED] writes:
Hello Gisle,
I just downloaded this module from CPAN and tested it - I get the
following error in the tests:
t/parserAssertion failed: file "./hparser.c", line 243 at
t/parser.t line 139.
The line is
"Aaron Aw" [EMAIL PROTECTED] writes:
Hello. Does anyone know of a list archive for this mailing list (hopefully
with search function too)? Thanks!!
There should be a few pointers at http://www.linpro.no/lwp/
Regards,
Gisle
/betas was simply to rename the file just
before uploading.
Regards,
Gisle
On Thu, Nov 18, 1999 at 09:28:24PM +0100, Gisle Aas wrote:
[EMAIL PROTECTED] (Randal L. Schwartz) writes:
"tallwine" == tallwine [EMAIL PROTECTED] writes:
tallwine When building webchat-05 I got
example will output <xmp>use &amp;lt;</xmp> which is
kind of wrong.
Regards,
Gisle
Index: lib/HTML/TreeBuilder.pm
===
RCS file: /home/cvs/aas/perl/mods/html-tree-s/lib/HTML/TreeBuilder.pm,v
retrieving revision 1.1.1.1
diff -u -p -u
"Bob Kaminsky" [EMAIL PROTECTED] writes:
How do you get off this list ?
mailto:[EMAIL PROTECTED]
Regards,
Gisle
for subclasses that
only want the initialization part of the constructor.
Filled out DIAGNOSTICS section of the HTML::Parser POD.
Regards,
Gisle
[EMAIL PROTECTED] (Sean M. Burke) writes:
Should this do right by whatever the new Parser wants done?
Looks right to me.
HTML-Parser-3.02 should now work correctly, whatever you do :-)
Regards,
Gisle
gotten it yet -- but it seems to be
missing from others.
I'm going to try to emit an HTML-Tree 0.62 tonight; let's see if the
problem recurs.
I've seen this before. The problem is that Gisle had the name first,
so yours doesn't get indexed. It can be solved, although I don't
know what
This is the comment stripper example:
use HTML::Parser;
HTML::Parser->new(default_h => [sub { print shift }, 'text'],
                  comment_h => [""],
                 )->parse_file(shift || die) || die $!;
Regards,
Gisle
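The same comment stripper can be tried on an in-memory string instead of a file. This is only a sketch: the sample HTML and the $out accumulator are illustrative assumptions, and api_version is spelled out explicitly although the handler arguments already imply it.

```perl
use strict;
use warnings;
use HTML::Parser;

my $out = '';
my $p = HTML::Parser->new(
    api_version => 3,
    default_h   => [sub { $out .= shift }, 'text'],  # copy everything through
    comment_h   => [""],                             # empty handler: drop comments
);

# Sample input is made up for illustration.
$p->parse('<p>Hello<!-- hidden --> world</p>');
$p->eof;    # flush any buffered text

print "$out\n";
```

Collecting into a string rather than printing makes it easy to check the result before writing it anywhere.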
.
Regards,
Gisle
-Norton Allen
touch URI/file/QNX.pm
exit 0
=
*** URI-1.04.ORIG/URI/file/QNX.pm Tue Feb 8 10:20:09 2000
--- URI-1.04/URI/file/QNX.pm Tue Feb 8 10:14:02 2000
***
*** 1,0
this instead.
Regards,
Gisle
package URI::rsync; # http://rsync.samba.org/
# rsync://[USER@]HOST[:PORT]/SRC
require URI::_server;
require URI::_userpass;
@ISA=qw(URI::_server URI::_userpass);
sub default_port { 873 }
1;
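A short usage sketch, assuming a URI distribution in which a URI::rsync class like the one above is installed. The host, user and path are invented for the example.

```perl
use strict;
use warnings;
use URI;

# Hypothetical rsync URL following the rsync://[USER@]HOST[:PORT]/SRC form.
my $u = URI->new('rsync://alice@example.org/pub/src');

print "host: ", $u->host, "\n";
print "port: ", $u->port, "\n";   # falls back to default_port when none is given
print "user: ", $u->user, "\n";   # provided by URI::_userpass
print "path: ", $u->path, "\n";
```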
ot its own.
...or you can set up the $ua explicitly like this.
use LWP::Simple qw(head $ua);
$ua->proxy(...);
if (head("http://www.perl.com")) {
}
If you do this then you should consider using the full OO-interface
instead.
Regards,
Gisle
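What the full OO interface might look like for the same HEAD check, as a rough sketch. The proxy URL and the make_ua helper are hypothetical, and the request is only sent when a URL is passed on the command line.

```perl
use strict;
use warnings;
use LWP::UserAgent;

# Hypothetical helper: build a user agent with an explicit HTTP proxy.
sub make_ua {
    my ($proxy_url) = @_;
    my $ua = LWP::UserAgent->new;
    $ua->proxy(http => $proxy_url) if defined $proxy_url;
    return $ua;
}

my $ua = make_ua('http://proxy.example.com:8080/');   # assumed proxy address

# Only touch the network when a URL is given as an argument.
if (@ARGV) {
    my $res = $ua->head($ARGV[0]);
    print $res->is_success
        ? "ok\n"
        : "failed: " . $res->status_line . "\n";
}
```

Compared with importing $ua from LWP::Simple, the configuration lives in one visible object and the response carries a status line for error reporting.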
s .so-file.
Base64 is used by LWP to set up and decode basic authorisation headers.
Regards,
Gisle
c" has already been added to the most recent version
(found in HTML-Parser-3.05).
Regards,
Gisle
r
even an URL parameter, but returning to the same base URL over and
over and over... Humans can get tired of waiting and hit stop, but
to a robot this would be a black hole. There's a lot of defective
crud out there.
We have a loop counter to help us here.
Regards,
Gisle
"BAZLEY, Sebastian" [EMAIL PROTECTED] writes:
Good news - the parser builds OK and passes all tests on VMS.
Congratulations to Gisle and Michael on yet more amazing work!
Platform:
AXP/VMS 6.2, DEC-C V5.7, Perl 5.005_02.
There are just one or two minor complaints from th
local/lib/perl5/sun4-solaris/5.00404/CORE -DMARKED_SECTION
Parser.c
"./hparser.c", line 1069: warning: statement not reached
"Parser.c", line 367: warning: statement not reached
"Parser.c", line 392: warning: statement not reached
This should not really matter, but I have fixed those for 3.06.
Regards,
Gisle
+#define pTHX_
+#endif
+
static int
-magic_free_pstate(SV *sv, MAGIC *mg)
+magic_free_pstate(pTHX_ SV *sv, MAGIC *mg)
{
free_pstate(get_pstate_iv(sv));
return 0;
This patch will be in 3.06.
Regards,
Gisle
David Dyck [EMAIL PROTECTED] writes:
In the README file the following 3 lines are literally
You can join the mailing list by sending a message to
[EMAIL PROTECTED] with "subscribe" as subject.
The email I received back was:
To subscribe to the list, send a message to:
($response->header('Last-Modified')) ),
+ scalar( HTTP::Date::str2time($response->header('Expires')) ),
scalar $response->header('Server'),
);
}
I already had the following patch to HTTP::Date pending. It will also
fix the same problem.
Regards,
Gisle
he two distributions ? 5.47 comes with RedHat
6.1, so if anyone had done it with that I'd appreciate some sample code.
I can't see that anything has changed that should matter to you.
This is the complete diff.
Regards,
Gisle
$ cd lwp
$ cvs diff -u -rR5_45 -rR5
rl");
The "Unexpected field value" message is generated by
HTTP::Headers::_header. Perhaps it would be better if it just
stringified the value it got instead of croaking.
Regards,
Gisle
not handle this case?
Experiments seem to indicate that this thing has problems if the
User-Agent header is not something it expects. If you override it to
be something like "Mozilla/5.0" then it works as it should.
Regards,
Gisle
;
}
sub foo
{
#
}
sub AUTOLOAD
{
my $self = shift;
my $method = substr($AUTOLOAD, rindex($AUTOLOAD, '::')+2);
return if $method eq "DESTROY";
$self->{'uri'}->$method(@_);
}
1;
Regards,
Gisle
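A self-contained sketch of the AUTOLOAD delegation shown above. The Inner and Wrapper package names are hypothetical stand-ins; Inner plays the role of whatever object is stored under 'uri'.

```perl
use strict;
use warnings;

package Inner;    # hypothetical stand-in for the wrapped object
sub new  { my ($class, %args) = @_; return bless {%args}, $class }
sub host { return $_[0]{host} }

package Wrapper;  # hypothetical wrapper that forwards unknown methods
our $AUTOLOAD;

sub new {
    my ($class, $inner) = @_;
    return bless { uri => $inner }, $class;
}

sub AUTOLOAD {
    my $self   = shift;
    # Strip the package qualifier to get the bare method name.
    my $method = substr($AUTOLOAD, rindex($AUTOLOAD, '::') + 2);
    return if $method eq 'DESTROY';     # never forward object destruction
    return $self->{uri}->$method(@_);   # delegate to the wrapped object
}

package main;

my $w = Wrapper->new(Inner->new(host => 'www.example.com'));
print $w->host, "\n";
```

The DESTROY check matters because Perl calls DESTROY on every object going out of scope, and without it the call would be forwarded to the inner object as an ordinary method.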
[EMAIL PROTECTED] writes:
Could you provide a complete demo of the problem? Perhaps with a
minimized example of the HTML you have problems with.
Regards,
Gisle
Sure . Here is the snippet I have been having problem with : ($local_url will
have the following value after I
_unescape($_),
+ split(":", $1));
}
}
Regards,
Gisle
for:
if (my $refresh = $res->header("Refresh")) {
# do something, will also trigger refresh embedded in HTML
...
}
Regards,
Gisle
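One way the "do something" part could look, as a sketch only. The response is built by hand here so the example runs without a network; the header value and the parsing regex are assumptions about a typical "delay; URL=target" Refresh value, not something LWP does for you.

```perl
use strict;
use warnings;
use HTTP::Response;

# Hand-built response standing in for one returned by a user agent.
my $res = HTTP::Response->new(200, 'OK');
$res->header(Refresh => '5; URL=http://www.example.com/next');

my ($delay, $target);
if (my $refresh = $res->header('Refresh')) {
    # Assumed format: seconds, optionally followed by "; url=<target>".
    ($delay, $target) =
        $refresh =~ /^\s*(\d+)\s*(?:;\s*url\s*=\s*(\S+))?/i;
    print "refresh in $delay second(s)",
          defined $target ? " to $target" : "", "\n";
}
```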
o a little
further :-)
Regards,
Gisle
IO::Socket::INET'.
Regards,
Gisle
Ian Miller [EMAIL PROTECTED] writes:
On Wed, Jun 28, 2000 at 10:46:29AM +0200, Gisle Aas screived:
sub text {
my ($self, $text) = @_;
print STDERR "$text\n- END CHUNK -\n";
@_ = split(/\W/, $text); ## this causes th
"Chris Beels" [EMAIL PROTECTED] writes:
I really had to dig to find out about the cookie_jar, it would be nice if
there were some reference to it on the main LWP page.
With libwww-perl-5.48 we have this section in lwpcook.pod:
=head1 COOKIES
Some sites like to play games with cookies. By
L_hexdigit
IIRC, PL_hexdigit is one of the symbols that post date perl 5.004.
I don't have problems compiling HTML-Parser with perl5.004_05 here.
There must be something else that is the problem.
--Gisle
. Does anybody have
any ideas what is going on and what I can do to make it stop?
Regards,
Gisle
r elements with CDATA content. This
includes script, style, xmp and plaintext.
Regards,
Gisle
ssion test?
True. I even made one up just now. It will be in the next release.
Thanks again!
Regards,
Gisle
lues?
use HTTP::Request::Common qw(POST);
There is an example in lwpcook.pod
DO they need to be encoded?
Yes. If your $val variables contain the characters "&" or "=" then
the form decoder in the server will be confused.
Regards,
Gisle
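A small sketch of what the POST helper does with such values. The URL and the field name are invented; the point is only that "&" and "=" inside a value come out percent-encoded in the request body.

```perl
use strict;
use warnings;
use HTTP::Request::Common qw(POST);

# Hypothetical form target and field; the value contains both special characters.
my $req = POST 'http://www.example.com/cgi-bin/form',
               [ val => 'a&b=c' ];

print $req->content_type, "\n";
print $req->content, "\n";
```

Passing the fields as an array reference lets the module do the encoding, so the values can be given in their raw form.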
Sander van Zoest [EMAIL PROTECTED] writes:
I was using the HTTP::Date package to parse dates in MIME Messages and
seem to run into a format that wasn't supported. So I figured I would
e-mail you a patch for hopeful inclusion in the next release.
The format I am talking about is the
:LinkExtor because I want to get
used to using the more general tool.
Then you should use HTML::Parser directly. HTML::Filter is
deprecated.
Regards,
Gisle
Thanks. Code below:
code
#!/usr/local/bin/perl
use LWP::UserAgent;
$ua = new LWP::UserAgent;
# $ua->agent("$0/0.1 " .
/or domain. Then make sure 'make test' still
works.
Regards,
Gisle
'form-data' special treatment of content.
But then the output looks like:
POST http://ojo/~ed/uploadtest.cgi
Content-Length: 166
Content-Type: form-data; boundary=20340712617089'
snorkel=export%20INPUTRC%3D~%2F.inputrc%0Aexport%20CVS
_SERVER%3D%2Fusr%2Fbin%2Fcvs%0A=alias%20ll%3D'ls%20-al'
%0Astty%20erase%20'%5E%3F'%0A=%23umask%20077%20%0A%0A
=
Regards,
Gisle
ne. (Hoping
that the revised code in 5.6.0 would not require it... still no joy).
watchdog6 14:39:50 ~ $ time ./db_test.pl
^C
real 1m43.673s
user 0m0.236s
sys 0m0.048s
So what am I doing wrong?
I suggest playing with 'strace' to find some more facts about what is
really happening.
Regards,
Gisle
guess, if you wanted to, you could throw an error or return undef if
PDT doesn't match the ISO number value. Granted this would require
Time::Zone to make the comparison. On systems without Time::Zone you
could simply ignore it.
Let's just always ignore it.
Regards,
Gisle
this:
attribute value literal =
( '"' replaceable character data* '"' ) |
( "'" replaceable character data* "'" )
replaceable character data =
( character data | character reference | general entity reference | Ee )*
If you are not confused now, you ought to be :-)
Regards,
Gisle
ments from HTML::Tagset. It means that
HTML::Tagset should now preferably be installed before you install
HTML-Parser.
Regards,
Gisle
David Dyck [EMAIL PROTECTED] writes:
Patch to HTML-Parser-3.11
Thanks. Applied.
Regards,
Gisle
--- ./lib/HTML/Entities.pm.orig Thu Nov 11 01:12:20 1999
+++ ./lib/HTML/Entities.pm Mon Aug 28 17:02:38 2000
@@ -17,7 +17,7 @@
=head1 DESCRIPTION
This module deals with encoding
ust get URI::Escape. What I
want to do is translate back and forth between #DD and regular characters.
Regards,
Gisle
I never got around to that.
If not, what must one do to make it do so?
You have to write the LWP::Conn::HTTPS module. You also want to write
this as kind of a subclass of LWP::Conn::HTTP, but this is tricky
since it uses all kinds of reblessing tricks.
Regards,
Gisle
f your kit is complete...
Looks good
Warning: prerequisite HTML::Tagset failed to load: Can't locate HTML/Tagset.pm in @INC
(@INC contains: ) at (eval 4) line 3.
Regards,
Gisle
have used 6xx codes for
client errors, but then we might get into trouble if HTTP decided to
use these codes for anything else in the future.
Regards,
Gisle
c = 34,
}
);
print $r->as_string;
___END__
Regards,
Gisle
Index: lib/HTTP/Request/Common.pm
===
RCS file: /home/cvs/aas/perl/mods/libwww-perl/lib/HTTP/
This release contains experimental support for decoding of Unicode
entities. This is the only change since 3.12.
Regards,
Gisle
KIMURA Takeshi [EMAIL PROTECTED] writes:
Gisle Aas wrote on 00.9.17 1:57 AM:
This release contains experimental support for decoding of Unicode
entities. This is the only change since 3.12.
Thank you for the new release.
I built up the XS and installed it on MacPerl and tested the t
->callback }, 4069);
BTW, where did you get that 4069 number from? :-)
Regards,
Gisle
The above works in that it calls CurrentClass::datahandler, but it
then doesn't pass the content, response or protocol objects that would be
there if you called it like yea:
my $res = $UA-request($req
ram hangs when I try
to retrieve a https URL. Is there something I have missed out? I hope you
can throw some more light on how to support HTTPS in LWPng.
Try to print out stuff during connect handshake. Did you manage to
connect to the server at all? Did you manage to set up the SSL
context?
Regards,
Gisle
eld :-)
You probably want to say:
$f->value(name => undef);
Regards,
Gisle
har() expands to something bad in your setup. Try
to remove that section of code (the stuff inside if (0) { } in
report_event()) or make sure putchar() works.
Regards,
Gisle
try [EMAIL PROTECTED]. Cc-ed now.
Regards,
Gisle
the cookie is created and set in the script, not received
from another.
$h->header("Set-Cookie", "...")
Regards,
Gisle
,
Gisle
Gisle Aas [EMAIL PROTECTED] writes:
$turl->authority($1);
This does not actually work with the current URI module. Use
$turl->authority("$1");
as a workaround.
The reason for this bug is the following perl behaviour:
redo GET_TOKEN if defined($wanted) && $token->[0] ne $wanted;
substr($token->[0], 0, 0) = "/" if $type eq "E";
Seems like a correct fix. I'll make sure it makes it into the next
HTML-Parser release.
Regards,
Gisle
( $LWP_GET_WAIT || 30 );
$res = $ua->simple_request( $req, sub { die }, 100 );
The same effect can also be achieved with:
$req = HTTP::Request->new( GET => $url );
$ua->timeout( $LWP_GET_WAIT || 30 );
$ua->max_size( 1 );
$res = $ua->simple_request( $req );
Regards,
Gisle
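The max_size approach wrapped into a reusable check, as a sketch. The url_is_alive name and its default timeout are assumptions for illustration; the check reports only whether the response status counts as success and is run here only when a URL is supplied as an argument.

```perl
use strict;
use warnings;
use LWP::UserAgent;
use HTTP::Request;

# Hypothetical helper: true if a GET for $url yields a successful status.
sub url_is_alive {
    my ($url, $timeout) = @_;
    my $ua = LWP::UserAgent->new;
    $ua->timeout($timeout || 30);
    $ua->max_size(1);    # stop reading the body almost immediately
    my $res = $ua->simple_request(HTTP::Request->new(GET => $url));
    return $res->is_success ? 1 : 0;
}

# Only touch the network when a URL is given as an argument.
if (@ARGV) {
    print url_is_alive($ARGV[0]) ? "ok\n" : "not ok\n";
}
```

Because simple_request() does not follow redirects, a 3xx answer counts as "not alive" in this sketch; callers who want redirects followed would use request() instead.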
Bill Moseley [EMAIL PROTECTED] writes:
At 03:36 PM 11/14/00 -0800, Gisle Aas wrote:
Maybe there's a better way, but this is what I do to validate a URL:
$req = HTTP::Request->new( GET => $url );
$ua->timeout( $LWP_GET_WAIT || 30 );
$res = $ua->simple_request( $req, sub { die
er both 3.05 and 3.13.
HTML-Parser-3.14 will parse <!doctype ...> as a declaration. In
xml_mode the uppercase version is still needed.
Regards,
Gisle
HTML-Parser-3.14 should now be on its way to CPAN.
Jonathan Stowe discovered that HTML::HeadParser would go into an
infinite loop if passed a document without markup or "\n". The reason
was that HTML::Parser went into infinite recursion if a handler
triggered by flushing text at eof called the
compile.
I'll #if 0 it out properly for the next release.
Regards,
Gisle
I found the same problem in the egroups archive:
http://www.egroups.com/message/libwww-perl/6538
I preprocessed the file:
cpp32 -D_RTLDLL -DWIN32 -DPERL_IMPLICIT_CONTEXT -DPERL_IMPLICIT_SYS -DVERSION=\"3.
A new HTML-Parser release is now on CPAN. Changes since 3.14 are:
HTML::TokeParser's get_tag() method now takes multiple
tags to match. Hopefully the documentation is also a bit clearer.
#define PERL_NO_GET_CONTEXT: Should speed up things for thread
enabled versions of
oving HTML::HeadParser to libwww-perl would fix this. For now I just
rephrased the PREREQUISITES section of the HTML-Parser README to say:
| If you intend to use the HTML::HeadParser you probably want to install
| libwww-perl too.
Regards,
Gisle
[EMAIL PROTECTED] writes:
What was the purpose anyway? I guess for a quick fix, I can comment it out?
The purpose is to make the same code compile on perl5.004_04,
perl5.004_05, perl5.005_03 and perl-5.6.0, etc.
Regards,
Gisle
#include "patchlevel.h"
#if PATC
balls should I
find this somewhere else?
Parser.o is to be produced from Parser.xs by 'make'.
Regards,
Gisle
problem. I confess I don't really know what I'm doing ...
viz. local($SIG{ALRM}) = $timeout ? sub { close $fh; }
Try to upgrade your IO modules (including IO::Socket) from CPAN.
http://search.cpan.org/search?dist=IO
Regards,
Gisle
be a good idea to register it as a bug (or register a patch) on the
SourceForge project page.
Regards,
Gisle
not do the automatic canonicalization.
Regards,
Gisle
on for the functions and objects of the
libwww module?
The 'perldoc LWP' command should get you started. If you are on a
Unix system you might prefer 'man LWP' instead.
--Gisle
cookie_jar and a little on HTTP::Cookie, but I'm still stuck.
The 'lwpcook' document also has a little example.
Regards,
Gisle
of perl itself. Or perhaps downgrade your RedHat...
Regards,
Gisle
Upon doing a "make install" (even though 'make test' did not pass all tests)
and having the /usr/bin/GET file installed
as part of the make install i get:
-
ly print it out when there is a "text"
event. That makes your code fail unless there is text after each start
tag. Whitespace is text.
Regards,
Gisle
-- check.pl --
#!/usr/bin/perl
# some html pretty printer
use HTML::Parser;
$indent = -1;
$indentstr = " ";
$p
} elsif (!/^[.+\-\w]+:/) { # no scheme specified
- if (s/^(\w+(?:\.\w+)*)([\/:\?\#]|$)/$2/) {
+ if (s/^([-\w]+(?:\.[-\w]+)*)([\/:\?\#]|$)/$2/) {
my $host = $1;
if ($host !~ /\./ && $host ne "localhost") {
Regards,
Gisle
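The effect of the changed pattern can be seen in isolation. This sketch copies the two regular expressions from the diff above and tries them on made-up host strings; nothing else from the module is reproduced.

```perl
use strict;
use warnings;

# Patterns copied from the "-" and "+" lines of the patch.
my $old = qr/^(\w+(?:\.\w+)*)([\/:\?\#]|$)/;
my $new = qr/^([-\w]+(?:\.[-\w]+)*)([\/:\?\#]|$)/;

# Sample inputs are invented for illustration.
for my $in ('www.example.com/index.html', 'my-host.example.com/index.html') {
    my $old_host = $in =~ $old ? $1 : '(no match)';
    my $new_host = $in =~ $new ? $1 : '(no match)';
    printf "%-32s old: %-20s new: %s\n", $in, $old_host, $new_host;
}
```

With the old pattern a hyphen in the first label stops the match before a delimiter is reached, so hosts with hyphens were never recognized as hosts.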
->max_size(1) and then try a normal GET request.
You could also try to use the guess_media_type() function from
LWP::MediaTypes, but it is better to actually ask the server.
Regards,
Gisle
with the GET?
'perldoc lwpcook' has some examples of this.
Regards,
Gisle
print $res->as_string;
Regards,
Gisle
We're trying to simulate a network environment where
several users, each on their own IP-address, are accessing
the web concurrently.
nside. It is always just text.
The other elements that are parsed like this are script, xmp and
plaintext.
Regards,
Gisle
<p>Bla, bla,";
</script>
To make this correct the first print statement has to be written
something like:
print "<h1>Hello<" . "/h1>\n";
Regards,
Gisle
not official HTML any more.
Regards,
Gisle
"Gerrit P. Haase" [EMAIL PROTECTED] writes:
On 2001-01-12 at 10:33, Gisle Aas wrote:
Re: problems installing libwww-perl
"Gerrit P. Haase" [EMAIL PROTECTED] writes:
base/negotiate..FAILED test 3
Failed 1/3 tests, 66.67% okay
Looks li
"Gerrit P. Haase" [EMAIL PROTECTED] writes:
base/negotiate..FAILED test 3
Failed 1/3 tests, 66.67% okay
Looks like the bleadperl bug Andreas reported last week. What perl
were you using?
Regards,
Gisle
ley, Lee T [EMAIL PROTECTED].
Regards,
Gisle
Does anybody know anything more about this kind of escapes? Are they
somewhat standard somewhere and is it worth supporting them at this
point?
Perl5.6 can't really deal with Unicode anyway, so perhaps this is
something that can wait until at least 5.7.1.
--Gisle
Neal Probert [EMAIL
e above method to allow custom ones to be added.
So, what's the problem?
The problem is that LWP::UserAgent unconditionally overrides the
request User-Agent setting with what it finds in $ua->agent. This
bug is already registered in http://sourceforge.net/bugs/?group_id=14630
Regards,
Gisle
Dave Dunkin [EMAIL PROTECTED] writes:
My bad. Try this one.
[...]
--- Digest.pm.latest Fri Jan 19 03:02:43 2001
+++ Digest.pm Fri Jan 19 03:09:32 2001
@@ -3,6 +3,8 @@
require MD5;
+my %nonce_count;
+
sub authenticate
{
my($class, $ua, $proxy, $auth_param, $response,
Dave Dunkin [EMAIL PROTECTED] writes:
On 19 Jan 2001 10:11:41 -0800, Gisle Aas wrote:
snip
You should probably try to avoid warnings from perl about using undef
values if the response did not include any nonce value too. LWP
really ought to have a test-suite entry for this code too
existing code. Do
you think the change is a good idea?
Then I think it's best to just leave it as it was. Patch reverted.
--Gisle
Mike Stok [EMAIL PROTECTED] writes:
Worked for me...
Good. Then this will be how we do it in 3.16. I also had to add a:
#ifndef INT2PTR
#define INT2PTR(any,d) (any)(d)
#define PTR2IV(p) (IV)(p)
#endif
to get Parser.xs to still compile with earlier perls.
--Gisle