Hi,

I'm using the new version of the Plucker Desktop, and I'm trying to Pluck an
article that spans several pages, but I'm not having much luck, could someone
point out what I'm doing wrong?

The article is at:
"The Next Ages of Game Development"
http://avault.com/developer/getarticle.asp?name=bsawyer1

and each page is linked thus:
http://avault.com/developer/getarticle.asp?name=bsawyer1&page=2
http://avault.com/developer/getarticle.asp?name=bsawyer1&page=3
etc etc.

I set 
http://avault.com/developer/getarticle.asp?name=bsawyer1
as the URL, and Maximum Depth to 22(!), "Ignore links to a server that is
different that is different from a starting page's server" to TRUE and used an
URL pattern filter as ".*avault.com/developer/getarticle.asp?name=bsawyer1.*".

The output is as follows:

<snip>
Initializing Plucker spidering engine...
 
-----------------------------------------------------------
Updating channel: Ages of Development...
-----------------------------------------------------------
Pluckerdir is 'E:\Plucker-Desktop'...
Using proxy '' with authentication for user ''...
ZLib compression turned on
Using exclusion list E:\Plucker-Desktop\exclusionlist.txt
Using exclusion list E:\Plucker-Desktop\exclusionlist.txt
Regexp pattern is '.*avault.com/developer/getarticle.asp?name=bsawyer1.*'
---- 0 collected, 1 to do ----
Processing http://avault.com/developer/getarticle.asp?name=bsawyer1...
  Retrieved ok.
  Not fetching image http://avault.com/images/layout/avault.gif
  Not fetching image http://avault.com/images/layout/page_developer.gif
  Not fetching image http://avault.com/images/layout/spacer.gif
  Not fetching image http://avault.com/images/layout/menu_logo-anim.gif
  Not fetching image http://avault.com/images/layout/menu_sections.gif
  Not fetching image http://avault.com/images/layout/menu_inside.gif
  Not fetching image http://avault.com/images/layout/menu_site.gif
  Not fetching image http://avault.com/images/layout/spacer.gif
  Not fetching image http://avault.com/developer/images/bsawyer11a.jpg
  Not fetching image http://avault.com/images/layout/next.gif
  Not fetching image http://avault.com/images/layout/spacer.gif
  Parsed ok; added 1 document link.
---- 1 collected, 1 to do ----
Processing mailto:[EMAIL PROTECTED]...
  Retrieved ok.
  Parsed ok.
---- all 2 pages retrieved and parsed ----
Writing out collected data...
Writing document 'Ages of Development' to file 
E:\Plucker-Desktop\channels/AgesofDevelopment/AgesofDevelopment.pdb
Converting mailto:[EMAIL PROTECTED]...
Converted   11:  mailto:[EMAIL PROTECTED]
Converting http://avault.com/developer/getarticle.asp?name=bsawyer1...
Converted    2:  http://avault.com/developer/getarticle.asp?name=bsawyer1
Default charset is MIBenum 4 (ISO-8859-1)
New document <PluckerIndexDocument 'plucker:/~special~/index' at 12000940> added
Converted    1:  plucker:/~special~/index
New document <PluckerMetadataDocument 'plucker:/~special~/metadata' at 12000492> added
Converted    5:  plucker:/~special~/metadata
Wrote 1 <= plucker:/~special~/index
Wrote 2 <= http://avault.com/developer/getarticle.asp?name=bsawyer1
Wrote 5 <= plucker:/~special~/metadata
Wrote 11 <= mailto:[EMAIL PROTECTED]
Done!
Installing channel output to destinations...
Setting channels new due date
Tasks completed for all channels.

</snip>

I'm sure its something to do with the regex - can anyone help?

Thanks for your time, and my apologies for the length of the email.

Cheers,
Ian

-- 
fortune says:

In an orderly world, there's always a place for the disorderly.

~~~~~~~~~~~ Made in Ireland using GNU Emacs ~~~~~~~~~~~
Ian Swainson          Kia Ora!          [EMAIL PROTECTED]
~~~~~~~~~~~~~~~~ http://www.clients.ie ~~~~~~~~~~~~~~~~

_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to