Re: Gedcom.pm 1.17 released

2013-01-01 Thread Stephen Woodbridge

Hi John,

Yes, I would love to get that and I'm willing to share my code once I 
have time to fix the most critical issues.


Maybe Paul would like to create a contrib directory in the repository 
that could be used to make code like this available. I guess we could 
all make public repos on github if we have code to contribute but I 
think that makes it harder to find it. And/or we could create wiki 
page(s) that describes these contrib items and how to use them.


Anyway, please send your code when you get a chance. It might be a month 
+- before I get to look at it and give it a try.


Thanks,
  -Steve

On 1/1/2013 10:37 AM, John Washburn wrote:

Dear Stephen:

I have a perl program that walks the HTML pages of e-FamilyTree.net and
pipes it out to a GEDCOM file.  It is also interruptible in that you can run
the program for a while, stop it and upon restart it will pick up where the
search left off.

It uses the HTML:Tree builder and the wonderful look-down functionality
mentioned by Ron.  It also used Date::Manip quite heavily.

I would be happy to zip up the code and send the archive to you if you are
interested.  The code is a bit bloated because the code grew over time and I
had to add special cases for some of the errors in the e-familynet HTML
structure and to implement the 1-generation look ahead.


-Original Message-
From: Ron Savage [mailto:r...@savage.net.au]
Sent: Saturday, December 29, 2012 10:06 PM
To: perl-gedcom@perl.org
Subject: Re: Gedcom.pm 1.17 released

Hi Stephen

On 30/12/12 11:03, Stephen Woodbridge wrote:

On 12/29/2012 5:26 PM, Paul Johnson wrote:



What I noticed was that the data way nicely tag in the HTML so I am
writing a parser to read the HTML can generate a Gedcom file. I have
the basics working, but I have to do more work on it to fix bugs and
collect more of the data than I current am. I'm side tracked with work
at the moment so it is on hold. When I'm done it will have generated a
40K+ person Gedcom file. This should be able able to create a gedcom
from any Second Site generated website assuming it is similar to the
link above. Or you can ask the site owner for a copy of the gedcom :),
but this seemed like a worth challenge at the time.


Are you using HTML::TreeBuilder and the v-e-r-y nice look_down() method?

--
Ron Savage
http://savage.net.au/
Ph: 0421 920 622
-
No virus found in this message.
Checked by AVG - www.avg.com
Version: 2013.0.2805 / Virus Database: 2637/5997 - Release Date: 12/30/12





RE: Gedcom.pm 1.17 released

2013-01-01 Thread John Washburn
Dear Steven:

The code is currently in an SVN repo on XP-Dev.  I could make the repo
public, but the code in its current stage is definitely what I call work
bench code.  It works and does the task for which it is designed, but, even
though I plan to use it for a long time, the code was written with the idea
that it would not leave my workbench.  The next step up would be
production-ready code.

I liken this kind of code to the improvised and one-off tools my carpenter
and machinists friends have.  The tools work and work well, but the tools
were not made with the intent to distribute beyond their work-shop.

Production of code (or tools) requires you to factor in many maintenance and
support factors that the workbench tools don't have.

But, just like I can borrow the custom work-bench vise my machinist friend
has, I can certainly share this bit of work bench code.  Let me collect up
the pieces into a ZIP archive.


-Original Message-
From: Stephen Woodbridge [mailto:wood...@swoodbridge.com] 
Sent: Tuesday, January 01, 2013 9:49 AM
To: j...@washburnresearch.org
Cc: perl-gedcom@perl.org; 'Ron Savage'
Subject: Re: Gedcom.pm 1.17 released

Hi John,

Yes, I would love to get that and I'm willing to share my code once I have
time to fix the most critical issues.

Maybe Paul would like to create a contrib directory in the repository that
could be used to make code like this available. I guess we could all make
public repos on github if we have code to contribute but I think that makes
it harder to find it. And/or we could create wiki
page(s) that describes these contrib items and how to use them.

Anyway, please send your code when you get a chance. It might be a month 
+- before I get to look at it and give it a try.

Thanks,
   -Steve

On 1/1/2013 10:37 AM, John Washburn wrote:
 Dear Stephen:

 I have a perl program that walks the HTML pages of e-FamilyTree.net 
 and pipes it out to a GEDCOM file.  It is also interruptible in that 
 you can run the program for a while, stop it and upon restart it will 
 pick up where the search left off.

 It uses the HTML:Tree builder and the wonderful look-down 
 functionality mentioned by Ron.  It also used Date::Manip quite heavily.

 I would be happy to zip up the code and send the archive to you if you 
 are interested.  The code is a bit bloated because the code grew over 
 time and I had to add special cases for some of the errors in the 
 e-familynet HTML structure and to implement the 1-generation look ahead.


 -Original Message-
 From: Ron Savage [mailto:r...@savage.net.au]
 Sent: Saturday, December 29, 2012 10:06 PM
 To: perl-gedcom@perl.org
 Subject: Re: Gedcom.pm 1.17 released

 Hi Stephen

 On 30/12/12 11:03, Stephen Woodbridge wrote:
 On 12/29/2012 5:26 PM, Paul Johnson wrote:

 What I noticed was that the data way nicely tag in the HTML so I am 
 writing a parser to read the HTML can generate a Gedcom file. I have 
 the basics working, but I have to do more work on it to fix bugs and 
 collect more of the data than I current am. I'm side tracked with 
 work at the moment so it is on hold. When I'm done it will have 
 generated a
 40K+ person Gedcom file. This should be able able to create a gedcom
 from any Second Site generated website assuming it is similar to 
 the link above. Or you can ask the site owner for a copy of the 
 gedcom :), but this seemed like a worth challenge at the time.

 Are you using HTML::TreeBuilder and the v-e-r-y nice look_down() method?

 --
 Ron Savage
 http://savage.net.au/
 Ph: 0421 920 622
 -
 No virus found in this message.
 Checked by AVG - www.avg.com
 Version: 2013.0.2805 / Virus Database: 2637/5997 - Release Date: 
 12/30/12


-
No virus found in this message.
Checked by AVG - www.avg.com
Version: 2013.0.2805 / Virus Database: 2637/6001 - Release Date: 01/01/13



Re: Gedcom.pm 1.17 released

2013-01-01 Thread Philip Durbin

On 01/01/2013 04:46 PM, Ron Savage wrote:

Ideally such code would be made CPAN-ready before release, without - of
course - forcing the author to make it production-ready.


That's what http://prepan.org is for! :)

Phil