Dear Steven: The code is currently in an SVN repo on XP-Dev. I could make the repo public, but the code in its current stage is definitely what I call "work bench code". It works and does the task for which it is designed, but, even though I plan to use it for a long time, the code was written with the idea that it would not leave my workbench. The next step up would be production-ready code.
I liken this kind of code to the improvised and one-off tools my carpenter and machinists friends have. The tools work and work well, but the tools were not made with the intent to distribute beyond their work-shop. Production of code (or tools) requires you to factor in many maintenance and support factors that the workbench tools don't have. But, just like I can borrow the custom work-bench vise my machinist friend has, I can certainly share this bit of "work bench code". Let me collect up the pieces into a ZIP archive. -----Original Message----- From: Stephen Woodbridge [mailto:wood...@swoodbridge.com] Sent: Tuesday, January 01, 2013 9:49 AM To: j...@washburnresearch.org Cc: perl-gedcom@perl.org; 'Ron Savage' Subject: Re: Gedcom.pm 1.17 released Hi John, Yes, I would love to get that and I'm willing to share my code once I have time to fix the most critical issues. Maybe Paul would like to create a "contrib" directory in the repository that could be used to make code like this available. I guess we could all make public repos on github if we have code to contribute but I think that makes it harder to find it. And/or we could create wiki page(s) that describes these contrib items and how to use them. Anyway, please send your code when you get a chance. It might be a month +- before I get to look at it and give it a try. Thanks, -Steve On 1/1/2013 10:37 AM, John Washburn wrote: > Dear Stephen: > > I have a perl program that walks the HTML pages of e-FamilyTree.net > and pipes it out to a GEDCOM file. It is also interruptible in that > you can run the program for a while, stop it and upon restart it will > pick up where the search left off. > > It uses the HTML:Tree builder and the wonderful look-down > functionality mentioned by Ron. It also used Date::Manip quite heavily. > > I would be happy to zip up the code and send the archive to you if you > are interested. The code is a bit bloated because the code grew over > time and I had to add special cases for some of the errors in the > e-familynet HTML structure and to implement the 1-generation look ahead. > > > -----Original Message----- > From: Ron Savage [mailto:r...@savage.net.au] > Sent: Saturday, December 29, 2012 10:06 PM > To: perl-gedcom@perl.org > Subject: Re: Gedcom.pm 1.17 released > > Hi Stephen > > On 30/12/12 11:03, Stephen Woodbridge wrote: >> On 12/29/2012 5:26 PM, Paul Johnson wrote: > >> What I noticed was that the data way nicely tag in the HTML so I am >> writing a parser to read the HTML can generate a Gedcom file. I have >> the basics working, but I have to do more work on it to fix bugs and >> collect more of the data than I current am. I'm side tracked with >> work at the moment so it is on hold. When I'm done it will have >> generated a >> 40K+ person Gedcom file. This should be able able to create a gedcom >> from any "Second Site" generated website assuming it is similar to >> the link above. Or you can ask the site owner for a copy of the >> gedcom :), but this seemed like a worth challenge at the time. > > Are you using HTML::TreeBuilder and the v-e-r-y nice look_down() method? > > -- > Ron Savage > http://savage.net.au/ > Ph: 0421 920 622 > ----- > No virus found in this message. > Checked by AVG - www.avg.com > Version: 2013.0.2805 / Virus Database: 2637/5997 - Release Date: > 12/30/12 > ----- No virus found in this message. Checked by AVG - www.avg.com Version: 2013.0.2805 / Virus Database: 2637/6001 - Release Date: 01/01/13