I'd be happy to share code once I clean it up. Until then, here's my
general "Rube Goldberg" approach to importing content.

1.   Create a reference table to map between the old URL and the new URL. 
2.   Spider to obtain a local copy of the old site
3.   Duplicate the directory structure under /myApp/www
3.   Have a CF program read through all of the local site, 
      for each HTML page, check to see if the reference table has an
entry for this URL,
                   If not, create a UUID and store the information in
the reference table
                   If no redirect file exists, create a redirect file 
4.   Have a CF program read each HTML file into a variable, myPage,
      Massage myPage as needed (for example, stripping out old headers
and footers) but
                  at a minimum, ensure that myPage just has the
contents within the <body>
                  tags.
       Use regular expressions to find all the internal links to other
pages on this site
       For each internal link, look it up on the Reference table and
update the link
5.    Create a Naviagation Node for the page
6.    Create a dmHTML for the page.

Some notes:

The spider I used was HTTRACK (free!) which has the ability to change
all the internal urls to absolute urls.  So '../../x.html' became
http://abc.com/y/z/x.html'.

When I created an dmHTML object, I stored the original file name with
the directory into the commentlog.  This was very helpful in
debugging.

A great deal can be done w/ regular expressions when you massage the
contents of an html file.  For example, pulling out the title to store
as the title and label in the dmHTML object.

What I currently use has some additional features for rerunning parts
or all of the old site through the process again, ignoring some pages,
aslo storing .txt files... but the above is the general concept..

On Thu, 21 Oct 2004 01:06:28 +1000, Gavin Cooney <[EMAIL PROTECTED]> wrote:

> That's very interesting. How did you do this? Any chance you'd make
> your code public so that we could learn from what you did?
> 
> 
> 
> 
> On Wed, 20 Oct 2004 10:59:15 -0400, Beth Bowden <[EMAIL PROTECTED]> wrote:
> > To add to what Mike said:
> >
> > We have two different issues. One is supporting Mac users.
> >
> > The other is not being able to use the current functionality with
> > node-heavy branches.
> >
> > We imported some 15,000 html pages into Farcry from a legacy
> > non-farcry site. We mirrored their existing site tree by adding a node
> > for each directory and a node each file. We then added a dmHTML page
> > under the node created for each file.
> >
> > Their existing site tree had hundreds of pages under in one directory.
> > Now we need to rearrange all of this into a newer organization.
> >
> > Using the existing site tree functionality doesn't help for in this
> > rearranging because of the sheer volume of nodes mirrored from the old
> > site. Once we've rearranged, the cut and paste functionality will work
> > in most cases (perhaps, all cases).
> >
> > On Thu, 21 Oct 2004 00:47:12 +1000, Mike Britton <[EMAIL PROTECTED]> wrote:
> > > We're trying to appease our Mac users, and need the ability to move nodes
> > > around without relying on dynamic HTML and JavaScript.
> > >
> > > On <nj:move> --
> > >
> > > I'm building a simple "mover" in my site's customadmin, and before I
> > > reinvent the wheel I want to see if navajo's <nj:move> tag can be used.  I'm 
> > > trying
> > > to use this tag as you do in your javascript, to no avail.  Is this tag
> > > reliant on the javascript API to function?  If so, in this case I need to
> > > call it when a form submits instead.
> > >
> > > On cut-n-paste functionality --
> > >
> > > I tried pulling overview.cfm from cvs and replacing mine with the new one
> > > containing the new cut-n-paste functionality, and got the following error
> > > from my 2.2 build:
> > >
> > > Element ADMINBUNDLE is undefined in a Java object of type class
> > > [Ljava.lang.String; referenced as
> > >
> > > *What other files* in core are necessary to enable this functionality?
> > >
> > > Thanks in advance,
> > >
> > >
> > > Mike Britton
> > >
> > >
> > >
> > >
> > > ---
> > > You are currently subscribed to farcry-dev as: [EMAIL PROTECTED]
> >
> >
> > > To unsubscribe send a blank email to [EMAIL PROTECTED]
> > > Aussie Macromedia Developers: http://lists.daemon.com.au/
> > >
> >
> > ---
> > You are currently subscribed to farcry-dev as: [EMAIL PROTECTED]
> 
> 
> > To unsubscribe send a blank email to [EMAIL PROTECTED]
> > Aussie Macromedia Developers: http://lists.daemon.com.au/
> >
> 
> ---
> You are currently subscribed to farcry-dev as: [EMAIL PROTECTED]
> To unsubscribe send a blank email to [EMAIL PROTECTED]
> Aussie Macromedia Developers: http://lists.daemon.com.au/
>

---
You are currently subscribed to farcry-dev as: [EMAIL PROTECTED]
To unsubscribe send a blank email to [EMAIL PROTECTED]
Aussie Macromedia Developers: http://lists.daemon.com.au/

Reply via email to