I think the problem of remove file b if file a is the only file that calls
it could be solved with recursion, ie, running the process several times.
First time will remove file a, second time will remove file b.  Not as
efficient, but the algorithm for calculating which files need to be deleted
is also a lot simpler.  

-----Original Message-----
From: S. Isaac Dealey [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, August 31, 2005 3:59 PM
To: CF-Talk
Subject: Re: [SOT] Finding which files in a site aren't used anymore

> I'm back to a problem that I run into a lot. I started
> working at a
> place that does not do a good job of cleaning up files in
> a websites
> directory. (as well as tons of unused stored procedures
> and views)
> I've found plenty of tools to track down database objects,
> even some
> that check your files for calls to the DB objects; I just
> can't find
> anything for cleaning up the files. I know I can get a
> list of all the
> files in all sub directories and grep for each, but I'm
> hoping someone
> can recommend me a nicer GUI utility instead. Another nice
> feature
> would be smarter than grep and if file A calls file B but
> nothing
> calls file A then they are both obsolete. Does anyone know
> of a
> utility like this. In the past I've used grep or similar
> but in this
> case there are just too many files.

You also need to consider the possibility of template names being
stored in the database -- in "url" columns if not columns indicating
include template paths... Unfortunately I don't have any
recommendations -- I use regular expression searches in dreamweaver if
I want to check for more than one file name, but that obviously
doesn't handle the semantic user-interface you're talking about where
you want the IDE to know when it's looking for file a that file b
should be removed if only file a references it. I would think also
that for the tool to be really effective, it mind need to have some
comprehension of the syntax of the language... though even then it may
be questionable if there are variables used in paths, i.e.
href="#approot#/path/to/file.cfm". An IDE that isn't tied _directly_
to CF and/or knows the web-root and uses an http engine to figure out
how those url's resolve won't be able to know what file is being
referenced. Which ultimately may be a good part of the reason why you
haven't found any good tools to perform this task.

Sorry I couldn't be more helpful...

I would recommend checking the db(s) for any columns with the terms
"URL" or "HREF" or "link" in the column name...

s. isaac dealey   954.522.6080
new epoch : isn't it time for a change?

add features without fixtures with
the onTap open source framework

http://www.fusiontap.com
http://coldfusion.sys-con.com/author/4806Dealey.htm






~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Logware (www.logware.us): a new and convenient web-based time tracking 
application. Start tracking and documenting hours spent on a project or with a 
client with Logware today. Try it for free with a 15 day trial account.
http://www.houseoffusion.com/banners/view.cfm?bannerid=67

Message: http://www.houseoffusion.com/lists.cfm/link=i:4:217049
Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/4
Subscription: http://www.houseoffusion.com/lists.cfm/link=s:4
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4
Donations & Support: http://www.houseoffusion.com/tiny.cfm/54

Reply via email to