|  | Posted by Joel Shepherd on 06/17/05 06:57 
In article <1118946530.850531.150290@g14g2000cwa.googlegroups.com>,"Travis Newbury" <TravisNewbury@hotmail.com> wrote:
 
 > Does anyone know of a program that can crawl a website and tell what
 > files are not used any more?
 >
 > The servers are running on IIS
 
 Short answer: no, I don't.
 
 Hand-wavy answer: Are you talking static HTML files? Or image files? In
 that case, I'd be inclined to trawl the server logs, run a find on the
 web root to get a list of all files, and do a diff. I know it's not
 quite that easy on a Windows box perhaps, but that'd be the basic idea.
 
 If you're talking about some sort of server-side scripts, it might be
 possible to do the same thing, and also grep around to see which scripts
 are included in which.
 
 --
 Joel.
  Navigation: [Reply to this message] |