|
Posted by Alan J. Flavell on 06/07/06 13:33
On Wed, 7 Jun 2006, RICHARD BROMBERG wrote:
> Their work habits were no better than mine and now there are dozens
> of images that were uploaded to the Host and likewise dozens of old
> HTML files that are completely orphaned, i.e. nothing references
> them.
>
> Does anyone have a suggestion or know of a utility that will
> identify these unused files so I can clean up the site.
Recent versions of Xenu link checker can do that
http://home.snafu.de/tilman/xenulink.html
A fast and generally recommended link checker, but it has some known
shortcomings. It uses Windows' own routines for accessing URLs, and
they perform some silent fixups (e.g correcting "\" to "/") before the
link checker gets to see them, which means those kind of broken link
cannot be found.
See http://members.chello.nl/f.visser3/xenu/10-orphaned-files.html for
thirdparty documentation.
However, a comment I found elsewhere says that the ophan check doesn't
recognise images that are only called-out from a stylesheet.
Personally, when getting rid of believed-to-be-orphaned files, I
set their filemode on the server so that the HTTPD cannot read them,
and then watch out for any corresponding errors in the logs for a
while, before finally deleting them. That way, it's easy to reinstate
any that weren't really orphaned.
h t h
Navigation:
[Reply to this message]
|