Re: Copying Website Contents, esp. Message Boards — HTML

You are here: Re: Copying Website Contents, esp. Message Boards « HTML « IT news, forums, messages

Posted by Phil Earnhardt on 02/07/06 22:22

On Tue, 7 Feb 2006 19:28:31 +0000, "Alan J. Flavell"
<flavell@physics.gla.ac.uk> wrote:

>On Tue, 7 Feb 2006, Phil Earnhardt wrote:
>
>> If the queries are wired into the HTML links of the pages you wish
>> to grab, the automated tools to recursively capture an entire
>> website may be able to pull them down.
>
>You'd better not try that on a wpoison web site! ;-)
>http://www.monkeys.com/wpoison/

Go look at the "safety" page on that site.

wpoison uses the Robot Exclusion Protocol already discussed here; only
programs that ignore the robots.txt guidelines that should wind up in
an infinite maze of twisty passages -- all different.

Now, it's a certainty that there are poisoned sites that don't honor
the REP; one certainly does have to be careful doing such things. And,
you're right: in general, it's a pretty pointless (and potentially
risky) operation to go around grabbing copies of websites.

--phil

Navigation:

Next in forum: Re: putting a banner at the top of a page
Prev in forum: Re: Copying Website Contents, esp. Message Boards
Thread view: Re: Copying Website Contents, esp. Message Boards

[Reply to this message]

Удаленная работа для программистов • Как заработать на Google AdSense • England, UK • статьи на английском • PHP MySQL CMS Apache Oscommerce • Online Business Knowledge Base • DVD MP3 AVI MP4 players codecs conversion help

Home • Search • Site Map • Set as Homepage • Add to Favourites

Сайт изготовлен в Студии Валентина Петручека —
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация