Reply to Re: Can you avoid that googlebot indexes PHPSESSID pages?

Your name:

Reply:


Posted by Chung Leong on 04/03/06 17:24

CAH wrote:
> Hi
>
> Can you avoid that googlebot indexes PHPSESSID pages? Googlebot is
> indexing pages with PHPSESSID, which makes it think my page has a
> infinite number of pages. How can one avoid this?

Well, one way to handle this is to check the User-Agent header to see
if the client is Googlebot and not enable session. Obviously if a page
is dependent on session then it ceases to be indexible.

> Here is an exsample of url that google register, that might make is
> more clear what is happening
>
> www.winches.dk/winches.php?artnr=500735&PHPSESSID=d22126f0d46334659ff...
> www.winches.dk/winches.php?artnr=500735&PHPSESSID=95fc5b6aed41fc142ea...
>
> I do use session registred ID, but if I visit my site I never see those
> kind of urls? So how come google gets a hold of them?

If session.use_trans_sid is enabled, then PHP tries to compensate for
the lack of cookie by inserting the session id into any possible links.

I think you have quite a problem on your hand. Once those links are in
Google's database, the bot will keep returning to them. You'll need to
detect the condition and tell Googlebot to buzz off so it doesn't eat
up your bandwidth quota.

[Back to original message]


Удаленная работа для программистов  •  Как заработать на Google AdSense  •  England, UK  •  статьи на английском  •  PHP MySQL CMS Apache Oscommerce  •  Online Business Knowledge Base  •  DVD MP3 AVI MP4 players codecs conversion help
Home  •  Search  •  Site Map  •  Set as Homepage  •  Add to Favourites

Copyright © 2005-2006 Powered by Custom PHP Programming

Сайт изготовлен в Студии Валентина Петручека
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация