|
Posted by noone on 04/03/06 22:39
CAH wrote:
> CAH skrev:
>> Hi
>>
>> Can you avoid that googlebot indexes PHPSESSID pages? Googlebot is
>> indexing pages with PHPSESSID, which makes it think my page has a
>> infinite number of pages. How can one avoid this?
>>
>> Here is an exsample of url that google register, that might make is
>> more clear what is happening
>>
>> www.winches.dk/winches.php?artnr=500735&PHPSESSID=d22126f0d46334659ff...
>> www.winches.dk/winches.php?artnr=500735&PHPSESSID=95fc5b6aed41fc142ea...
>>
>> I do use session registred ID, but if I visit my site I never see those
>> kind of urls? So how come google gets a hold of them?
>>
>> Best regards
>> Mads
> I am now testing this as a solution
> "Using .htaccess often, you need to put the following two lines in the
> ..htaccess file, if your host is using PHP as an Apache module:
> php_value session.use_only_cookies 1
> php_value session.use_trans_sid 0 "
> The downside is my site now only functions when user has cookies
> enabled, and I am still not sure whethers this will do the trick.
IIRC, google and other sites search for a file called robots.txt that give
directives on what it can and cannot index. Do a google search for
robots.txt to see... (to verify, look in your webserver log files - it
does show up as a request in my apache log files...)
If your robots.txt includes the following directive - it will skip the
entire site.
User-agent: *
Disallow: *
or to limit the scope of it's search:
User-agent: *
Disallow: /cgi-bin/
Disallow: /images/
Disallow: *.php
[Back to original message]
|