Re: Can you avoid that googlebot indexes PHPSESSID pages?

Posted by CAH on 12/20/94 11:44

R. Rajesh Jeba Anbiah wrote:

> CAH wrote:
> > > > www.winches.dk/winches.php?artnr=500735&PHPSESSID=d22126f0d46334659ff...
> > > > www.winches.dk/winches.php?artnr=500735&PHPSESSID=95fc5b6aed41fc142ea...
> > >
> > > Such a change in session id shouldn't happen in a normal site.
> >
> > Why not? I would think a session ID should be unique. If you think I am
> > doing something wrong, what could that be then?
>
> It shouldn't happen in a single session; the session id remains the same
> for a single session unless:
> 1. The crawler returns and caches pages over multiple runs

I would think this is what happens.

> 2. You have used session_regenerate_id()
> 3. There are random absolute links pointing from your site to your
> site (instead of relative links)
>
> > Also,
> > > AFAIK Google will remove the PHPSESSID from URL (after crawling(?)).
> >
> > You can try this search in Google: site:www.winches.dk
> >
> > or click here
> >
> > http://www.google.com/search?q=site:www.winches.dk&num=100&hl=en&lr=&safe=off&filter=0
> >
> > Look at the last 100 entries or so.
>
> It doesn't seem to strip the session id as I thought. If your site
> content doesn't rely on sessions (for non-members), you may safely turn
> off trans sid <news:1111603962.594721.154710@l41g2000cwc.googlegroups.com> (
> http://groups.google.com/group/comp.lang.php/msg/ce24f27f2b7ac610 )
> --you can even turn it off selectively for the crawler only, by sniffing
> the user agent string and/or IP.
>
> But, if your site depends on sessions (for non-members and hence the
> crawler)

It does depend on sessions for non-members.

> and you'd like to enable sessions for the crawler, but don't want
> the trans sid, you need to go for some other hack. If that is your
> situation, I may help you with the hack.

Thanks, that is very kind of you. I think the robots.txt might be doing
the trick, and then no further tricks or hacks should be needed. But I
am following Google closely. Now, session.use_trans_sid, what does that
do? Does it not turn off sessions in the URL and force cookies on the users?
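
As far as the PHP manual describes it, session.use_trans_sid does not switch sessions off. It only controls whether PHP transparently appends the session id to relative URLs and forms. With it disabled, the session id travels by cookie, so a visitor or crawler that refuses cookies gets a fresh session on every request. A minimal sketch of the relevant php.ini settings, assuming cookie-based sessions are acceptable for the site:

```ini
; Do not rewrite URLs and forms to carry the session id (no PHPSESSID=... in links).
session.use_trans_sid = 0
; Keep sessions working by storing the session id in a cookie.
session.use_cookies = 1
; Optional and stricter: ignore session ids passed in the URL altogether.
session.use_only_cookies = 1
```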


I found this at another site

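// Run this before session_start() so the setting applies to this request.
$ua = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : '';
// stripos() matches case-insensitively ("Googlebot", "googlebot", ...).
if (stripos($ua, "google") !== false || stripos($ua, "MSIECrawler") !== false)
{
    // With no tags listed, the URL rewriter has nothing to add PHPSESSID to.
    ini_set("url_rewriter.tags", "");
}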

http://www.mtdev.com/2002/06/why-you-should-disable-phps-session-use_trans_sid/

But I have not tested it.
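
A variation on the same idea, as a rough sketch only: instead of emptying url_rewriter.tags, it switches session.use_trans_sid off for requests that look like a crawler. The function name looks_like_crawler() and the token list are made up for illustration and are not an exhaustive crawler list. It assumes PHP 7 or later (for the type hints and the ?? operator) and that it runs before session_start().

```php
<?php
// Hypothetical helper: returns true when the User-Agent looks like a crawler.
// The token list is an assumption for illustration, not a complete list.
function looks_like_crawler(string $userAgent): bool
{
    $tokens = ['googlebot', 'msiecrawler'];
    foreach ($tokens as $token) {
        // stripos() is case-insensitive, so "Googlebot" and "googlebot" both match.
        if (stripos($userAgent, $token) !== false) {
            return true;
        }
    }
    return false;
}

// The header may be missing, so fall back to an empty string.
$userAgent = $_SERVER['HTTP_USER_AGENT'] ?? '';

if (looks_like_crawler($userAgent)) {
    // Must run before session_start(); stops PHP from appending PHPSESSID to URLs.
    ini_set('session.use_trans_sid', '0');
}
```

The crawler still gets a session this way, but since it does not keep cookies, each of its requests starts a new one. Whether that is acceptable depends on what the site stores in the session for non-members.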

>
> --
> <?php echo 'Just another PHP saint'; ?>
> Email: rrjanbiah-at-Y!com Blog: http://rajeshanbiah.blogspot.com/

 
