Welcome to WebmasterWorld Guest from 54.196.244.206

Forum Moderators: phranque

Message Too Old, No Replies

Millions of duplicate pages, JSESSIONID

Are robots seeing ;JSESSIONID=#*$! as unique URLs?

     
8:44 am on Dec 3, 2006 (gmt 0)

Junior Member

10+ Year Member

joined:Aug 9, 2003
posts:118
votes: 0


I'm getting bit by this issue:
http#//example.com/post/javas-seo-blunder-jsessionid

Google, Alexa and Exalead are each crawling each and every page of my site dozens of times each day. Good you say? Bad, because they're getting pages that differ ONLY in the URL:

93.47.80.51 - - [28/Nov/2006:16:05:36 -0800] "GET
/awards.do;jsessionid=68B86DFF8E4A8597B210531C3431965D HTTP/1.1" 200
17195 "-" "Exabot/3.0"
193.47.80.51 - - [28/Nov/2006:16:17:30 -0800] "GET
/awards.do;jsessionid=0621414681C92E1A00A9428A7800AC30 HTTP/1.1" 200
17195 "-" "Exabot/3.0"
193.47.80.51 - - [28/Nov/2006:17:00:36 -0800] "GET
/awards.do;jsessionid=0079FCD91ED8E5B86902228D285CCEEF HTTP/1.1" 200
17195 "-" "Exabot/3.0"
193.47.80.51 - - [28/Nov/2006:20:41:50 -0800] "GET
/awards.do;jsessionid=DE9B61384D3D75DE9EB38A21F066E433 HTTP/1.1" 200
17195 "-" "Exabot/3.0"

So folks: watch out for this. You may have to move away from Tomcat if you want to solve this issue.

[edited by: txbakers at 11:39 am (utc) on Dec. 4, 2006]
[edit reason] examplified URL [/edit]

1:39 pm on Dec 3, 2006 (gmt 0)

Administrator

WebmasterWorld Administrator phranque is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Aug 10, 2004
posts:10542
votes: 8


not sure what tomcat is but you must disable sessions for bots - especially the search spiders.
11:40 am on Dec 4, 2006 (gmt 0)

Senior Member

WebmasterWorld Senior Member txbakers is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Sept 1, 2001
posts:4392
votes: 0


Tomcat is an open source JSP application server
 

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week

Featured Threads

Free SEO Tools

Hire Expert Members