aspx, "?", " = " signs and search engines

how crawlers crawl the site

         

Nova

6:10 pm on Nov 4, 2003 (gmt 0)

10+ Year Member



Hi everyone.

I am thinking about making changes to my website, and frankly I don't know how search engines deal with paths like the following: [websitename...]

Do crawlers follow the physical path, www.websitename/folder/folder/filename.aspx, or do they know nothing about the physical path and follow only the URL described above?

P.S. The structure of the website is stored in an XML model, and I don't know what will happen if I change the physical path to a page but leave the XML model intact.

Any help is appreciated.

Thanks,
Nova

korkus2000

6:39 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to WebmasterWorld, Nova.

Search engines do crawl these URLs as long as they contain no session IDs and not too many name/value pairs. Try to keep each URL under 3 name/value pairs. Your URL is laid out like the URLs of MSDN's pages, and those are crawled all over the web.
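
To make that rule of thumb concrete, here is a minimal Python sketch that counts the name/value pairs in a URL and flags likely session IDs. The example URLs, the `check_url` helper, and the list of session parameter names are all assumptions for illustration; the 3-pair limit is just the guideline above, not something a search engine publishes.

```python
from urllib.parse import urlsplit, parse_qsl

# Hypothetical parameter names that often carry session IDs; adjust for your site.
SESSION_KEYS = {"sessionid", "sid", "phpsessid", "jsessionid", "aspsessionid"}


def check_url(url, max_pairs=3):
    """Report how many name/value pairs a URL has and whether one looks like a session ID."""
    pairs = parse_qsl(urlsplit(url).query, keep_blank_values=True)
    names = [name.lower() for name, _ in pairs]
    return {
        "pair_count": len(pairs),
        # "Under 3 pairs" means 3 or more is already too many.
        "too_many_pairs": len(pairs) >= max_pairs,
        "has_session_id": any(name in SESSION_KEYS for name in names),
    }


if __name__ == "__main__":
    # Made-up URLs, since the real one in this thread is not shown in full.
    print(check_url("http://www.example.com/default.aspx?section=2&page=5"))
    print(check_url("http://www.example.com/default.aspx?a=1&b=2&c=3&sid=abc"))
```

Running something like this over a list of your own URLs is a quick way to spot the ones most likely to give crawlers trouble.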

<edit>Fixed some BAD typos.</edit>

[edited by: korkus2000 at 8:06 pm (utc) on Nov. 4, 2003]

Nova

7:30 pm on Nov 4, 2003 (gmt 0)

10+ Year Member



Many thanks, korkus2000.

So as long as I don't leave the physical path anywhere, crawlers will not know about it. Right?

I have one more question. I have read about the penalties for mirroring pages and websites, and everything is clear except one thing:

Let's say that many URLs point to pages that have been removed from our website, so we placed a customized page there: "page could not be found, it was removed.... please ...".

If search engines follow those URLs and see the same page with the same content under different URLs (the redirect is implemented in a script, so they most likely don't know a redirect happened), can we be penalized?

Thanks again,
Nova

korkus2000

8:02 pm on Nov 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Make sure that your error page returns a 404 status code in the HTTP response header. If it doesn't, then yes, you could run into some problems, because every removed URL would look like a live page with the same content.
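
One way to check this is to request a removed URL and look at the status code rather than at the page itself. The sketch below is only an illustration in Python: `fetch_status`, `DemoHandler`, `run_demo`, and the `/good-error` and `/soft-error` paths are made-up names, and the toy local server stands in for a real site so the example runs without network access.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlsplit


def fetch_status(url):
    """Return the status code of the first response for url (redirects are not followed)."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.hostname, parts.port, timeout=5)
    else:
        conn = http.client.HTTPConnection(parts.hostname, parts.port, timeout=5)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("GET", path)
        return conn.getresponse().status
    finally:
        conn.close()


class DemoHandler(BaseHTTPRequestHandler):
    """Toy server: the same error text everywhere, but only /good-error sends 404."""

    def do_GET(self):
        body = b"Page could not be found, it was removed."
        self.send_response(404 if self.path == "/good-error" else 200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the demo quiet


def run_demo():
    """Start the toy server on a free local port and return both status codes."""
    server = HTTPServer(("127.0.0.1", 0), DemoHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    base = "http://127.0.0.1:%d" % server.server_port
    try:
        return fetch_status(base + "/good-error"), fetch_status(base + "/soft-error")
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    print(run_demo())
```

Both paths show the visitor the same "not found" text, but only the first one tells a crawler that the page is gone. Pointing `fetch_status` at one of your own removed URLs shows which case your site is in.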

>So as long as I don't leave the physical path anywhere, crawlers will not know about it. Right?

If no links point to the physical path, then they won't know about it. They just index the URLs that they find; they do not interpret them at all.
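
As a rough illustration of "they just index the URLs that they find", the Python sketch below collects the links from a page the way a very simple crawler might. The `LinkCollector` class, the `discover` helper, and the example URLs are assumptions made up for this example; real crawlers are far more elaborate.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against the page's own URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.found = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.found.append(urljoin(self.base_url, value))


def discover(base_url, html):
    """Return the URLs a link-following crawler would learn from this page."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.found


if __name__ == "__main__":
    # Made-up page: it links only to query-string URLs, never to a physical path.
    page = (
        '<a href="default.aspx?section=2">Products</a>'
        '<a href="/default.aspx?section=3&amp;page=1">More</a>'
    )
    for url in discover("http://www.example.com/", page):
        print(url)
```

Nothing in the output mentions a folder or file on disk, because no link on the page points to one. A URL that is never linked anywhere simply never enters the list.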

Nova

8:15 pm on Nov 4, 2003 (gmt 0)

10+ Year Member



That's what I was looking for! Many thanks.

Nova