Okay, so I have dynamic links on my main page: one goes to a news script and one goes to my reviews. Both URLs have the same format:
News:
[wwwfoobar.com...]
Reviews:
[wwwfoobar.com...]
The problem is that Google can crawl the news links without trouble, but it cannot crawl the reviews. The reviews are broken down into categories, and the category pages are crawled with ease; the bot just never goes any further. For one category, though, it did crawl the reviews. The funny thing is that Google sees the URL as this:
[wwwfoobar.com...]
Personally I don't care how it does it as long as it does. My site is content-driven, with over 200 reviews, and Google only sees maybe 30 of those. Does anyone have any idea what could be causing this, or a way to get around it? I have tried various fixes, such as changing the URL to look like this:
[wwwfoobar.com...]
Google picked this up as
[wwwfoobar.com...]
This was fine, and I thought it might be a fix, but then Google dropped the pages a day later. The day after that they were back in Google, but now they have disappeared again. I am open to just about anything and will even pay someone if they can help with this.
Double-check that the page is visible to, and indexable by, a crawler.
Check your robots.txt (if you have one), and check any bot-related meta tags you may have.
Validate the page - maybe the bot isn't seeing your links because of some minor HTML error.
Make more links to your dynamic content. Each individual page should have at least two inbound links to it.
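As a rough way to do the robots.txt check mentioned above without waiting for the bot, here is a minimal Python sketch using the standard library's robots.txt parser. The robots.txt content, the /reviews/admin/ path, and the `is_allowed` helper are made up for illustration; paste in your own file and URLs.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; substitute the real file from your site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /reviews/admin/
"""


def is_allowed(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if robots_txt lets the given user agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


review_url = "http://www.foobar.com/reviews/index.php?action=fullreview&id=200"
admin_url = "http://www.foobar.com/reviews/admin/index.php"  # made-up path

print(is_allowed(ROBOTS_TXT, review_url))  # no rule matches this path
print(is_allowed(ROBOTS_TXT, admin_url))   # matched by a Disallow rule
```

This only covers robots.txt. A robots meta tag inside the page itself would still need to be checked by looking at the HTML source.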
If so, do some research into RewriteRule - it should go something like this...
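Options +FollowSymLinks
RewriteEngine On
RewriteRule ^news/index([a-z]*)-([0-9]*)\.html$ /news/index.php?action=$1&ID=$2 [L]
- in a .htaccess file in the document root the pattern is matched without the leading slash (news/); in the main server config you would need /news/ instead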
If mod_rewrite is enabled and the above code, or something similar, goes into your .htaccess (like I said, research it and get it right before you do it), you should be able to access the news stories (let's say story 99) at www.domain.com/news/indexfull-99.html
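If you want to sanity-check the pattern before touching the server, the mapping can be imitated offline. This is only a sketch of the regular expression in Python, not of mod_rewrite itself; the `rewrite` helper is a made-up name, and the optional leading slash in the pattern is there so that both the news/ and /news/ forms are covered.

```python
import re
from typing import Optional

# Same capture groups as the RewriteRule; "/?" accepts the path with or
# without the leading slash.
PATTERN = re.compile(r"^/?news/index([a-z]*)-([0-9]*)\.html$")


def rewrite(path: str) -> Optional[str]:
    """Return the dynamic URL a static-looking path maps to, or None."""
    match = PATTERN.match(path)
    if match is None:
        return None
    action, story_id = match.groups()
    return f"/news/index.php?action={action}&ID={story_id}"


print(rewrite("/news/indexfull-99.html"))  # /news/index.php?action=full&ID=99
print(rewrite("/news/other.html"))         # None
```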
It looks like you are running your links through some function that encodes your URLs. Look at the source code of the page that is output; it reads like this:
index.php?action=fullreview&id=199
or
index.php?action=fullreview&id=162
It seems like you're running your URLs through some function like htmlentities(), urlencode() or rawurlencode(), which you should NOT do. When you view your source, they should look like plain, normal URLs.
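To show what goes wrong, here is a small sketch in Python rather than PHP. Python's quote() is only a rough stand-in for the PHP functions named above, and the variable names are made up. The point is that encoding the whole link turns the "?" into %3F, so the query string is no longer a query string; only the individual parameter values should ever be encoded.

```python
from urllib.parse import quote, urlencode

BASE = "index.php"
params = {"action": "fullreview", "id": 192}

# Build the query string from the parameters: only the values get escaped,
# and the "?" and "&" separators stay intact.
good_link = BASE + "?" + urlencode(params)

# Encoding the finished link as a whole escapes the separators as well.
bad_link = quote(good_link, safe="")

print(good_link)  # index.php?action=fullreview&id=192
print(bad_link)   # index.php%3Faction%3Dfullreview%26id%3D192
```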
btw: You should replace those .jpg headers on the right columns with .gifs; that would look much cleaner.
http://www.foobar.com/reviews/index.php?action=fullreview&id=200
Google picked this up and listed the URL as
http://www.foobar.com/reviews/index.php&action=fullreview&id=200
The files were dropped from Google the day after, though :(
How could I check that the page is visible to, and indexable by, a crawler?
E.g.
http://www.foobar.com/reviews/index.php%3Faction=fullreview&id=192