Forum Moderators: phranque
This works great for spiders, they eat up all those pages and index them apparently a lot better than if I had used a URL scheme that gives away the fact that they are dynamic. For instance, putting the date in a query string.
This is a problem, though, because a spider can wonder around in this endless site for days and find pages. Most are identical, especially as they get into the distant past or future, but most spiders don't seem to care (especially scooter). I would like to steer, or encourage the spiders to stay in the material that is somewhat current, or at least the date periods that have more useful information.
I can think of some ways to do it, for instance, use robots.txt to prevent them from hitting years other than this one (or so), or making my application return a redirect (or something) to pages well in the distant past or future, or some other hacks. I bet you guys can come up with a more creative solution, though, or perhaps have even solved something similar. Any good ideas?
You have the right idea about putting up "stoppers" in the form of dead end links such as missing years in your calendar.
You really should be careful. I have no doubt you are going to get spotted from some se. They don't take kindly to those types of db sites. You'll get spotted as dupe content on all those pages that only have different dates.