Forum Moderators: open
This is my first full post on this forum. I have been working as an SEO executive for 6 months, and recently i have been told by a web-designer friend that search engine spiders, especially from Google, find it difficult to read tables, that the crawl speed is very reduced. Through leaning basic HTML etc tables are my core design technique as my skills are limited. Should i creating pages without tables to allow better crawling of my site?
Kind Regards
I think I see the problem.
>very reduced
My guess is that his theory is that the html is often heavier with tables therefore it must slow the spider. From a practical standpoint, at least for basic seo, I've not known tables to cause any problems. I suppose that in very, very competitive serps -everything else being the same- one might argue that table-less design might edge out a table-based page. The trouble is, of course, that everything else is never the same.
Tell him to stick with his color palette.
>tell him to stick with his colour palette
:) i might just do that!
If I was writing a search engine, I would preserve all of the HTML in my internal copy. The aim would be for all of the tags to be recognised, even if the current algorithm did not interpret the use of a particular tag or element. Titles, headings, links - they might be some of the first tags I'd start trying to interpret, but I'd like to be able to interpret all of them. CSS classes and IDs too ;)
I've also never been that concerned by HTML/text ratio. Either a tag gets interpreted, or it gets ignored. Badly formed tags get mis-interpreted, but other than that, I don't think sloppy code is a major barrier. Maybe a missed opportunity, but not really a barrier, unless it's seriously malformed.
It would be nice to be able to assume that a table contained tabular data, I guess.
I remember years ago when participating in discussions about using <table>s. There were some inherent problems back then when you enclosed the entire design in one encompassing <table>. The UA had to parse that entire <table> before displaying the contents within. Users would end up with a blank page for a second or two while the parsing took place. Back then, the recommendation was to break the site up into multiple tables so they would display one at a time and the user would not be left with a blank page on first load.
I would think that using a tabled layout for today's Internet is against best practice and may also be void of any "real" semantics. I'd like to think that the SEs are processing html markup as it was intended to be processed. I think they always have. They've just perfected that process over the years and refined it to the nth degree, or at least Google have.
I'm sure there are a few <table> enthusiasts here who do quite well with their layouts. I think that is a prime example of how well the bots have progressed in interpreting the meaning of the content. But, they can only "guess" so much. Why leave that to chance?
Realistically I don't see why the search engines would impose any kind of explicit benefit or penalty due to your markup. They are just returning content based upon a query. This however, is just speculation.
You certainly could suffer from an implicit penalty due to the spiders not understanding or not being able to find your content. However both semantic and table-based sites can run into this same problem if the coder didn't know what he was doing. I find it hard to believe (and have yet to see) that nesting a few tables is going to affect your rankings.
Anecdotally I own both table-based and semantics-based websites, and I'm not seeing any perceivable difference when it comes to the SERPS, crawl rate, etc.
But.
We all know that choosing what goes in the title element and the Hn elements, among others, does affect how the site returns in the SERPs, as well as the anchor text of a link affecting that link.
So obviously the HTML does affect how the page is parsed in some way.
I'm now on the fence on this one guys - without some testing that i am too busy to do we can't know what the engines are doing with the HTML. :(
(But I still don't think that tables that linearise properly will be a problem for the crawl)
Edit: trivial spelling
[edited by: leadegroot at 10:04 pm (utc) on Jan. 5, 2009]
How Do Search Engine Robots Work?
[webmasterworld.com...]
search engines consist of five discrete software components:
1. Spider : a robotic browser like program that downloads webpages.
2. Crawler : a wandering spider that automatically follows links found on pages.
3. Indexer : a blender like program that dissects webpages that are downloaded by spiders.
4. The Database : a warehouse of the pages downloaded and processed.
5. search engine Results engine : digs search results out of the database
I would think it is the Indexer that handles the semantic side of things?
I would think it is the Indexer that handles the semantic side of things?
Yes, that is my understanding.
So if we accept this logic of how-the-bots-handle-pages, then a table can't slow down the crawl, beyond page bloat making the page bigger, because it is just markup, and the spider doesn't care. It is the Indexer, conceptually, that could be slowed down by a table. But IMHO it would have to be really broken or mega huge before it was an issue. (I know it is a common opinion that broken html (incorrectly nested tags, etc) can limit the engine's ability to list the page.)
I think that if eg Google had found table based layout were an issue then we would see semantic markup on their pages. We don't, its all old school table based HTML.