Forum Moderators: Robert Charlton & goodroi
I was wondering what do you think about blogs and WordPress. As you know wordpress can have categories in which it'll show certian posts.
So now I can have 3 categories: A,B,C and then make a post which will be posted in all 3 cats...it'll show in each category, as well as on main page and in archives. As you can see there are many places on the site where that certian post shows.
What do you think, is this duplicate content, or not? How does Google treat such a behaviour?
Any clues?
THanks,
Manca
I have been following the whole discussion on duplicate content - simply not an issue, even though we often assign multiple categories to each post. Google relies heavily on what you actually link to and what anchor text you use to serve the results.
So for all of you who believe that we are in business for our readers rather than bots, relax - just create good content and life will be good.
Is there a way I can remove it via robots.txt using a wildcard? Think I heard this on a webmasterradio show but i can't recall the specifics.
Thank you.
Also note that Yahoo's slurp now also supports wild cards in the robots.txt file.
2. Do not allow catagories to get indexed.
3. Generate unique titles and meta desc for the single posts
4. Drop meta desc. from index2,index3,index4, ect.
Google will generate its own unique description based on the content.
5. Generate unique titles for index2,index3,index4, ect.
I did this and the results are rather good!
I have a wordpress blog, around a month or two old, and after a slow start it was very well indexed by googlebot and started to do very well in the serps.
However I noticed that my wp feed started to rank higher than individual posts or my home page. So to counter this I placed some disallow rules in my robots.txt....but I have just seen a big drop in the amount of pages listed in G's index for my blog.
Have I made a mistake in my robots.txt?
User-agent: *
Disallow: /wp-
Disallow: /search
Disallow: /feed
Disallow: /comments/feed
Disallow: /feed/$
Disallow: /*/feed/$
Disallow: /*/feed/rss/$
Disallow: /*/trackback/$
Disallow: /*/*/feed/$
Disallow: /*/*/feed/rss/$
Disallow: /*/*/trackback/$
Disallow: /*/*/*/feed/$
Disallow: /*/*/*/feed/rss/$
Disallow: /*/*/*/trackback/$
[edited by: tedster at 5:33 pm (utc) on Nov. 20, 2006]
To block access to all URLs that include a question mark (?), you could use the following entry:User-agent: *
Disallow: /*?*[google.com...]
Thanks, but I don't think that works with wordpress. I have one header template file for the whole site. All the pages are dynamically generated.
In my header I have this:
<?php
if (is_single() ¦¦ is_page() ¦¦ is_home()) {
echo "<meta name=\"robots\" content=\"index,follow\"/>\n";
} else {
echo "<meta name=\"robots\" content=\"noindex,follow\"/>\n";
}
?>
I want to say noindex for all the pages that are after the homepage, which are just a chronological ordering of posts as they're bumped off the homepage.
Understand your CMS: Make sure you're familiar with how content is displayed on your Web site, particularly if it includes a blog, a forum, or related system that often shows the same content in multiple formats.
Does Home.php = index.php or header.php?
I still can't get this code to work
<?php if(is_home() ¦¦ is_single() ¦¦ is_page()){
echo ‘<meta name="robots" content="index,follow">’;
} else {
echo ‘<meta name="robots" content="noindex,follow">’;
}?>
Can I paste into Header.php as is? I have read something about pipes? what do i need to change in layman terms.
< continued here: [webmasterworld.com...] >
[edited by: tedster at 5:44 am (utc) on Mar. 11, 2007]