Forum Moderators: Robert Charlton & goodroi
I was wondering what do you think about blogs and WordPress. As you know wordpress can have categories in which it'll show certian posts.
So now I can have 3 categories: A,B,C and then make a post which will be posted in all 3 cats...it'll show in each category, as well as on main page and in archives. As you can see there are many places on the site where that certian post shows.
What do you think, is this duplicate content, or not? How does Google treat such a behaviour?
Any clues?
THanks,
Manca
Could someone please tell me what would happen if I just removed the links to the monthly archives from my sidebar template file? (which im thinking of doing anyway)
If there are no links pointing to these archives, will the SEs never be able to see them?
My mistake was what I posted on webmasterworld ...
I have the correct no index follow on my site. Google still shows not even one page or post from my blog. Only the homepage, what I find so darn strange is that I am ranked number one for a phrase with more than 30 million results, a phrase that other site owners also use from thier own site name.
I used to have about 150 posts/pages indexed in google that was bringing me about 2k in visitors during the halloween season.
?
Brian
.
Well I now have an interesting situation to look at too:
A perfectly indexed 160-page site, suddenly shows up with only 120 pages indexed. The pages were all updated 3 weeks ago, wondering if some error was introduced back then.
I wonder if the major bots will understand and follow the rule below
Disallow: /*/trackback/
I know in their guidelines they say the following works
Disallow: /*?
But if you use that and try to use the page removal page it throws an error.
Any thoughts?
[edited by: JeremyL at 4:55 am (utc) on Oct. 15, 2006]
g1smd, thats a good point, thanks. This stuff is all very new to me.
So can you tell me if I need to have a robots.txt file to add all these noindex codes? I think it all goes in the header.php right? I dont even know if I have a robots.txt file... where would I look?
Also, what would happen if I receive links to an index page (other than home page) or a category page, and I'm using noindex for category and index pages? Would there still be any link juice from those links?
Are most netizens (who aren't all website owners) savvy enough to use the permalink when linking to a post instead of linking to an index or category page?
Now what happens with pagerank in the case of redirects.
For example, I'm using the Permalink Redirect Wordpress Plugin, which redirect URLs without a trailing slash to URLs with the trailing slash.
If somebody links to the URL without the trailing slash, is the pagerank passed through to the redirected URL?
Here's what I've done so far:
yes-www plugin: redirect non-www to www
Permalink Redirect Plugin: redirects no trailing slash to trailing slash.
I dont know which plugin is doing this (probably second one) but index.php is redirecting to root
and posts and pages with /trackback/ at the end are redirected to the post's or page's permalink.
-------------
Now can someone please tell me if I added this code together correctly (it looks funny to me):
<?php if ( is_home() ) {?>
<?php if ( $paged < 2 ) {?>
<?php if(is_home() ¦¦ is_single() ¦¦ is_page()){
echo ‘<meta name="robots" content="index,follow">’;
} else {
echo ‘<meta name="robots" content="noindex,follow">’;
}?>
<?php }?>
<?php }?>
---------------
And what exacly will this code do?
Will it only allow indexing of the first index page (which is also the home page), as well as individual posts (permalinks), and individual pages (different than index pages)?
And will it dissallow indexing of all category pages, all index pages beyond page 1, all date archive pages, all feed pages (such as anything with /feed/ added at the end), and basically anything besides what is specifically allowed above?
(btw, i've gotta study for exams now so sorry if i cant get back for a while)
is_home()
When the main page is being displayed.
is_single()
When any single Post page is being displayed.
is_page()
When any Page is being displayed.
is_category()
When any Category archive page is being displayed.
So <?php if(is_home() ¦¦ is_single() ¦¦ is_page()){
echo ‘<meta name="robots" content="index,follow">’;
Will add <meta name="robots" content="index,follow"> into your header.
You can configure it as you want. Make sure that you replace "¦¦" and "‘".
ANd that's basically it. If you do all the listed above you will definitely have a great wp blog w/o any single supp. page in google index, I bet!
Hope it helps...and have a great BLOGGING, cuz blogging rocks ;)
Manca
<?php if ( $paged >1 ) {
echo '<meta name="robots" content="noindex,follow"/>';
}?>
<?php if (is_search() ) {
echo '<meta name="robots" content="noindex,follow"/>';
}?>
<?php if (is_archive() ) {
echo '<meta name="robots" content="noindex,follow"/>';
}?>
<?php if (is_trackback() ) {
echo '<meta name="robots" content="noindex,follow"/>';
}?>
Hope Google will pick it up without errors.
[edited by: Kangol at 5:00 pm (utc) on Oct. 19, 2006]
In August I lost many of the pages on the Google index, along with lots of other people, apparently. I had about 90 pages indexed and it went down gradually to 4.
It has recovered twice and then disappeared twice - at the moment only four pages are listed, my home page and three random others.
My traffic has dropped a lot, obviously, and I'm tempted to start listing more posts on the home page, so that there are more potential hits for search engines.
This has the unwelcome side-effect of meaning that more content is displayed in two places, in the categories as well as on the main page.
It looks like a choice between few posts on the main page and almost zero Google hits until the indexing issue recovers, or many stories on the main page with the attendant risk of duplicate content penalties.
Any ideas?
I remember having a dialog with Adam of Google at one point and him alluding to Google being aware of how wordpress works and posts can occur under more than one URL.
I've even got my blog configured to show thousands of posts on the category page and I don't have problems. I do however use the "more" tag to keep the majority of the content on the page and not elsewhere.
It may be that google is aware about blogs and their whole concept but my theory is, prevention is better than cure ;)
That's all I can say about this issue. And my advice is: Just do everything to avoid duplicate content on your pages and you'll be fine with Google, I am almost 100% sure ;)
Manca
No idea how to fix this, it's 301'd for the root, but what I've done for the time being is bar Googlebot from the blog in robots.txt altogether, so now the right pages are back in and ranking as they were, but the Toolbar now shows PR0 for the homepage. No problem with the other engines, just Google.