Forum Moderators: Robert Charlton & goodroi
I was wondering what do you think about blogs and WordPress. As you know wordpress can have categories in which it'll show certian posts.
So now I can have 3 categories: A,B,C and then make a post which will be posted in all 3 cats...it'll show in each category, as well as on main page and in archives. As you can see there are many places on the site where that certian post shows.
What do you think, is this duplicate content, or not? How does Google treat such a behaviour?
Any clues?
THanks,
Manca
Also, the way that permalinks are set up in WordPress is critical from day one.
<IfModule mod_rewrite.c>
RewriteEngine On
RewriteBase /
RewriteCond %{HTTP_HOST} ^site\.com
RewriteRule (.*) http://www.example.com/$1 [R=301,L]
</IfModule>
However this only redirects non www to www, and doesn't add / at the end of pages... Also another problem could arise here, for real files on that domain, like wp-admin.php or wp-login.php, it wouldn't be so cool to have / at the end of them...domain.com/wp-admin.php/ is definitely a not way to do...
Thanks for great posts!
Manca
[edited by: engine at 5:10 pm (utc) on Oct. 19, 2006]
[edit reason] examplified [/edit]
Setup a mod rewrite, or index.php url system for the site first.
Make the index page feed only a summary of the last twenty posts.
Limit the home page full posts to a particular number (I choose 4), and use the 'excerpt' plugin to dictate how many charaters you will show for each summary post on the front page (I chose 120)
Use category-1, category-2 to display your category system instead of index.php.
Redirect the non www to www
Redirect index.php by it's own to root.
Noindex tag on all achives by month, week etc except per post.
Use the head meta plugin to pull the first 90-100 words from the post and use as description for each individual post.
For example, when you manage categories, track the id number associated with each category and write it down. Sports could be cat 2 for example.
Create a file named category-2.php and place it in your themes folder, and decorate / make it how you want.
By default Wordpress should use that category-2.php file instead of routing to that category via the index.php, so it gives you full control over meta tags, description and the page look for each category.
1. Redirect non www. to www
2. redirect index.php and index.php/ itself to www
root if the index.php (non mod rewrite) option is used on your site. Only rewrite those specific urls (not any other character searches are the trailing slash on index.php)
3. If early enough in the process, use template code to determine whether a page is a tag, post, category or root and write a description. Depending on some circumstances you may want to redirect the tag to root.
4. Use templating (category-1) system to create a unique page with unique content for each category in your site.
5. Many Wordpress versions use a paging system (category_name=5&paged=2). If you choose to page your category, make sure you exclude those pages by way of noindex, nofollow. I do not use paging and yet some indexed 'paged' categories I had to redirect to the main category.
6. Archives by date are usually ok, however, they use the same description in some setups that the root page uses. If this is so, try to change the meta descrip by using php on the root page and using is_date()
7. Use the head-meta-desc plugin which will create a unique description for each post based upon the number of characters you wish.
Hope this helps.
[edited by: CainIV at 6:07 am (utc) on Oct. 2, 2006]
Sticky me if interested. Works on Wordpress 2.0 only.
It's still early days, as to its effectiveness. Google is doing a jig with its indexing of the site it's used on (my blog).
I'm quietly hopeful. Otherwise, it's back to good ole HTML (and a few time-saving tools), for me!
http://www.example.com/2006/10/07/my-great-post/
http://www.example.com/my-great-post/
Would produce the same content back to the browser as:
http://www.example.com/2006/10/07/my-great-post/trackback/
http://www.example.com/my-great-post/trackback/
Most WordPress templates have a link titled "permalink" that appears below the post in the meta div that when clicked on will result in the same content.
This could also cause issues if people grab that permalink URI and use it to link to you versus the more simple URI.
In my experience, Google is more likely to recognize things like this and ignore them, not indexing the /trackback/ pages. (But certainly not always!) However Yahoo seems to grab anything it can that looks like a URI. Therefore I often take my raw server logs and filter them to identify every URI that Yahoo is trying to index. I often find examples like this where the bots are finding URIs that probably shouldn't be indexed and then I can take appropriate action.
I have thought about (read worried) the dup penalty for a while now so I was pleased to read this great thread (webmasters world should have a wordpress or blogging) category.
One of these wordpress sites is a "halloween blog" and currently ranks number one for that term on google and "ranked" also well for terms like "scary game" and "scary games". I launched this site last July, each "post" contains it's own meta description and title. However I had the same content on many of the various pages due to wordpress structure. Categories, Index, Permalink, ect ....
I set up the header.php file with the code supplied on this thread, I also thought it made more sense to have wordpress only pull the "excerpt" of posts for the catagories and index. Posting the entire post to the permalinked page.
This way if I write my own excerpt the content will be very different on a category page and the permalinked page so I set up my blog with the new structure as well.
My blog has had these changes now for about 8 or 9 days, last night google dumped every dam page, post and category from my blog. The site was pulling in a ton of traffic from google and now my traffic has dropped by probably 95% ...
I guess I will have to wait till next year to really rank well in google for my seasonal site. I also made these changes on a "Tour de france" blog.... that has ranked incredibly well now for about 6 months. Gooogle has not dumped any "blog" pages or categories from that site.
Stuff should not be this dam hard, I should be able to create unique content that people enjoy, get some inbound links and wait for ranking.
Too dam hard google ... lets go back to a simpler time....
Brian
I have doubts on how many pages Googlebot will follow one after the other in a huge established blog before getting bored.
If you step outside the webmaster box of thinking, and just look at the average wordpress sites with categories, I find it hard to believe that G would filter/penalize those sites for duplicate content.
If your wordpress blog has dropped off G, then I would look at the big picture.. Inbound links, quality of content, etc..
- added "no index, no follow to a bunch of pages
- changed the layout in sidebar (removed archive links)
- changed content on category pages from full posts to excerpts
I think I will return in due time just too bad it has happened during the middle of October ... I just cost myself a couple of thousands bucks ;-(
Thank goodness for yahoo and msn.