How to Fix Canonicalization Issues For Better PageRank

The first step on the way to fix canonicalization issues is to find them. Here is what I do.

  1. Go to www.google.com
  2. Click on Advanced Search next to the text box.
  3. Enter * in the "all these words" text box.
  4. Select the drop down for Results per page and set it to 100 results.( the maximum).
  5. In the "Search within a site or domain" enter your site name to be canonicalized.
  6. Hit Advanced Search.
  7. Once you get the results, browse to the bottom of the page and click on the link which says "repeat the search with the omitted results included."

  8.  Copy as many results you can copy and paste into an Excel file.Or you may continue working on the Google page. From here on, its a bit of manual work as you find out which pages are showing up as duplicates.

I run a script on my webserver that sends me an error message every time a page is not found by Googlebot. This helps take care of any over correction of canonical issues.

You can also use Virante's duplicte content finder tool. There is a link at the bottom of this article.

Here is another way to find duplications. Go to Google Webmaster tools. click Diagnostics, then click HTML Suggestions.
On this page, you will see a list of error descriptions with the number of pages on the right side.

For example, while writing this article, I found the following error. fix canonical issues with Google Webmaster Tools


If any of the items show 2 or more pages, you have issues.

When I clicked the duplicate meta descriptions link in the above image, I got this. How to fix canonical issues
Why do you think Google thinks there are two pages?
Look closely and you will find that one page has /Articles\ArticleDisplay.aspx?...
while the other has /Articles/ArticleDisplay.aspx?...

I have canonical issues because of a "/" in the wrong direction!

Now, once you have found the pages that are duplicated, here is how you can fix the most common canonicalization issues.

  1. Remove all duplicate pages but read what you need to do prior to this.
  2. Identify the pages you are going to keep and add the following "canonical hint" to their head sections.
    <link rel="canonical" href="http://www.example.com/product.php?item=swedish-fish" />
  3. Use a 301 redirect in the header.This tells google that this page has now moved and is available at the new location.
  4. Do not use www.your homepage.com and http://yourhomepage.com. Pick one and be consistent. I like to use http://www.NobleRiver.com.
  5. Use all lowercase if possible. I happened to use First letter upper case at the beginning, so I try to stick with it.
  6. Try to find issues that are causing duplication and get rid of them.
  7. ASP.NET websites can use the global.asax BeginRequest method to check for url inconsistencies.
  8. Use the robots.txt file to block all access to the duplicate pages. The following code will disallow all access to the "oldfiles" folder.
    <meta name="robots" content="noindex,nofollow">
  9. You can request your pages be removed from the Google Index via the Crawler Access link under Site Configuration in Google WebMaster Tools. Read what you need to do prior to this. Here is how to get there.
    • Login to Google Webmasters.
    • Select your website.
    • Click Site Configuration
    • click Crawler access.
    • click Remove URL.
  10. Google Webmaster tools will allow you to adjust parameter settings. To get to this page,
    • Login to Google Webmasters.
    • Select your website.
    • Click Site Configuration
    • Click Settings.
    • Click Adjust Parameter Settings
    Here add the Parameter value you need Google to disregard and choose action = "Ignore". This will allow googlebot to strip out the extra parameters in the URL so that all pages lead to the canonical content will have the same URL.

If you are handy with ASP.NET coding, and you have an ASP.NET based website, here's how to point to your canonical page from your duplicate pages

This needs to go into the Page_Load section of your code behind page.

Response.Clear
Response.Status = "301 Moved Permanently"
Response.AddHeader "Location", www.yourdomain.com/newpageURL
Response.Flush
Response.End

Resources:


If you found this FREE article useful, please help others by sharing it!
Share this article with your friends on Facebook

Comments :

Please let us know if you liked this article. Even if you disliked it too ...
Whatever you do, don't go without leaving a note!

And do follow the house rules:
No punching, kicking or hitting below the belt. Screaming is allowed!
No links to places you don't want your mama to see!

Add your comments here:

Comment
( Sorry, no html tags in the comment box)

Name

Email(not for display )

Website

Enter your comments below.Write answer here.

Subscribe to our FREE Marketing Briefings
1. Pull ahead of your competition with "up-to-date"
Marketing Tips, Tools,
Tricks And Techniques !
2. Download FREE - Marketing Books That Have Made Millions!

First Name: *
Email Address: *

1)You'll receive an email from
Noble River Marketing;
2)Click the Confirmation Link to
get access to Members Area.



Are You Ready To Take Your Business
To The Next Level ?


Click here to Steal The #1 Secret of
The Most Successful Entrepreneurs
 


Articles on Marketing
Full List of Articles
 
Material Connection Disclosure
You should assume that the owner of this website has an affiliate relationship and/or another material connection to the providers of goods and services mentioned on this website and may be compensated when you purchase from a provider. You should always perform due diligence before buying goods or services from anyone via the Internet or offline!
Privacy Policy .:||:. Write for this Site .:||:. SiteMap
Copyright 2008, 2009, 2010 - All rights reserved.
Deep Janardhanan & Noble River Resources ™
9517 Craigs Mill Dr., Glen Allen,Va - 23060