Subscribe to this Blog

Subscribe to this Blog

Enter your email address:


Add to Technorati Favorites

About Us

Internet Marketing Blog is an effort from talented, online marketing professionals who likes to share knowledge of Internet marketing, advertising, and search engine strategies that we have gained

Search


Helpful Websites

Find2k Network

« Cyber Monday -Amazon Again Top Retail Site | Main | Google Japan Lost Trust for Paid Links »

Canonical Tag - To Fight Duplication

By Find2k | February 16, 2009


After a long time we are back on our regular blog posts and we are happy that the first news update is a “Good News” for sure.

The story line of “Canonical Tag”
All the search engines met together and talked about a common issue and that is duplicate content. They discussed the difficulties they are facing to get the right page ranked on their search listings.

They thought of a solution and they announced that their will be a common tag which will be placed in the header of the page with a canonical tag which will tell which is the main page/original page to be followed and the task of searching for a good page will reduce. The crawlers will do the page identification but it will keep the information in consideration while doing it, which will reduce the time and efforts.

Hereby Google, Yahoo and Microsoft have united to offer a way to reduce duplicate content clutter and make things easier for everyone with the help of “Canonical tag”.

Here are few more updates from the renowned Search Engine Publishing sites:

Google, Yahoo & Microsoft Unite On “Canonical Tag” To Reduce Duplicate Content Clutter
The web is full of duplicate content. Search engines try to index and display the original or “canonical” version. Searchers only want to see one version in results. And site owners worry that if search engines find multiple versions of a page, their link credit will be diluted and they’ll lose ranking.

Today, Google, Yahoo and Microsoft (links are to their separate announcements) have united to offer a way to reduce duplicate content clutter and make things easier for everyone. Webmasters rejoice! Worried about duplicate content on your site? Want to know what “canonical” means? Read on for more details.

Multiple URLs, one page
Duplicate content comes in different forms, but a major scenario is multiple URLs that point to the same page. This can come up for lots of reasons. An ecommerce site might allow various sort orders for a page (by lowest price, highest rated…), the marketing department might want tracking codes added to URLs for analytics. You could end up with 100 pages, but 10 URLs for each page. Suddenly search engines have to sort  through 1,000 URLs.

This can be a problem for a couple of reasons.

Using the new canonical tag
Specify the canonical version using a tag in the head section of the page as follows:
<link rel=”canonical” href=”http://www.example.com/product.php?item=swedish-fish”/>
That’s it!

This tag will operate in a similar way to a 301 redirect for all URLs that display the page with this tag.

Canonical URL best practices
The search engines use this as a hint, not as a directive, (Google calls it a “suggestion that we honor strongly”) but are more likely to use  it if the URLs use best practices, such as:

Can this be abused by spammers? They might try, but Matt Cutts of Google told me that the same safeguards that prevent abuse by other methods (such as redirects) are in place here as well, and that Google  reserves the right to take action on sites that are using the tag to manipulate search engines and violate search engine guidelines.

For instance, this tag will only work with very similar or identical content, so you can’t use it to send all of the link value from the less important pages of your site to the more important ones.

If tags conflict (such as pages point to each other as canonical, the URL specified as canonical redirects to a non-canonical version, or the page specified as canonical doesn’t exist), search engines will sort things out just as they do now, and will determine which URL they think is the best canonical version.

The tag in action
This tag will most often be useful in the case of multiple URLs pointing at the same page, but might also be used when multiple versions of a page exist. For instance, wikia.com is using the tag for previous revisions of a page.

Both http://watchmen.wikia.com/index.php?title=Comedian%27s_badge&diff=4901&oldid=4819 and http://watchmen.wikia.com/index.php?title=Comedian%27s_badge&diff=5401&oldid=4901, reference the latest version of the article (http://watchmen.wikia.com/wiki/Comedian%27s_badge) as the canonical.

The search engines stress that it’s still important to build good URL structure and also note that if you aren’t able to implement this tag, they’ll still keep the processes they have now to determine the canonical. For instance, at SMX West on Tuesday, Maile Ohye of Google explained how Google can detect patterns in URLs if they use standard parameters. For instance, with these URLs:

Maile explained that Google can detect (particularly when looking at patterns across the site) that the sort parameter may order the page differently, but that the URLs with the sort parameter display the same content as the shorter URL (http://www.example.com/buffy?cat=spike).

While it’s rare for the search engines to join forces, this isn’t the first time they’ve come together on a standard. In November 2006, they came together to support sitemaps.org. And in June 2008 they announced a standard set of robots.txt directives. Matt Cutts of Google and Nathan Buggia of Microsoft told me that they want to help reduce the clutter on the web, and make things easier for searchers as well as site owners.

This new tag won’t completely solve duplicate issues on the web, but it should help make things quite a bit easier particuarly for ecommerce sites, who likely need all the help they can get in the current economic conditions. Site owners have been asking for help with these issues for a really long time so this should be a greatly welcomed addition.

Fighting Duplication: Adding more arrows to your quiver

Topics: Google, Internet Marketing, Internet Marketing and MSN, New Updates on Google, Search Marketing, Yahoo |

Comments