Panda-Proof Content Audit for Your Site ~ 6 steps to make sure your content matches Google’s quality criteria
First, the news broke out that Google Panda (a special “filter” designed to de-rank low quality content) is now a part of the search engine’s core ranking algorithm and Panda updates will probably start to roll out faster and more regularly.
Second, over the past two weeks, we’ve seen some major changes in Google’s search results. And though confirmed to be unrelated to Panda, these massive SERP turbulences also seem to be in connection with content quality.
It looks like in 2016, the “quality” of your content is not just an empty word, but something that you NEED to optimize your site for. And today we’ll dive deeper into what the Panda quality algorithm is and how to run a thorough Panda-proof content audit.
What’s Panda & how does it affect SERPs?
The Panda algorithm (named after Google engineer Navneet Panda) is designed to help Google improve the quality of search results by down-ranking low quality content.
The basic principle here is that Google assigns a particular quality score to each website in its index (the score is assigned site-wide, not to separate pages.)
Initially, Panda functioned as a filter applied to a pack of search results that Google considered relevant to a search query. The Panda score was re-ordering them, pushing down the low-scorers, and giving a boost to the highest scored content.
Now, as Panda signals are “baked” into Google’s core ranking algorithm, they no longer re-order the results, but form them together with other Google’s ranking signals.
How can Panda identify high quality content?
Sure thing, there’s no “gut feeling” that helps Panda identify real quality. Panda is only an algorithm that checks your website for a number of factors that Google assume are typical of a high quality website. Then, by applying some math, it gives the site a specific quality score based on the results of this check.
The good news is, if your site’s quality score is based on a number of separate factors, you can influence those factors to improve the score.
The bad news is… Google won’t disclose the exact quality factors it takes into account to calculate the score. So the list of Panda-prone issues below is an educated guess, based on what Google has said on site quality, and what trackable factors it can use to determine it.
What are Panda roll-outs & will they get more frequent?
Sure thing, a quality score is not something you can assign to a website once and for all. Websites change, content gets added and removed, and, clearly, the score needs to be updated every now and then to stay relevant.
The optimal way to deal with problematic content largely depends on the size of your site.
- For a small website (<100 pages), removing low quality content is something you can’t afford. Your key strategy is to improve on every problematic page, rather than delete it.
- For a medium-sized site (100-1000 pages), removing some of the low quality content is possible. But your main focus will be on improving content at least for the most important pages.
- For a large website (>1000 pages), improving all the problematic areas is a huge piece of work, so your focus would be to “weed out” and remove the unnecessary and low quality content.
Step 2. Check for thin content
Imagine you have a category page with only a few lines of meaningless text and hundreds of links to products. This is what’s generally called thin content.
Search engines use content to determine the relevancy of a page for a query. And if you barely provide any information that’s accessible to them, how are they to understand what the page is about?
Refreshing the score non-stop in real time would take too much computing power — that is why up till now Google was launching “Panda updates” once in a couple of months,recalculating website quality scores, and thus changing the filter they apply to the search results.
Will that change now that the Panda quality score is a part of Google’s core ranking algorithm? It doesn’t seem so. The score is still not being calculated in real time; the change only means that the algorithm got more solid and doesn’t need to be modified any further. Thus Google won’t have to test and apply new signals (it’ll only re-apply older ones), so it will be possible to run Panda updates faster and more frequently.
Surely, quality is not all about the word count, since there are cases when you can deliver value in a few hundred words. That is why there’s no “minimum word count” threshold that triggers a low Panda quality score. More to that, sometimes pages with a little over a hundred words do exceptionally well on Google and even get included into its rich answers.
But having too many thin content pages will very likely get you into trouble — so on average,word count under 250 words is a good indicator to locate problematic spots across your site.
Step 3. Check for duplicated/very similar content
Another factor that could be a signal of your site’s low quality is duplicated or very similar content across multiple pages.
Very often, bigger sites have to deal with a huge amount of pages that need to be filled with content. And many of them resort to an easy way to fill out those gaps — by writing boilerplate text that’s the same on each page except for a few variables. This is what Google considers automated, low quality content.
So, besides weeding out the word-by-word duplicated content, pay attention to the similar-looking pieces (say, your page titles are absolutely identical in structure and differ only in a product name) that may be a sign of content automation.
Step 4. Check for aggregated content/plagiarism
What’s also synonymous to quality in Google’s eyes is the “uniqueness” of your content. As Google wants your content to add value and not simply repeat what’s already been said, having non-unique content on your website (e.g. plagiarized content, product descriptions duplicated in feeds used for other channels like Amazon, shopping comparison sites and eBay) is an easy way to get under Google’s Panda filter.
If you suspect that some of your pages may be duplicated externally on other online resources, a good idea would be to check them with Copyscape. (http://copyscape.com/)
Copyscape gives some of its data for free (for instance, comparing two specific URLs), but for a comprehesive check you may need a paid Premium account.
Step 5. Check for proper keyword usage
Keywords and keyword targeting are the most basic and longest-running concepts in SEO. And if you’ve been in the search industry for quite some, you may remember the days when SEO meant just having the right words in your meta keywords tag.
Sure, these times have passed: search engines now try to detect and punish websites deliberately using too many keywords in their content.
However, whether Google will admit it or not, their algorithms are still built upon keywords. And having a keyword in your title tag DOES improve your page’s rankings, meaning you simply can’t afford not optimizing pages for keywords.
So, the only ticklish question here is, “How many is not too many?” And one of the ways to check this is by looking at top ranking competitors (because the sites that rank in top 10 are the sites that pass Google quality test with an A+.)
Remember the Hummingbird algorithm update? The one with which Google learned to recognize the meaning behind a search query and give a common answer to a number of “different-in-keywords” but “same-in-meaning” queries?
This update changed the way SEOs optimize pages — now we no longer think “single keyword optimization”, but try to make our pages relevant for a whole group of synonyms and related terms.
So, adding all kinds of related keywords will help you improve your pages’ rankings and avoid the keyword stuffing issues.
Step 6. Check for user engagement metrics
Though Google generally states that user experience signals are not included into their search ranking algorithm, many experiments show the opposite. And one of the metrics SEOs suspect Google to use is bounce rates.
Think about it — as Google tries to bring users the best search experience, it obviously wants them to find what they were looking for with the first search result they click on. The best search experience is one that immediately lands the searcher on a page that has all the information they need, so that they don’t hit the back button to return to the SERP and look for other alternatives.
Bouncing off pages quickly to return to the SERP and look for other results is called pogo-sticking, and it can be easily measured in terms of bounce rates.
What else to check?
- 1. Check for user-generated content issues
User-generated content and how it affects Panda has been a hot topic recently, and it has gotten to the point where many SEOs are recommending to get rid of all user-generated content, claiming that Google sees it as a signal of poor site quality.
This is far from being true, because we’re still seeing lots of websites based purely on user-generated content (think Quora) that are doing well on Google.
However, user-generated spam — for instance, irrelevant comments on your blog or poorly moderated forum pages — can put your site into trouble.
So if your website features user-generated content, make sure improving your moderation strategy is a priority.
- Check for grammar mistakes
Bad spelling and grammar can both impede user experience and lower the trustworthiness of your content in Google’s eyes, so don’t tempt the fate by leaving obvious grammar errors on your pages. The easiest way to run a spellchecker through your content is to copy the text and paste it into a word processor. This should then highlight the spelling mistakes so you can update the content.
By: Katherine Stepanova
Head of Marketing at SEO PowerSuite
This article first appeared on SEOPowerSuite’s Web site.
I strongly recommend this site to anyone wanting to better understand SEO.