If there's one thing that has driven me nuts over the last 6 months it's the non-stop chatter and search...
Algorithm Updates vs Manual Penalties...
3 Quick Fixes to Enterprise-Level...
Link Pruning - is it...
In this video interview link building expert Jim Boykin explains[…]
Join Nick Mihailovski and Ikai Lan from the Analytics and[…]
Could you please give details on what should be included[…]
| Query Deserves Freshness and Other Temporal Tales |
| Written by David Harry | |
| Wednesday, 27 October 2010 13:03 | |
A Google 'History' LessonIf there is one area that all SEOs really should know about, it's about how search engines deal with 'document freshness'. In the world of Google, we'd know this in terms of real time/social search and of course, the not so often discussed, query deserves freshness, (a.ka. the QDF). We all know the interest in timely content in the SERPs, but does it end there? Is it all about first to market? Not at all... so let's take a close look. While going through the latest patent awards recently, I came across a familiar Google offering; System and methods for determining document freshness It was interesting as it was previously filed/awarded and this was it's second appearance on the radar. Previously we looked at it's sister document, also a re-release, back in 2008. It was curious that these both had been re-worked. Ok, fine, maybe that part is only interesting to me. A distinct possibility. We can get back to that later. This became more interesting in that near completion of this post, Ol Matt mentioned one of the series in a GWC video. (He worked on one of the patents in this line - links at the end);
How temporal data plays into searchAt first glance I am sure you might be thinking, ooooooh this is gonna be about Twitter and realtime and Caffeine right? But that's not the real deal here. Yes, they have been adapted, but this ride started in 2004, well before the current social onslaught. Let us first take a refresher in what types of data they might be looking for. Possible temporal data collected can include;
What is important to note is that beyond the simple historical publication knowledge, they might also use these types of signals to better understand links, fight webspam and even for personalization. It is certainly no one trick pony. As an example, let us consider #3 and #9 from above, (Query & Ranking data). For one potential scoring element, we might look at the query history of a term, the ranking history for your page is that space and related click data. We might then ask;
But Dave, WAIT! - I hear you say? That this model of simple collection is totally spammable? Most certainly. Maybe we latch on #4, (link based) and cross reference some data to ensure it's a good match. Obviously link velocity and decay would tell a story as well. You see where we're headed here? The main point is to always consider how a group of signals such as these might interact together. It also highlights how fast, with so many systems, one can get up to the mystical 200+ (or so? or 300? meh) ranking factors at Google.
Determining freshness of a pageWhich of course brings us back to the recent (re) award. How exactly does Google try and determine what is an isn't new out there? To expand on Bill's original round up, items of interest in determining freshness include;
As with any approach we have to consider other elements and inherent scoring mechanisms that may be in play. From what we can see here, there is certainly a likely combination of discovery, crawling (internal site equity), citations and link (velocity). Keep in mind, this is about temporal data. This is about discovery. It doesn't ensure a 'fresh' page is indexed, never mind it ranking for anything of meaning.
The Social ConnectionAnd of course we're all left to wonder; how does social play into this? Right? Of course. We live in a world of wonders (for the old dog SEOs) where getting the content out there is easier than ever. Thanks to the social web. If you're looking to break the latest news, be first to market with a product, launch a new service, social can rock. What's great about it all is these sites often have some pretty good link equity (Twitter = lots of links in + very few going out). This of course means they are heavily crawled as fresh content + equity means minty fresh crawl rates. But not only can we increase the temporal discovery rates we can also develop (non link) citation velocity and even hit aspects related to user behaviour. Yes, in theory, this system can use the Google Social Graph to watch various user type interactions for search personalization. There is even mention of explicit feedback with 'user generated data', (bookmarks etc..). Personalization is an interesting potential use for this... Anyway, yes, there is certainly some ways it can be used for social (makes you wonder why Google's realtime search isn't better... huh..)
Other handy SEO tipsAt the end of the day, we want to learn something from all of this. Before we get to some tips, let us consider again the QDF. It is important to understand that some query types/spaces are going to react differently than others. Sometimes it can be obvious (historical documents, past events etc..) that haven't had fresh activity. Other times, not so much. This will be part of the art of SEO. Getting intimate with a query space to understand the dynamic. Some spaces are more prone to the QDF than others – know the differences. Beyond that, here are some simple take-away items from this or any related approach;
More than meets the eyeAs always, I am passing along these tidbits to get the gears in motion. To help SEOs think beyond myopic thinking of links, links, links.. Yes, this area as with most, has link relations, but there one can see the rest of the forest as well. Take all that we can learn from this and put it back into the SEO stew. A deeper understanding of temproal elements in search can only help you build smarter programs. I hope you enjoyed the ride... leave any questions/thoughts in the comments and we can continue the dialogue. Past Google Patents of Interest;
More articles by this author | |
| Last Updated on Thursday, 28 October 2010 13:40 |
Home - all the latest on SNC
SEO - our collection of SEO articles
Technical SEO - for the geeks
Latest News - latest news in search
Analytics - measure up and convert
RSS Rack - feeds from around the industry
Search - looking for something specific?
Authors - Author Login
SEO Training - Our sister site
Contact Us - get in touch with SNC
| Digital Marketing Weekly - Issue 4 Linked to last week's Google I/O event in San Francisco there is a lot of new features and updates [ ... ] | Playing with Google Conversational Search Earlier today Danny Sullivan, via SEL, was writing about Google Conversational search, which was an [ ... ] |
Comments
I am going to read that a couple times more.
Could you give a little more info on this point:
"Enable syndication channels – did we mention social? Well, let's go beyond that. At the core of social is the ability to have feeds. This is by and large the way of the web these days, so be sure to spread the word by getting your feeds in strong locales. This will aid in discovery, citations and of course, the potential for QDF inclusion. Push baby...PUSH!"
Do you mean promotion of your blog RSS feed, twitter feed etc ? If so, is it just submitting them to aggregator type sites for better discovery.
As always, mind blowing post Dave.
Back to the post. Need to read it a couple more times. Really great information.
Just on feeds. When discussing syndication of these. Are you just talking about blog rss feed, twitter feed etc and submitting these to aggregator type of sites for better discovery and getting content out there quick ?
The quicker the link gets out there the quicker the index process is. If you have fresh relevant content you need to get it indexed fast. Also if you have old quality content you can give it a boost by renewing parts of it. Am I wrong?
Lately I can see that the trends are to revive older posts and pages and serve it as fresh content although it's not. Relating this trend with the time metrics we can conclude that it's a very good method to build trust, to tell the search engines: ”Yo! This piece of content was relevant then, is relevant now. It's universal”.
Is this an argument for:
-Removing dates from URLs (seems to me that there's more to gain from dates in URLs then removing them).
-Refreshing old content (does updating old content really make it fresh or just manipulated).
RSS feed for comments to this post