Banner

Follow Along

RSS Feed Join Us on Twitter On Facebook

Get Engaged

Banner

Featured Article

Getting a grip on social signals in searchGetting a grip on social signals in  searchIf there's one thing that has driven me nuts over the last 6 months it's the non-stop chatter and search...
Read More >>

Latest Comments

Latest Articles

Will Googles Agent Rank Ever Become a Ranking Factor?Will Google's Agent Rank Ever...
I've seen some interesting discussions recently on the question of whether authority (Agent Rank)...
Read More >>
Algorithm Updates vs Manual Penalties - Some People Still Don’t Get ItAlgorithm Updates vs Manual Penalties...
In the fallout of the last publicly announced (sorta) Panda update and as the...
Read More >>
3 Quick Fixes to Enterprise-Level Technical SEO3 Quick Fixes to Enterprise-Level...
As Google continues to transpose the idea and essence of the real world, physical marketplace...
Read More >>

Our Sponsors

Banner
Banner
Banner

Latest Search Videos

Join Us

Banner
Banner
What can SEOs learn from Google Suggest?
Written by David Harry
Tuesday, 06 July 2010 00:00

A journey into Google’s patent on generating suggestions

Search engines are always looking to make our lives easier, or at least accessing the world’s information in a timely manner. But in the old days they had to wait for the user to take action before they could begin to deliver potential results for a query – not these days – starting to gather search results and even implementing search assist can happen with each keystroke.

You know the one, the suggestions they make as you’re typing in a given query? It looks something like this;

Google Search Assist

I know more than a few SEO peeps have talked about this as a potential problem for some long tail targets and others have pondered if it would make a good keyword research tool. They can also use the same systems for query analysis as far as which documents to return and rank. But what if there is a potential for personalization of this data? Because there just might be; and that would certainly limit its overall effectiveness from a SEO perspective - At least as a research tool.

Which brings me to a recently assigned patent to the mighty Google;

Method and system for auto completion using ranked results – filed November 11, 2004 assigned; February 3, 2009 - Gibbs; Kevin A. (San Francisco, CA), Kamvar; Sepandar D. (Palo Alto, CA), Haveliwala; Taher H. (Mountain View, CA), Jeh; Glen M. (San Francisco, CA)

 

What? A personalized search connection?

Of course, I am nothing if not obsessed with that particular topic; so connect we must. You see, of note in this one, is the presence of our old pal; Sep Kamar (and Kaltix crew) – from our recent look at Personalized PageRank and lead tech in the PS dept. There is also another patent which is referenced in this offering from Sep;

Anticipated query generation and processing in a search engine; Filed June 22 2004 and assigned; Dec 22 2005

That patent, as best put by Bill Slawski, looks at, “returning search results quicker, and enabling personalization to make those results more relevant for the person searching.” The main crux of the systems is that they intercept the queries in a partial form and begin to process and set out probabilistic matches. Why wait for the user to actually type in a full query when you could already be processing before they hit enter?

Or as Bill put it;

“If the search engine captures keyboard strokes as they happen, and starts sending partial queries to the search engine based upon a prediction of what the searcher is looking for, it may speed up the process.”

He then, being the wise turtle that he is, mused;

“What we don’t see with Google Suggest is some of the technology used to create that query list it displays in a dropdown under the query window. We also don’t see the ability to personalize those predictive searches.”

Well it would seem the patent filing on that system would be mere months after the original, it simply took a few passes to make it out in the wild; some 3yrs+ later… And it does cite the other, has related authors; and could certainly be another link in the chain (or the thought pattern at least).

Bill has a great round up of the first one and even CJ took a stab at the latest… let’s just see what any of this might mean to you the happy optimizer…. Shall we?

 

The filter factor

In the simplest terms, a predefined set of common query types can be stored and then Google can start thinking about the results, even as you type in your search term. What’s interesting to us is how they go about that. In some ways it can give us some insight into other ranking processes. A great deal of search engineering these days is in probabilistic learning and predictive capabilities (there’ll be a test on that junk later… he he)

Triggers for the system can include;

  1. When system receives a partial query
  2. Upon completion of the query
  3. Not choosing a suggested query in a given time frame.
  4. Number of characters received
  5. Pause in query input

 

As well as filters which could be used;

  1. Time of day factors
  2. Geo-graphic filters (language, IP)
  3. Temporal and historical (query type spikes)
  4. User types grouped by search activity
  5. Categorization – ‘dog’ and ‘breed’ indicate ‘Animal > Dog’ category
  6. Based on Personalization information

That last one of course is of particular interest. There are layers of personalization everywhere with Google. And so for those of you thinking of using ‘search suggest’ as a keyword research tool, you may want to reconsider and do some testing first.

As you can see there are more than a few factors that seem to play into how search suggest works and even these patents are 4+ years old, so things are likely evolved from this. But let’s at least look at some ranking mechanisms, always enlightening oui?

Can Google predict your future?

 

How they might calculate suggestion ranking

For starters there are the basics of query analysis; submissions with a higher frequency would be ranked higher than terms searched less. That’s sensible. They can also use personalization in the form of search and browsing history. This could also be done on a more granular level as far as using the data from that current search session; although in the official FAQ they state they don’t, “base its suggestions on your personal search history” – which was later updated with the fact they do, in some cases, “log data, like IP addresses, in order to monitor and improve the service

Take that for what U will…

From there they look at using layers of ranking criteria such as first ranking via popularity score and then re-ranking based on secondary information. The potential query suggestions that score well on the given factors, is displayed first and so on down the line.

Also of interest is that they do talk about adaptive ranking schemes.

Let’s say we were ranking our predictions with a heavy lean towards the personalized data. If the user doesn’t choose from the query predictions, then we would potential take user group data as dominant next time or maybe geo-graphic factors. Look at the filters above and use your imagination as to the variations available.

They may even take this into account for ranking busier query spaces in as much as a potential result must satisfy one or more criteria to be considered in the seed set for ranking.

 

Order from the chaos

I know… so friggen what? Well, it's an interesting glimpse, once again, at the ways search engineers think. When your job is to rank documents in search engines, it's good to have some insight. At the end of the day, it seems to me that we can take a few things away from it all;

  • Keyword research - As we noted early on, I’d be wary of using search suggest for more than anecdotal data.
  • Themes and concepts – if you have important concepts, geo-graphic attributes or categorizations related to your query space, make them clear. Ranking and recommendation concepts relating to filters and personalization are best fed by being rock solid on you theme development and targeting.
  • Trend setter – being fast on current news and using generic page titles, that satisfy the query space, is potentially much like the QDF. A lack of links means temporal signals would be important and having more generic targeting early one seems sensible.
  • Rank and stick – remember that the suggestions are part of a process already ranking known queries; you can’t win if you don’t play. Having the money terms would be important to get the extra bump. Being sticky, having strong user engagement, will help with the personalization.

 

And there we have it, a ride inside the world of Google search suggest. As always, we must take patents with a grain of salt. This was written in 2004 and simply was assigned recently. You should be more thinking about the mindset; it seems we keep running into Sep a lot these days (people are starting to whisper).

The next time you see the drop down with search suggest, think of me won’t U?

/end

 

More reading

From Google; Features: Google Suggest - Google Sugest – FAQ - At a loss for words?

Google Suggest Dissected - Server-side Guy
Hacking Google suggest - Adam Stiles

How Google suggest changes SEO - Solo SEO
Google.com finally gets Google suggest– Search Engine Land


Other facts; a few notable parts that I didn’t get to…

Connection devices – mobile searches tend to be different as far as the length of queries and thus they would potentially establish which type of connection device you were using to determine the search suggestions.

Storage trade off – they discuss how there is a trade off in the amount of data that can be stored and the accuracy. Obviously the less data you retain, the weaker the predictions. This is interesting because all too often peeps for get the cost of processing and storage.

Anti-spoofing aka Spamming – another interesting tidbit, if not for the name alone, is the process for detecting artificially generated queries. Sadly, this was only defined by tagging multiple queries from the same user or client computer. It’s sort of lightweight there IMHO. But I like the name…

 

Now go and play with it… make a tool even… like Aaron did - …and that’s all she wrote… l8tr

David Harry -

Hi my name is Dave and I, am an algo-holic

I am an avid search geek that spends most of his time reading about and playing with search engines. My main passion has always been about the technical side of things from a strong perspective rooted in IR and related technologies.You can find me providing SEO consulting services for Verve Developments.

You can also hook up with me via

Read More >>


More articles by this author

Google Hacks & Dorks for fun and  profitGoogle Hacks & Dorks for fun and profit
Recently someone was asking me about Google's advanced operators and...
Read More >>
Last Updated on Sunday, 17 October 2010 21:29
 

Comments  

 
0 #1 JavaGenious 2010-12-30 03:57
Thanks for the information
Quote
 
 
0 #2 nitGreen 2010-12-30 06:48
Dave,

It’s a nice piece of thought you shared with us.

hanks for taking the time to share!
Quote
 
 
0 #3 Object Synergy 2011-10-13 12:56
The old days they had to wait for the user to take action before they could begin to deliver potential results for a query – not these days .starting to gather search results and even implementing search assist can happen with each keystroke.
Quote
 

Add comment


Security code
Refresh

Getting Around the Site

Home - all the latest on SNC
SEO - our collection of SEO articles
Technical SEO - for the geeks
Latest News - latest news in search
Analytics - measure up and convert
RSS Rack - feeds from around the industry
Search - looking for something specific?
Authors - Author Login
SEO Training - Our sister site
Contact Us - get in touch with SNC

What's New?