With all the current chatter about LSI and LSA I decided to re-emphasize something I dropped in a forum years ago : GO words. Not the Go Words defined here, but the ones I defined for myself when first analyzing Google’s love of semantic analysis. Sorry for re-using what appears to be already defined in the search world, but I didn’t know, and I was contrasting “stop” words which also seem to be named on top of an existing definition. (I didn’t say it would be easy).
Anyway, in SEO “stop words” were words which, if they appeared on a page, would cause that page to *not* appear in Google (either for the desired search or for all searches). A stop word was something like “rape”, which triggered some sort of filter and wreaked havoc with search inclusion back then. In those older days of SEO there were clear cases of unrecognized “stop words” causing pages to be dropped, and we SEOs found ‘em and removed them. There were secret lists of stop words, allusions to secret lists of stop words, and all sorts of miscshpellingsh of stop words in order to keep the concept in the on-page text without tripping the
censors filters. Sometimes stop words applied to certain queries, where the presence or absense of the word influenced whether or not that particular page “qualified” for ranking for a specific query. It wasn’t semantic analysis but censorship filtering back then. Today, these stop words would be considered either hard coded filter criteria or theme triggers that trip semantic set dynamics such that whatever LSA Google is doing in the algo, it is influenced by the stop word. You can see hard-coded stop words in action today with AdSense, with a minimal amount of effort (no, I don’t think the search engine and AdSense censor the same ways).
Once Google disclosed the tilde operator, we could play around inside Google’s synonym engine and that is where I was investigating stop words when I discovered “go” words. Go words (to me, at that time) were words which, if added to your page, caused it to rank for a query or thematic set of queries. I’m not talking about keywords specifically, but related words. Page without it, rank #30. Add the “go word”, and it rises to #3. Repeatable; testable. Go words existed and when you found them and included them, you were rewarded.
Because I’m not doing much real “work” to write this post I don’t have any “go words” to show you. I won’t reveal those I work with currently, and since I have many years in the niche markets I work, it is probably true that I still use most that I know about. In other words, I don’t have any throw-away competitive advantages to give you. I will say that it’s not too hard to find them, especially is you have SEO experience in your niche. They key IMHO is to know that they are out there, so that you can test without wasting your time. That is what I am offering here. No, I don’t mean to suggest that Google synonyms are “go” words. Synonyms are great for working with sets, finding overlaps, and testing pages against the current Google search index “corpus”, but in my experience Go words are rare and not simply threshold-triggering synonyms. When you find some you can test that fairly easily and see if you agree.
Now are “go words” hard-coded, filter triggers, or do they merely tip the scales of LSA-like algorithmic features? My experience is they are hard-coded, because a few very specific instances are just simply amazing to witness. However, I really can’t tell a badly tuned algorithmic dependency (a.k.a. “sensitivity”) from a filter or filter threshold setting… nobody but Google can tell you those details. My view is they are truly “necessary yet not sufficient” conditions for ranking at least in some cases. I would expect that as LSA etc. matures within Google, such things will go away. That will happen slowly.
It is refreshing to find a black/white “signal” like this in Google these days. Everything has become so graduated, when you find something with binary-like impact on the algo it is fun to exploit. When hunting, keep in mind that it really doesn’t matter if you find a set overlap threshold you can cross with 4 specific words, or a hard-coded trigger tripped by a single word: you are after the effect - put in, rank, take out, lose rank. Don’t get academic and miss the benefit.