Latent semantic indexing (LSI) is an indexing and information retrieval method used to identify patterns in the relationships between terms and concepts.

With LSI, a mathematical technique is used to find semantically related terms within a collection of text (an index) where those relationships might otherwise be hidden (or latent).

And in that context, this sounds like it could be super important for SEO.

Right?

After all, Google is a massive index of information, and we’re hearing all kinds of things about semantic search and the importance of relevance in the search ranking algorithm.

If you’ve heard rumblings about latent semantic indexing in SEO or been advised to use LSI keywords, you aren’t alone.

But will LSI actually help improve your search rankings? Let’s take a look.

The Claim: Latent Semantic Indexing As A Ranking Factor

The claim is simple: Optimizing web content using LSI keywords helps Google better understand it and you’ll be rewarded with higher rankings.

Backlinko defines LSI keywords in this way:

“LSI (Latent Semantic Indexing) Keywords are conceptually related terms that search engines use to deeply understand content on a webpage.”

By using contextually related terms, you can deepen Google’s understanding of your content. Or so the story goes.

That resource goes on to make some pretty compelling arguments for LSI keywords:

  • Google relies on LSI keywords to understand content at such a deep level.”
  • LSI Keywords are NOT synonyms. Instead, they’re terms that are closely tied to your target keyword.”
  • Google doesn’t ONLY bold terms that exactly match what you just searched for (in search results). They also bold words and phrases that are similar. Needless to say, these are LSI keywords that you want to sprinkle into your content.”

Does this practice of “sprinkling” terms closely related to your target keyword help improve your rankings via LSI?

The Evidence For LSI As A Ranking Factor

Relevance is identified as one of five key factors that help Google determine which result is the best answer for any given query.

As Google explains in its How Search Works resource:

“To return relevant results for your query, we first need to establish what information you’re looking forーthe intent behind your query.”

Once intent has been established:

“…algorithms analyze the content of webpages to assess whether the page contains information that might be relevant to what you are looking for.”

Google goes on to explain that the “most basic signal” of relevance is that the keywords used in the search query appear on the page. That makes sense – if you aren’t using the keywords the searcher is looking for, how could Google tell you’re the best answer?

Now, this is where some believe LSI comes into play.

If using keywords is a signal of relevance, using just the right keywords must be a stronger signal.

There are purpose-built tools dedicated to helping you find these LSI keywords, and believers in this tactic recommend using all kinds of other keyword research tactics to identify them, as well.

The Evidence Against LSI As A Ranking Factor

Google’s John Mueller has been crystal clear on this one:

“…we have no concept of LSI keywords. So that’s something you can completely ignore.”

There’s a healthy skepticism in SEO that Google may say things to lead us astray in order to protect the integrity of the algorithm. So let’s dig in here.

First, it’s important to understand what LSI is and where it came from.

Latent semantic structure emerged as a methodology for retrieving textual objects from files stored in a computer system in the late 1980s. As such, it’s an example of one of the earlier information retrieval (IR) concepts available to programmers.

As computer storage capacity improved and electronically available sets of data grew in size, it became more difficult to locate exactly what one was looking for in that collection.

Researchers described the problem they were trying to solve in a patent application filed September 15, 1988:

“Most systems still require a user or provider of information to specify explicit relationships and links between data objects or text objects, thereby making the systems tedious to use or to apply to large, heterogeneous computer information files whose content may be unfamiliar to the user.”

Keyword matching was being used in IR at the time, but its limitations were evident long before Google came along.

Too often, the words a person used to search for the information they sought were not exact matches for the words used in the indexed information.

There are two reasons for this:

  • Synonymy: the diverse range of words used to describe a single object or idea results in relevant results being missed.
  • Polysemy: the different meanings of a single word results in irrelevant results being retrieved.

These are still issues today, and you can imagine what a massive headache it is for Google.

However, the methodologies and technology Google uses to solve for relevance long ago moved on from LSI.

What LSI did was automatically create a “semantic space” for information retrieval.

As the patent explains, LSI treated this unreliability of association data as a statistical problem.

Without getting too into the weeds, these researchers essentially believed that there was a hidden underlying latent semantic structure they could tease out of word usage data.

Doing so would reveal the latent meaning and enable the system to bring back more relevant results – and only the most relevant results – even if there’s no exact keyword match.

Here’s what that LSI process actually looks like:

LSI process flow chartScreenshot by author, January 2022

And here’s the most important thing you should note about the above illustration of this methodology from the patent application: there are two separate processes happening.

First, the collection or index undergoes Latent Semantic Analysis.

Second, the query is analyzed and the already-processed index is then searched for similarities.

And that’s where the fundamental problem with LSI as a Google search ranking signal lies.

Google’s index is massive at hundreds of billions of pages, and it’s growing constantly.

Each time a user inputs a query, Google is sorting through its index in a fraction of a second to find the best answer.

Using the above methodology in the algorithm would require that Google:

  1. Recreate that semantic space using LSA across its entire index.
  2. Analyze the semantic meaning of the query.
  3. Find all similarities between the semantic meaning of the query and documents in the semantic space created from analyzing the entire index.
  4. Sort and rank those results.

That’s a gross oversimplification, but the point is that this isn’t a scalable process.

This would be super useful for small collections of information. It was helpful for surfacing relevant reports inside a company’s computerized archive of technical documentation, for example.

The patent application illustrates how LSI works using a collection of nine documents. That’s what it was designed to do. LSI is primitive in terms of computerized information retrieval.

Latent Semantic Indexing As A Ranking Factor: Our Verdict

Latent Semantic Indexing (LSI): Is It A Google Ranking Factor?

While the underlying principles of eliminating noise by determining semantic relevance have surely informed developments in search ranking since LSA/LSI was patented, LSI itself has no useful application in SEO today.

It hasn’t been ruled out completely, but there is no evidence that Google has ever used LSI to rank results. And Google definitely isn’t using LSI or LSI keywords today to rank search results.

Those who recommend using LSI keywords are latching on to a concept they don’t quite understand in an effort to explain why the ways in which words are related (or not) is important in SEO.

Relevance and intent are foundational considerations in Google’s search ranking algorithm.

Those are two of the big questions they’re trying to solve for in surfacing the best answer for any query.

Synonymy and polysemy are still major challenges.

Semantics – that is, our understanding of the various meanings of words and how they’re related – is essential in producing more relevant search results.

But LSI has nothing to do with that.


Featured Image: Paulo Bobita/Search Engine Journal



Source link

www.searchenginejournal.com

Three Key Facebook Metrics to Understand Ad Performance

Three Key Facebook Metrics to Understand Ad Performance

My fellow digital marketers – before we talk about Facebook performance metrics, please complete this short survey. Question: Why do you create new Facebook ads?A. Out of pure habit.B. Our creative team never has enough work to do.C. Because ABT – “Always Be Testing”...

14 Strategies to Promote Your Business Through PPC

14 Strategies to Promote Your Business Through PPC

Are you getting low-quality traffic through your PPC campaigns?  Are fraud clicks draining your revenue from the PPC? Is your return on investment on PPC not as expected?  Even though PPC advertising is an integral part of an effective marketing strategy, poor tactics...

Use Customer Lifetime Value to Find More of Your Best Customers

With new privacy rules continually changing the landscape of third-party data, brands are increasingly becoming more focused on understanding their current customers in order to make more sophisticated marketing decisions. One approach to this is utilizing customer...

Tips for Optimizing a Localized PPC Account

Tips for Optimizing a Localized PPC Account

Before jumping into the components of a local PPC account and why it matters, we should first define what constitutes a local PPC account. The basic definition is that it targets customers within a specific region. The strategy for localized PPC specifically involves...

How Automation Hurts Rank, And How to Fix It

Imagine you are offered an opportunity to have control of all the creative, copy, and budget in your Google Ads account (or your paid media platform of choice) put in the hands of an anonymous six-year-old user. Each day, you are allowed to tell them whether they...

Content Marketing and PPC Advertising: Better Together

Content Marketing and PPC Advertising: Better Together

While some businesses invest solely in one type of advertising and marketing, like social media, others thrive by seamlessly combining multiple strategies, like content marketing and pay-per-click (PPC) advertising. Both of these methods can give a boost to your...

Pricing Plans

MediaQuad Membership Levels

Select one of the 8 plans below that best fits your needs.

FAQs

Why wouldn't I just hire a full-time marketing team?

Great question! Hiring a full-time marketing team can be costly, with salaries and benefits easily exceeding $500,000 per year. Plus, you may not always have enough work to keep them busy, leading to wasted resources.

With MediaQuad’s subscription model, you can scale up or down as needed, ensuring you’re only paying for the services you need.

 

Is there a limit to how many requests I can have?

Once subscribed, you’re able to add as many marketing and web development requests to your queue as you’d like, and they will be delivered one by one unless you are on the Enterprise plan.

How fast will I receive my marketing deliverables?

On average, most requests are completed in just a few days. However, more complex requests can take longer.

 

Who are the marketers and developers?

MediaQuad is a team of experienced marketing and web development professionals. You’ll be working directly with our team, ensuring consistent, high-quality results.

How do I pause my subscription?

We understand you may not have enough marketing and web development work to fill up every month. That’s where pausing your subscription comes in handy. You can pause and resume your subscription as often as you need to ensure you’re only paying when you have work available for that month.

What software do you use?

We use a variety of industry-standard tools and software. If you use it, we probably have or currently use it too. Seriously, this is what we do everyday.

How do I request marketing and web development services?

MediaQuad offers a ton of flexibility in how you request services. You can request directly via our platform, share Google docs or wireframes, or even record a brief video. If it can be linked to or shared in our platform, it’s fair game.

What if I don't like the deliverable?

No problem! We’ll continue to revise the deliverable until you’re 100% satisfied.

What if I only have a single request?

That’s fine. You paid for a month’s worth of work, so don’t throw it away. Remember to submit a pause email or pause task in Trello. We’ll note how many business days you have left in your month, and you can come back when you need more marketing or web development services.

Are there any refunds if I don't like the service?

Due to the high-quality nature of our work, we do not issue refunds. However, we’re committed to ensuring your satisfaction and will work with you to address any concerns.

Need to talk first?

Schedule a call

Learn more about how MediaQuad works and how we can serve you.