Entity-oriented search: The evolution of information retrieval, explained - SEO<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Entity-oriented search: The evolution of information retrieval, explained - SEO\" \/>\n<meta property=\"og:description\" content=\"We rarely stop to think about the lightning speed of modern information access. Try picturing a time when answers lived only in libraries – it seems archaic now. Search tools have become so powerful that they grasp the meaning behind your questions, not just the individual words. This capability is the result of an evolution […]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/\" \/>\n<meta property=\"og:site_name\" content=\"SEO\" \/>\n<meta property=\"article:published_time\" content=\"2024-05-13T12:00:00+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Inverted-index-Useful-application.png\" \/>\n<meta name=\"author\" content=\"Fox News\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Fox News\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"15 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/\",\"url\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/\",\"name\":\"Entity-oriented search: The evolution of information retrieval, explained - SEO\",\"isPartOf\":{\"@id\":\"https:\/\/cherylroll.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Entity-oriented-search-The-evolution-of-information-retrieval-explained-800x450.png\",\"datePublished\":\"2024-05-13T12:00:00+00:00\",\"dateModified\":\"2024-05-13T12:00:00+00:00\",\"author\":{\"@id\":\"https:\/\/cherylroll.com\/#\/schema\/person\/e4ce5f7f0c831df260bde5b6992ae397\"},\"breadcrumb\":{\"@id\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#primaryimage\",\"url\":\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Entity-oriented-search-The-evolution-of-information-retrieval-explained-800x450.png\",\"contentUrl\":\"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Entity-oriented-search-The-evolution-of-information-retrieval-explained-800x450.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/cherylroll.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Entity-oriented search: The evolution of information retrieval, explained\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/cherylroll.com\/#website\",\"url\":\"https:\/\/cherylroll.com\/\",\"name\":\"SEO\",\"description\":\"News, Search Engine Optimization (SEO)\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/cherylroll.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/cherylroll.com\/#\/schema\/person\/e4ce5f7f0c831df260bde5b6992ae397\",\"name\":\"Fox News\",\"sameAs\":[\"http:\/\/cherylroll.com\"],\"url\":\"https:\/\/cherylroll.com\/author\/fox-news\/\"}]}<\/script>\n","yoast_head_json":{"title":"Entity-oriented search: The evolution of information retrieval, explained - SEO","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/","og_locale":"en_US","og_type":"article","og_title":"Entity-oriented search: The evolution of information retrieval, explained - SEO","og_description":"We rarely stop to think about the lightning speed of modern information access. Try picturing a time when answers lived only in libraries – it seems archaic now. Search tools have become so powerful that they grasp the meaning behind your questions, not just the individual words. This capability is the result of an evolution […]","og_url":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/","og_site_name":"SEO","article_published_time":"2024-05-13T12:00:00+00:00","og_image":[{"url":"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Inverted-index-Useful-application.png"}],"author":"Fox News","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Fox News","Est. reading time":"15 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/","url":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/","name":"Entity-oriented search: The evolution of information retrieval, explained - SEO","isPartOf":{"@id":"https:\/\/cherylroll.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#primaryimage"},"image":{"@id":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#primaryimage"},"thumbnailUrl":"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Entity-oriented-search-The-evolution-of-information-retrieval-explained-800x450.png","datePublished":"2024-05-13T12:00:00+00:00","dateModified":"2024-05-13T12:00:00+00:00","author":{"@id":"https:\/\/cherylroll.com\/#\/schema\/person\/e4ce5f7f0c831df260bde5b6992ae397"},"breadcrumb":{"@id":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#primaryimage","url":"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Entity-oriented-search-The-evolution-of-information-retrieval-explained-800x450.png","contentUrl":"https:\/\/searchengineland.com\/wp-content\/seloads\/2024\/05\/Entity-oriented-search-The-evolution-of-information-retrieval-explained-800x450.png"},{"@type":"BreadcrumbList","@id":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/cherylroll.com\/"},{"@type":"ListItem","position":2,"name":"Entity-oriented search: The evolution of information retrieval, explained"}]},{"@type":"WebSite","@id":"https:\/\/cherylroll.com\/#website","url":"https:\/\/cherylroll.com\/","name":"SEO","description":"News, Search Engine Optimization (SEO)","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/cherylroll.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/cherylroll.com\/#\/schema\/person\/e4ce5f7f0c831df260bde5b6992ae397","name":"Fox News","sameAs":["http:\/\/cherylroll.com"],"url":"https:\/\/cherylroll.com\/author\/fox-news\/"}]}},"_links":{"self":[{"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/posts\/337","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/comments?post=337"}],"version-history":[{"count":0,"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/posts\/337\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/posts\/337"}],"wp:attachment":[{"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/media?parent=337"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/categories?post=337"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cherylroll.com\/wp-json\/wp\/v2\/tags?post=337"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}

{"id":337,"date":"2024-05-13T12:00:00","date_gmt":"2024-05-13T12:00:00","guid":{"rendered":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/"},"modified":"2024-05-13T12:00:00","modified_gmt":"2024-05-13T12:00:00","slug":"entity-oriented-search-the-evolution-of-information-retrieval-explained-440395","status":"publish","type":"post","link":"https:\/\/cherylroll.com\/entity-oriented-search-the-evolution-of-information-retrieval-explained-440395\/","title":{"rendered":"Entity-oriented search: The evolution of information retrieval, explained"},"content":{"rendered":"

We rarely stop to think about the lightning speed of modern information access. Try picturing a time when answers lived only in libraries – it seems archaic now. <\/p>\n

Search tools have become so powerful that they grasp the meaning behind your questions, not just the individual words. This capability is the result of an evolution from keyword to entity-oriented search. While it may seem complex, today we are going to break it down.<\/p>\n

Think of a simplified world where websites are replaced by books, and answers are found by a team of 1 million dedicated workers. This analogy will help us understand the systems powering entity search, giving you a newfound appreciation for the speed and accuracy we enjoy today.<\/p>\n

Through this exercise, you’ll understand:<\/p>\n

Why search engines started using entities<\/strong>: What problems did they solve?<\/li>\n
The inner workings of a knowledge graph<\/strong>: How does a search engine populate and use information from the knowledge graph? How can this augment your search results?<\/li>\n
How can topical authority further augment returned results? <\/strong><\/li>\n
Practical SEO strategies<\/strong>: How to optimize your content for this new landscape.<\/li>\n<\/ul>\n
Let’s build an entity-based search engine: Your library<\/h2>\n
Imagine you are responsible for a vast library with thousands of books and access to a million diligent workers. Unlike in a normal library, customers want answers to their questions and are not looking for books to read from front to back. <\/p>\n
Customers constantly approach with questions (queries), eager for answers. Your mission is to find the information they need as quickly as possible. <\/p>\n
For your library to be successful, you’ll need to return better answers that save customers time than other libraries. <\/p>\n
Version 1 of your library: Returning based on titles<\/h3>\n
Let’s imagine someone asks, “how fast is the fastest animal”?<\/p>\n
If you were a traditional library you’d begin by scanning titles, hoping for a similarity match. The customer would likely receive a stack of books and it would be their job to read through the books and try to find the answer. <\/p>\n
This process may take hours. Not to mention, there could be better books that just don’t get returned because their titles are too unrelated. <\/p>\n
Introducing the inverted index<\/h3>\n
You decide this process is too slow and that this might be a task for your workforce. To accelerate things, you enlist your million-strong workforce to create a comprehensive index. <\/p>\n
Instead of focusing on whole books or titles like your original index, they catalog each individual page. Each worker meticulously records every word on a page, along with its location.<\/p>\n
The result is what is called an inverted index. The structure looks like this: <\/p>\n
$\"Inverted$ $\"Inverted$ <\/figure>\n
Now, when a customer asks, “What is the fastest animal?” your team consults the index, pinpoints “fastest” and “animal,” delivering a list of relevant pages and any page that is in both lists. <\/p>\n
This mirrors a traditional search engine – we’re finding keywords, but we do not yet understand the deeper meanings. <\/p>\n
Now, the customer is getting a list of hundreds to thousands of pages that may contain the answer. This saves the customer much time as they can jump to relevant pages to hopefully find their answer. <\/p>\n
Isolating entities: Beyond keywords<\/h3>\n
Our inverted indexes were a major leap forward, saving time for both your team and customers. <\/p>\n
Word of your improved system spreads, and soon, patrons are lining up at the door. <\/p>\n
However, complaints start to arise about irrelevant results and factual errors. Striving for excellence, we recognize the need to address these concerns.<\/p>\n
Issues<\/strong><\/p>\n
A word like “apple” leads to an overwhelming response – recipes, science, you name it, are all returned. How can we address this?<\/p>\n
This is a tricky problem, and we will need to train your workforce on a few different approaches. <\/p>\n
The first approach that might make sense is to train the workforce to grasp context <\/strong>to distinguish (disambiguate) between multiple meanings of a word. For example, if “Apple” is followed by “computer” or “iPhone,” it signifies a different entity than when it’s near “pie” or “tree.” <\/p>\n
While using contextual clues is a powerful approach, it’s deceptively difficult. Your workforce needs to learn how to identify the subtle cues that reveal an entity’s true meaning within the surrounding text. This is challenging, requiring a nuanced understanding of language and subject matter expertise that machines may take years to replicate.<\/p>\n
To effectively employ context in distinguishing word meanings, we must first construct a robust foundation that empowers our workforce to reorganize the index.<\/p>\n
Here are the three steps we will achieve and discuss below: <\/p>\n
\n
The librarian’s guidebook:<\/strong> We need a clear system to help your workers understand context. They must be able to identify different meanings of the same word and file books accordingly by looking at the surrounding words. This means we need a detailed catalog of which surrounding words suggest which entities. To achieve this, we will need to start writing down surrounding words and the entities we think are associated, then compare this to the knowledge graph we build next. <\/li>\n
Charting the collection: <\/strong>A visual map of these entities and their relationships will be invaluable. Your workers will use this chart to make connections, improving the quality of the books they suggest to patrons. By identifying an entity and traversing its attributes, we can use this information later to augment our whole process. <\/li>\n
Reorganizing the shelves:<\/strong> Lastly, once we have a knowledge graph, a detailed map of which surrounding words give clues to an entity’s identity, we will need to revamp your library and index. Instead of only relying on traditional terms, we’ll group books by “entities” – the key people, places, things and ideas they discuss.<\/li>\n<\/ul>\n
Step 1: Building the guidebook<\/h2>\n
Your workforce will be trained on the following three steps to help build clues as to which entity is used in the text: <\/p>\n
\n
Surrounding words:<\/strong> Just as search engines analyze nearby words, your workforce will look at the sentences around “apple.” Is it similar to words like “pie,” “baking,” or “recipe”? This suggests the culinary apple.<\/li>\n
Book genre:<\/strong> The book’s overall category offers powerful clues. If it’s a history textbook, “apple” might refer to a historical figure (like Isaac Newton and his apple-inspired discovery). In a science fiction novel, it could even be a futuristic planet!<\/li>\n
Sentence structure:<\/strong> The workforce will learn to pay attention to how “apple” is used. Is it a noun (“The apple fell.”) or an adjective (“Her cheeks were apple-red.”)? This helps them distinguish between the fruit and other meanings.<\/li>\n<\/ul>\n
Over time, these observations form the foundation of your guidebook. It could include:<\/p>\n
\n
A list of words with multiple meanings, like “apple.”<\/li>\n
Common phrases and contexts that signal a specific meaning (e.g., “apple pie” = food).<\/li>\n
Links to subject-specific dictionaries for in-depth research.<\/li>\n<\/ul>\n
Just like search engines, this system isn’t perfect. The workforce will still encounter ambiguity, but the guidebook dramatically increases their ability to identify the correct entity based on context. <\/p>\n
This guidebook can then be used to identify new entities and link existing text to pre-existing entities (called entity-linking). <\/p>\n
Step 2: Creating a knowledge base (hint: we won’t build this from scratch) <\/em><\/h2>\n
Embracing existing knowledge<\/h3>\n
Building a comprehensive knowledge base from scratch would be a mammoth task. Fortunately, resources like encyclopedias provide a valuable foundation. <\/p>\n
Just like Google, we can leverage existing knowledge sources like DBpedia. DBpedia offers well-structured categories and attributes (think of these as specialized tags), giving us a head start in organizing your library’s knowledge.<\/p>\n
A key decision to make about your knowledge graph is what are the ontologies. We will try to develop ontologies that correspond to the types of queries we see coming into your library. <\/p>\n
$\"Ontologies$ $\"Ontologies$ <\/figure>\n
Entity linking: The art of connection<\/h3>\n
Next, your tireless workers must transform raw, unstructured information, such as the words on a page into linked knowledge. They’ll re-analyze the library’s books and incoming content, using contextual clues to identify and connect entities to DBpedia’s structure.<\/p>\n
Example<\/strong>:<\/em> Let’s say a page describes a cheetah’s incredible running speed. Your workers might: <\/p>\n
\n
Recognize “cheetah” as an entity of type “animal.”<\/li>\n
Link it to DBpedia’s cheetah entry, enriching it with its scientific name, habitat information, etc.<\/li>\n
Create a “top speed” attribute, assigning the value found on the page.<\/li>\n<\/ul>\n
Let’s quickly go through an example of the entity linking process: <\/p>\n
$\"Entity$ $\"Entity$ <\/figure>\n
Step 3: The knowledge graph takes shape<\/h2>\n
Each entity and relationship your team identifies becomes a node and edge in your growing knowledge graph – a visual map of connected information! <\/p>\n
This structured format allows us to move beyond simple keyword matching and truly understand the meaning behind text. With the knowledge graph, we can augment our index with entities, not just terms. <\/p>\n
Unlike plain text, entities have rich attributes associated with them. This deeper understanding will empower us to analyze unstructured text more effectively, interpret user queries more accurately, and provide highly relevant answers.<\/p>\n
Get the daily newsletter search marketers rely on.<\/p>\n
\t\t\t\t\t\t\tBusiness email address<\/label><\/p>\n
\t\t\t\t\t\t\t