What is LSI
(Page 1 of 3 )
What is Latent Semantic Indexing or LSI?
Latent Semantic Indexing or LSI has changed the world of search engine optimization. One fine morning, SEO experts found that most of their best ranking sites on Google were in jeopardy. Google has simply updated its crawler-program to accommodate LSI and has moved towards a more relevant rating list!
LSI is a methodology involving statistical probability and correlation that helps deducing the semantic distance between words. It's obviously a complex methodology but can be easily applied to understand the relation between certain words in a paragraph or in a document. This methodology is being used while indexing a page in the search engine's database.
Delving deeper, LSI is concerned not only with studying a document for keywords and listing it in the database, but also with studying a collection of documents and recognizing and identifying the words that are common between these documents. This way it can conclude on the semantic relation between the words being used in these documents. The process then finds out which other documents include or makes use of these semantically close words. The resultant documents are indexed to be related or closely relevant to a context, according to latent semantic indexing.
LSI regards the documents with certain proportion of words being used frequently to be semantically close. If there are fewer words common among documents, they are supposed to be semantically distant.
More Search Engine Tricks Articles
More By Jase Dow