INCREV Query Match Tool - Applying Sentence-BERT Algorithm to analyze Topical Authority for onpage content and link relevance. DOI: 10.5281/zenodo.17571849 (David Vesterlund, IncRev.co)
Description
IncRev SEO Research team has developed QueryMatch, a tool that uses the SBERT Algorithm to measure similarity score between keywords and content. This has several applications within SEO Analysis and AI Search Analysis like: Topic clustering for Topical Authority, opitimize Topicality in articles for onpage SEO, Measure relevance for potential link building oppertunities, audit existing backlink relevance and many more.
In this article we study how Sentence-BERT (SBERT) sentence embeddings and cosine similarity can be operationalized for SEO tasks.
We present Query Match, an INCREV system and tool that scrapes target pages,
vectorizes content at paragraph granularity, and automatically rewrites low-similarity sections to surpass configurable thresholds.
We connect our approach to established distributional semantics (PMI factorization) and recent transformer-based encoders, and we report design heuristics for reliable deployment in multilingual settings.
We also show the high correlation (88-97% depenging on model) in vector spaces between SBERT and both Googles and Open AIs embedding algorithms as well as the linearity of correlation across the vector space. This high correlation indicates SBERT and INCREVs QueryMatch to be a good proxy for understanding hos search engines and AI engines understand and measure topicality and relevance between search queries and the competing subpages in the SERP.
Files
INCREV Query Match Tool - Applying Sentence-BERT Algorithm to analyze Topical Authority for onpage content and link relevance. DOI- 10.5281:zenodo.17571849 (David Vesterlund, IncRev.co).pdf
Files
(603.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:1ef419a63b48aa890dabdc7fab5ccb17
|
603.0 kB | Preview Download |
Additional details
Related works
- Is supplement to
- Preprint: 10.5281/zenodo.17360293 (DOI)
- Is supplemented by
- Preprint: 10.5281/zenodo.17570412 (DOI)
Dates
- Issued
-
2025-11-10
References
- Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. EMNLP. https://arxiv.org/abs/1908.10084
- Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL. https://arxiv.org/abs/1810.04805
- Cer, D., Yang, Y., Kong, S.-Y., et al. (2018). Universal Sentence Encoder. arXiv:1803.11175. https://arxiv.org/abs/1803.11175
- Vaswani, A., Shazeer, N., Parmar, N., et al. (2017). Attention Is All You Need. NeurIPS. https://arxiv.org/abs/1706.03762
- Levy, O., & Goldberg, Y. (2014). Neural word embedding as implicit matrix factorization. NeurIPS. https://proceedings.neurips.cc/paper_files/paper/2014/file/b78666971ceae55a8e87efb7cbfd9ad4-Paper.pdf
- Pennington, J., Socher, R., & Manning, C. (2014). GloVe: Global Vectors for Word Representation. EMNLP. https://aclanthology.org/D14-1162.pdf
- OpenAI (2024). Text Embedding Models v3 Documentation. https://platform.openai.com/docs/guides/embeddings
- Feng, F., Yang, Y., Cer, D., et al. (2020). LaBSE: Language-agnostic BERT Sentence Embedding. arXiv:2007.01852. https://arxiv.org/abs/2007.01852
- Gao, T., et al. (2023). MTEB: Massive Text Embedding Benchmark. arXiv:2304.04663. https://arxiv.org/abs/2304.04663
- Anderson, S. (2025). Strategic SEO 2025 about T*, Q* and topicality https://www.hobo-web.co.uk/wp-content/uploads/hobo-strategic-seo-2025.pdf
- Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. (PageRank). snap.stanford.edu
- Kleinberg, J. (1999). Hubs and Authorities. (HITS framework). cs.brown.edu
- Google. Search Quality Rater Guidelines (E-E-A-T, Needs Met). static.googleusercontent.com
- Google Developers (Cloud Search). Improve search quality—definition of topicality. Google for Developers
- Google / industry coverage. Topic Authority system for surfacing expert sources in news verticals. Search Engine Land+1
- Ahrefs.com article on What is Domain Rating (DR)? https://help.ahrefs.com/en/articles/1409408-what-is-domain-rating-dr
- INCREV® SEO Resources on Zenodo. https://zenodo.org/communities/increvseo/
- INCREV® SEO Research simplified article on the QueryMatch tool https://increv.co/academy/seo-research/querymatch
- INCREV® SEO Research simplified article on semantic vectorization for SEO analysis Https://increv.co/academy/seo-research/semantic-vectorization-seo
- INCREV® SEO Academy article on Google TrustRank https://increv.co/academy/google-trustrank/