Search Algorithms
Searching...
No results found
Articles About

All Posts

5 articles

June 15, 2026 10 min read

Rocchio Relevance Feedback: Vector Space Query Expansion in Search

The Rocchio algorithm optimizes search queries in vector space by adjusting the query vector to shift toward relevant documents and away from non-relevant ones, resolving vocabulary mismatch.

June 15, 2026 9 min read

HITS (Hubs and Authorities): Topic-Scoped Authority in Search

Counting links picks the wrong page. HITS gives each page a hub and an authority score that define each other, computed per query, so authority is measured per topic.

June 14, 2026 7 min read

Locality-sensitive hashing: near-duplicate detection at scale with MinHash and SimHash

SimHash and MinHash compress high-dimensional document features into compact signatures that preserve similarity, letting search engines detect near-duplicates at web scale in sub-linear time.

June 14, 2026 6 min read

PrefixSpan and GSP: finding frequent paths in search sessions

GSP and PrefixSpan both find ordered behavior that appears often enough in a database of sequences. GSP generates candidate patterns and counts them; PrefixSpan projects the database to the suffixes after a prefix and grows patterns inside those smaller databases.

June 14, 2026 8 min read

Finding communities in a graph: Label Propagation, Louvain, and Leiden

Three algorithms split a graph into communities using only local moves, with no preset number of groups. Search systems use them: one in a web-ranking patent, another in language-model retrieval.

Search Algorithms

Worked explainers of search-adjacent algorithms, in an AI-forward way.

By <jr/> JR Oakes

Navigate

  • Home
  • Articles
  • About

Connect

  • jroakes.com
  • GitHub
  • LinkedIn
  • RSS Feed

© 2026 JR Oakes