|  | Posted by boblotz2001 on 11/02/05 23:40 
For anybody that implemented fuzzy searches in SQL Server or any otherdatabase for that matter.  I have implemented edit distance algorithm,
 specifically q-grams, in SQL Server and wanted to get some opinions
 from anyone who used a similar algorithm in a DBMS.
 
 We are looking to implement fuzzy address search and are employing
 number of different techniques including synonyms for strings like
 "drive", "dr", "drv", etc...  Number expansion, so 6 ave = Six Ave =
 6th Ave = 6...  Now, we also have an implementation of edit distance
 algorithm, which would cover misspellings and similar sounding words.
 One of the challenges is putting various techniques together.  Synonyms
 and number expansion techniques go well together but no edit distance.
 
 
 Also, I am trying to figure out how to leverage edit distance
 implementation for 'contains' searches.  So if a string like
 'Mississippi Municipal Building' would be returned if the Search is for
 'Misisipi Municipal'.
 
 Any thoughts?
 [Back to original message] |