|
Posted by boblotz2001 on 11/02/05 23:40
For anybody that implemented fuzzy searches in SQL Server or any other
database for that matter. I have implemented edit distance algorithm,
specifically q-grams, in SQL Server and wanted to get some opinions
from anyone who used a similar algorithm in a DBMS.
We are looking to implement fuzzy address search and are employing
number of different techniques including synonyms for strings like
"drive", "dr", "drv", etc... Number expansion, so 6 ave = Six Ave =
6th Ave = 6... Now, we also have an implementation of edit distance
algorithm, which would cover misspellings and similar sounding words.
One of the challenges is putting various techniques together. Synonyms
and number expansion techniques go well together but no edit distance.
Also, I am trying to figure out how to leverage edit distance
implementation for 'contains' searches. So if a string like
'Mississippi Municipal Building' would be returned if the Search is for
'Misisipi Municipal'.
Any thoughts?
[Back to original message]
|