Is there an implementation of the Jaro-Winkler string comparison in SAS?
It looks like Link King has Jaro-Winkler, but I'd prefer the flexibility of calling the function myself.
Thanks!
I modified and corrected cmjohns' code; thanks to them for getting me started. Winkler published some examples in Table 6 of his paper: Winkler, W. E. (2006). "Overview of Record Linkage and Current Research Directions". Research Report Series, RRS. I used those examples to test my code.
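As a rough illustration only (this is not the code referred to above, and not Winkler's strcmp.c), a basic Jaro-Winkler similarity could be sketched with proc fcmp along the following lines. The function name jaro_winkler, the work.funcs.strings output location, the 200-character cap, and the upcasing of both inputs are assumptions made for the example. It follows the textbook formula with a prefix scale of 0.1 and at most 4 prefix characters, and it leaves out the extra adjustments found in Winkler's original program.

```sas
proc fcmp outlib=work.funcs.strings;
  /* Sketch of Jaro-Winkler similarity: 0 = no match, 1 = identical. */
  /* Assumption: inputs are compared case-insensitively and are      */
  /* truncated to 200 characters.                                    */
  function jaro_winkler(s1 $, s2 $);
    length a b $200;
    array m1[200] / nosymbols;   /* flags: character i of a matched  */
    array m2[200] / nosymbols;   /* flags: character j of b matched  */

    a = upcase(strip(s1));
    b = upcase(strip(s2));
    len1 = lengthn(a);
    len2 = lengthn(b);
    if len1 = 0 or len2 = 0 then return (0);

    /* Reset the flags on every call. */
    do i = 1 to 200;
      m1[i] = 0;
      m2[i] = 0;
    end;

    /* Characters match only if they are within this many positions. */
    win = max(floor(max(len1, len2) / 2) - 1, 0);

    /* Pass 1: find matching characters. */
    matches = 0;
    do i = 1 to len1;
      lo = max(1, i - win);
      hi = min(len2, i + win);
      found = 0;
      do j = lo to hi;
        if found = 0 and m2[j] = 0 then do;
          if substr(a, i, 1) = substr(b, j, 1) then do;
            m1[i] = 1;
            m2[j] = 1;
            matches = matches + 1;
            found = 1;
          end;
        end;
      end;
    end;
    if matches = 0 then return (0);

    /* Pass 2: count matched characters that are out of order. */
    k = 1;
    transp = 0;
    do i = 1 to len1;
      if m1[i] = 1 then do;
        do while (m2[k] = 0);
          k = k + 1;
        end;
        if substr(a, i, 1) ne substr(b, k, 1) then transp = transp + 1;
        k = k + 1;
      end;
    end;
    transp = transp / 2;

    jaro = (matches / len1 + matches / len2
            + (matches - transp) / matches) / 3;

    /* Winkler boost: length of the common prefix, capped at 4. */
    prefix = 0;
    done = 0;
    do i = 1 to min(4, len1, len2);
      if done = 0 then do;
        if substr(a, i, 1) = substr(b, i, 1) then prefix = prefix + 1;
        else done = 1;
      end;
    end;

    return (jaro + prefix * 0.1 * (1 - jaro));
  endsub;
run;

/* Make the compiled function visible to the DATA step, then call it. */
options cmplib=work.funcs;

data _null_;
  score = jaro_winkler('MARTHA', 'MARHTA');
  put score=;
run;
```

A reasonable sanity check is the MARTHA / MARHTA pair, for which the commonly quoted Jaro-Winkler value is about 0.961. Values that depend on Winkler's additional adjustments may not be reproduced by a plain version like this one.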
There is no built-in function for Jaro-Winkler distance that I am aware of. @Itzy already referenced the only ones that I know of. You can roll your own functions with proc fcmp, though, if you feel up to it. I'll even give you a head start with the code below. I just tried to follow the Wikipedia article on it. It certainly isn't a perfect representation of Bill Winkler's strcmp.c file by any means, and it likely has lots of bugs.

I don't think so. SAS can do the Levenshtein distance (the complev function) or a generalized edit distance (compged), but I haven't seen any other edit distance functions. If you're dead set on doing this in SAS, you could write a program in PROC IML.
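For comparison, the two built-in edit distance functions can be called directly in a DATA step. This is only a small sketch; the variable names and sample strings are chosen for illustration.

```sas
/* Compare two strings with the built-in edit distance functions. */
data _null_;
  a = 'MARTHA';
  b = 'MARHTA';
  lev = complev(a, b);   /* Levenshtein edit distance            */
  ged = compged(a, b);   /* generalized edit distance (a cost)   */
  put lev= ged=;
run;
```

Both functions return distances, so lower values mean more similar strings, which is the opposite direction from a Jaro-Winkler similarity score.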