Match rating approach

Match rating approach

A phonetic algorithm developed by Western Airlines in 1977 for the indexation and comparison of homophonous names.

The algorithm itself has a simple set of encoding rules but a more lengthy set of comparison rules. The main mechanism being the similarity comparison which calculates the number of unmatched characters by comparing the strings from left to right and then from right to left and removing identical characters. This value is subtracted from 6 and then compared to a minimum threshold. The minimum threshold is defined by table A and is dependent upon the length of the strings.

The encoded name is known (perhaps incorrectly) as a personal numeric identifier (PNI). The PNI codex can never contain more than 6 alpha only characters.

Match rating approach performs well with names containing the letter "y" unlike the original flavour of the NYSIIS algorithm. For example, the surnames "Smith" and "Smyth" are successfully matched.

MRA does not perform well with encoded names that differ in length by more than 2.


Contents

Encoding rules

  1. Delete all vowels unless the vowel begins the word
  2. Remove the second consonant of any double consonants present
  3. Reduce codex to 6 letters by joining the first 3 and last 3 letters only


Comparison rules

In this section, the words "string(s)" and "name(s)" mean "encoded string(s)" and "encoded name(s)".

  1. If the length difference between the encoded strings is 3 or greater, then no similarity comparison is done.
  2. Obtain the minimum rating value by calculating the length sum of the encoded strings and using table A
  3. Process the encoded strings from left to right and remove any identical characters found from both strings respectively.
  4. Process the unmatched characters from right to left and remove any identical characters found from both names respectively.
  5. Subtract the number of unmatched characters from 6 in the longer string. This is the similarity rating.
  6. If the similarity rating equal to or greater than the minimum rating then the match is considered good.


Minimum threshold

The following table shows the mapping between the minimum rating and the string lengths.

Table A
Sum of Lengths Minimum Rating
≤ 4 5
4 < sum ≤ 7 4
7 < sum ≤ 11 3
= 12 2


Match rating approach examples

The table below displays the output of the match rating approach algorithm for some common homophonous names.

Name MRA Codex Minimum Rating Similarity Comparison Rating
Byrne BYRN 4 5
Boern BRN
Smith SMTH 3 5
Smyth SMYTH
Catherine CTHRN 3 4
Kathryn KTHRYN


External references


Wikimedia Foundation. 2010.

Игры ⚽ Поможем решить контрольную работу

Look at other dictionaries:

  • Match Rating Approach — A phonetic algorithm developed by Western Airlines in 1977 for the indexation and comparison of homophonous names. The algorithm itself has a simple set of encoding rules but a more lengthy set of comparison rules.The main mechanism being the… …   Wikipedia

  • New Approach — Racing colours of Princess Haya of Jordan …   Wikipedia

  • Cricket Rating Systems — Cricket is a bat and ball sport that probably originated in England more than 300 years ago. It is a game that lends itself to statistical analysis and cricket fans have used these statistics to argue the merits of individual players and teams… …   Wikipedia

  • Metaphone — Lawrence Philips redirects here. For the football player, see Lawrence Phillips. Metaphone is a phonetic algorithm, an algorithm published in 1990 for indexing words by their English pronunciation. It fundamentally improves on the Soundex… …   Wikipedia

  • chess — chess1 /ches/, n. a game played by two persons, each with 16 pieces, on a chessboard. [1150 1200; ME < OF esches, pl. of eschec CHECK1] chess2 /ches/, n., pl. chess, chesses. one of the planks forming the roadway of a floating bridge. [1425 75;… …   Universalium

  • football — /foot bawl /, n. 1. a game in which two opposing teams of 11 players each defend goals at opposite ends of a field having goal posts at each end, with points being scored chiefly by carrying the ball across the opponent s goal line and by place… …   Universalium

  • Computer chess — 1990s Pressure sensory chess computer with LCD screen Chess+ For the iPad …   Wikipedia

  • Mikhail Botvinnik — Full name Mikhail Moiseyevich Botvinnik Country Soviet Union Born August 17, 1911( …   Wikipedia

  • Economic Affairs — ▪ 2006 Introduction In 2005 rising U.S. deficits, tight monetary policies, and higher oil prices triggered by hurricane damage in the Gulf of Mexico were moderating influences on the world economy and on U.S. stock markets, but some other… …   Universalium

  • Emanuel Lasker — Infobox chess player playername = Emanuel Lasker birthname = Emanuel Lasker country = GER datebirth = December 24, 1868 placebirth = Berlinchen, Prussia (now Barlinek, Poland) datedeath = January 11, 1941 (aged 72) placedeath = New York City,… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”