THE FILES HERE HAVE BEEN UPDATED BY A LATER POST.
Attached is an update to the Street Abbreviations and Secondary Unit translations. There are two files: Address Abbreviations (Long) and Address Abbreviations (Short). The Long version takes 'St' to 'Street' and the Short version flips that around, 'Street' becomes 'St'.
Address Abbreviations (Long).xlsx
Address Abbreviations (Short).xlsx
Directions are now considered which means NW (among others) can translate to Northwest.
The problem mentioned in the first post has been fixed, as well as several other issues. For instance "140 St. Mary's Street Ofc. 204" now translates in long form to "140 St Mary's Street Office 204".
You'll need to create two separate dictionaries, one for the Long form and one for the Short form. Open the spreadsheet and select all of the cells with data (including TRUE) and Copy (Ctrl + C) the entire block. Next go to IOM Configuration -> Dictionaries -> click Add -> click No. Right click in Replacement Values and choose Paste.
These are the titles and descriptions I used:
Address Abbreviations (Long)
Alter Arcane Address Abbreviations. Translate address abbreviations to their long form.
Address Abbreviations (Short)
Alter Arcane Address Abbreviations. Translate address abbreviations to their short form.
These are some mixed long/short examples used in testing:
123 West Hampton Drive Unit 1506
123 W Hampton Dr Unit 1506
123 W. Hampton Dr. Unit 1506
123 W. Hampton Dr. Apt. 1506
123 nw. St. Hampton Dr., Ofc #1506
123 NorthWest. St. Hampton Circle, Office 1506
123 sw St. Hampton Circle, Office 1506
123 St Main Street Po box 124
15800 Countrydrive Avenue blvd., apt. 1506
You will probably find some limitations with with this regular expression set. If you do and can fix it, post your solution back here!
Can one of you explain the (?
Also, for the directions I was using:
N from (?i)(n(\s|\.)|north(\s|\.))
N from (?i)(n|north$)
Does that make sense? That works for the NE in Newton as well since I only take ne if it's followed by a space or . or end of line.
NE from (?i)(ne(\s|\.)|northeast(\s|\.))
NE from (?i)(ne$|northeast$)
it looks like the editor took out my tags in the last one.
The first N has a space in front of it, and second N does not. Same with the NE.
That's what I thought, but I was confused until just now when I copy and pasted the lines from the file.
He is using (?
You're welcome and I am glad you have found the expressions useful!
Below is an update to the Street Abbreviations, Secondary Unit, and Street Directions translations. As before there are two files: Address Abbreviations (Long) and Address Abbreviations (Short). The Long version takes 'St' to 'Street' and the Short version flips that around, 'Street' becomes 'St'.
The directions regular expressions have been tuned to not affect letters other than followed by a space (N ), period (N.), or at the end of a line.
There were two changes made: the directions issue that Christine mentioned is now fixed and the 'Court' to 'Courts' problem is gone.
As before if you find a problem or anything that could be improved upon either let us know or post your solution here.