Working on the search has caused me to mull over a particular problem:
Which searches will currently find "King's head"?
king    Yes (' is a non-word character, so it matches \b)
kings   No
king's  Yes
Ideally, I think we want the middle one ("kings") to work as well.
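To make the three cases concrete, here is a minimal sketch in Python. It assumes the search boils down to a case-insensitive regex with \b on each side of the query, which is only what the behaviour above suggests; the function names matches and matches_loose are made up for illustration and are not part of the existing code. The loose variant also compares the query and title with apostrophes removed, which is one possible way to get "kings" to hit.

```python
import re


def matches(query, title):
    """Case-insensitive whole-word match, using \\b on both sides of the query."""
    pattern = r"\b" + re.escape(query) + r"\b"
    return re.search(pattern, title, re.IGNORECASE) is not None


def matches_loose(query, title):
    """Hypothetical variant: also try again with apostrophes stripped from both sides."""
    stripped_query = query.replace("'", "")
    stripped_title = title.replace("'", "")
    return matches(query, title) or matches(stripped_query, stripped_title)


if __name__ == "__main__":
    for query in ("king", "kings", "king's"):
        print(query, matches(query, "King's head"), matches_loose(query, "King's head"))
```

Trying the original title first and the stripped title second keeps "king" working, since stripping the apostrophe alone would turn the title into "Kings head" and lose that match.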
Then I thought of the epic "King's Cross St Pancras". How many ways to write that one out?
And I realised that this is not just a search issue but a linking issue as well. I recall the problem we had with "Regent's Park" and "Regents Park", which has been worked around with a redirect.
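One way to think about the linking side is a canonical key that both spellings collapse to, so that a lookup no longer needs a hand-made redirect. The sketch below is in Python and is purely illustrative: link_key is a hypothetical helper, and the rules it applies (lowercase, drop apostrophes, treat other punctuation as spaces) are assumptions rather than anything the current code does.

```python
import re


def link_key(name):
    """Collapse a node name to a comparison key (hypothetical rules)."""
    # Drop straight and curly apostrophes so "Regent's" and "Regents" agree.
    key = name.lower().replace("'", "").replace("\u2019", "")
    # Treat any remaining run of punctuation or whitespace as a single space.
    key = re.sub(r"[^a-z0-9]+", " ", key)
    return key.strip()


if __name__ == "__main__":
    print(link_key("Regent's Park"))
    print(link_key("Regents Park"))
    print(link_key("King's Cross St. Pancras"))
```

With a key like this, "Regent's Park" and "Regents Park" compare equal, and several of the ways of writing out "King's Cross St Pancras" do too, although it does nothing about "St" versus "Saint".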
Also, I'm wondering about having a list of standard abbreviations somewhere, which gets applied in-line as part of the node_name_to_node_title munging:
ave => avenue
ct => court
gdns => gardens
hse => house
rd => road
st => street
st => saint ...oops!
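A rough sketch of how such a table might be applied, in Python for illustration only. The table holds just the unambiguous entries from the list above; expand_abbreviations is a hypothetical helper, not the real node_name_to_node_title code. For "st" it uses a guess that I am not at all sure about: the last word of a name is taken to be "street" and any other position "saint".

```python
# Unambiguous abbreviations from the list above.
ABBREVIATIONS = {
    "ave": "avenue",
    "ct": "court",
    "gdns": "gardens",
    "hse": "house",
    "rd": "road",
}


def expand_abbreviations(name):
    """Lowercase a name and expand known abbreviations word by word."""
    words = name.lower().split()
    expanded = []
    for position, word in enumerate(words):
        bare = word.rstrip(".")  # accept "St." as well as "St"
        if bare == "st":
            # Positional guess (an assumption): final word is "street", otherwise "saint".
            is_last = position == len(words) - 1
            expanded.append("street" if is_last else "saint")
        else:
            expanded.append(ABBREVIATIONS.get(bare, word))
    return " ".join(expanded)


if __name__ == "__main__":
    for name in ("Baker St", "St John St", "King's Cross St Pancras", "Kew Gdns"):
        print(name, "=>", expand_abbreviations(name))
```

The positional guess gets "St John St" and "King's Cross St Pancras" right but would turn "High St North" into "high saint north", so it probably needs an exceptions list or a human decision.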
This is the end of my braindump on this. I need input from others.
It will also generate more test cases for the search :).
Ivor.