On Dec 13, 2004, at 1:10, Dominic Hargreaves wrote:
On Mon, Dec 13, 2004 at 01:02:24AM +0000, Tom Insam wrote:
CGI::Wiki (recent versions thereof) isn't fully supportive of all charsets, but I don't care, because the only charset worth caring about is utf-8, and it does that.
I assume you are talking about mixing charsets in the same database? We *do* need to support different charsets in different databases - it is not okay to mandate utf8 everywhere at the moment, IMO.
It's occurred to me that you might mean 'we have to allow latin-1 as well' here, which I agree with, but I think offering a choice of latin-1 or utf-8 is reasonable. Having said that, I think the code will support any charset you ask of it.
If the latter, we would presumably need to patch CGI::Wiki (I was expecting a handful of fixes needed there anyway, to tell the database what charset to use and so on).
0.62 looks like it's got most of my UTF-8 work - page names in Recent Changes are still handled badly, that's the only major bug I know of in there.
Patches against the current release ok?
That would be fantastic!
Right then. I'm working on this in my svn repository because I hate CVS. So far I have utf8 page contents and names, usernames, other preferences, and metadata, I think.
Resources:
svn repository - http://dev.jerakeen.org/svn/tomi/Projects/OpenGuides
working copy - http://dev.jerakeen.org/og_utf8/wiki.cgi?Home (slow, that's just the colo)
patch - http://dev.jerakeen.org/files/2004/12/13/utf8.diff
Known bugs:
Without my SVN CGI::Wiki, you'll have broken page names in the recent changes view. This is only for people trying this themselves - the copy linked above is fine.
The diff view is broken: http://dev.jerakeen.org/og_utf8/wiki.cgi?id=Home&version=1&diffversion=2 - but then, it's broken in the real thing too.
I haven't tried this on 5.6 - I can't get OG to build on 5.6 here. Anyone want to try it? I'm not expecting utf-8 under 5.6, I'm just expecting it to not break.
Anything else anyone can find, I'd love to know about.
tom