Piling up workarounds doesn't improve things and sooner or later leads to a re-write.
So why don't you do the rewrite at this very moment and save yourself some time and extra work?
The set of word characters is probably correct, because ctrl-left, ctrl-right seem to work correctly or at least they are adequate.
And I think these use the current locale set on the editor or it might be using UTF8 internally. I'm not sure.
But anyway scintilla should handle encodings well, because this is one of its main tasks.
p.s. Also have you measured performance? All these calls your making aren't cheap at all!