One Hundred Hellos - Blog

CJK - I18N - 22 June 2006

Get some chinese text - google china
pick a language code (zh)
zh-tw Chinese (Taiwan)
zh-cn Chinese (PRC)
zh-hk Chinese (Hong Kong SAR)
zh-sg Chinese (Singapore)

Use google language tools : Chinese -> 中国 -> China
tranlate big resource file:
for i in ./heads/aviso-core/conf/internationalization/action.xml ./heads/aviso-core/conf/internationalization/bluebox.xml ./heads/aviso-core/conf/internationalization/calendar.xml ./heads/aviso-core/conf/internationalization/db.xml ./heads/aviso-core/conf/internationalization/dialog.xml ./heads/aviso-core/conf/internationalization/error.xml ./heads/aviso-core/conf/internationalization/fsm.xml ./heads/aviso-core/conf/internationalization/menu.xml ./heads/aviso-core/conf/internationalization/message.xml ./heads/aviso-core/conf/internationalization/misc.xml ./heads/aviso-core/conf/internationalization/search.xml ./heads/btna/conf/internationalization/action.xml ./heads/btna/conf/internationalization/db.xml ./heads/btna/conf/internationalization/error.xml ./heads/btna/conf/internationalization/ncrcodes.xml ./heads/btna/conf/internationalization/report.xml ./heads/bwsc/conf/internationalization/db.xml ./heads/bwsc-linden/conf/internationalization/db.xml ./heads/cascades/conf/internationalization/menu.xml ./heads/cascades/conf/internationalization/db.xml ./heads/cascades/conf/internationalization/message.xml ./heads/cascades/conf/internationalization/misc.xml ./heads/cascades/conf/internationalization/search.xml ./heads/r142/conf/internationalization/db.xml ./heads/r160/conf/internationalization/db.xml ./heads/unique/conf/internationalization/db.xml ./heads/viau/conf/internationalization/bluebox.xml ./heads/viau/conf/internationalization/fsm.xml; do echo ""; grep -v "" $i; echo ""; done

Use unique database/head on branch_2006-06-20_daniel_utf8_devel

either
-translate ./heads/aviso-core/conf/internationalization by adding : 中国 suffix/prefix whith a style sheet. i18n-zh.xsl
-modify localization.xsl to produce zh, unicode escaped properties files...

Apply a global servlet filter to change default encoding type...

Change database encoding to utf-8 (mysql4) try users table first.

--------------------------------------------------------
-- What I've learned: from static page experiments
The browser detection seems to hinge on the jsp directive:
<%@ page language="java" contentType="text/html; charset=UTF-8" %>

which affects the http Content-Type header:
Content-Type: text/html;charset=UTF-8
whereas the default is
Content-Type: text/html;charset=ISO-8859-1

The html/head/meta/@content doesn't seem to affect the browser charset selection:
<meta content="text/html; charset=UTF-8" http-equiv="content-type">
when the content type is set in the http response header, which is always the case with jsp pages.
You can observe the headers with:
wget -S -O - http://boole:8080/unique/china-UTF-8.jsp