[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: character converter (Re: current work)
From: |
Tom Tromey |
Subject: |
Re: character converter (Re: current work) |
Date: |
25 Jun 2001 10:18:10 -0600 |
>>>>> "Aaron" == Aaron M Renn <address@hidden> writes:
Aaron> We should probably ensure that there is a common mechanism
Aaron> going forward so that we don't end up with rework. I'm
Aaron> particularly interested to see how the framework will handle
Aaron> complex character sets.
I agree. This is a tricky area though, and the libgcj code is
undergoing some discussion. For instance we discovered that the
String(byte[]) constructor was creating too much garbage if you
converted a large number of strings. That's because it was creating a
new converter for each string. So now we're looking at a way to reuse
converters (the patch isn't in yet but probably will be soon).
Takashi> Well, I want to add other japanese encodings like EUC-JP
Takashi> ,Shift_JIS... Does anyone implement them? If not, I would try
Takashi> that.
libgcj has converters for a few Japanese encodings. If the character
set interface were unified then Classpath could simply use these.
Note that this isn't a trivial amount of work.
Tom