info-cvs

Re: Regarding check-in of unicode characters.


From: Rahul
Subject: Re: Regarding check-in of unicode characters.
Date: 24 Jun 2005 10:18:59 -0700
User-agent: G2/0.2

Hi Maninder -

You can use UTF-8 encoded files.

UTF-8 is a method of representing Unicode text as a stream of
8-bit bytes. The encoding is ASCII-compatible in both directions:
any pure-ASCII file is already valid UTF-8, and bytes in the ASCII
range never appear inside a multi-byte sequence. A single character
occupies from 1 to 4 bytes.
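
To make the byte counts concrete, here is a minimal sketch. Python 3 is
my choice for illustration, not something this thread prescribes, and
the helper name utf8_length is hypothetical. The sample characters
include the accented letters from the original question.

```python
# Sketch: how many bytes UTF-8 uses for characters of increasing code point.
# Escapes are used so the script itself stays pure ASCII.
samples = [
    "S",           # U+0053, plain ASCII: 1 byte
    "\u00ed",      # U+00ED, i with acute (as in Spanish "Si"): 2 bytes
    "\u00ec",      # U+00EC, i with grave (as in Italian "Si"): 2 bytes
    "\u20ac",      # U+20AC, euro sign: 3 bytes
    "\U0001F600",  # U+1F600, outside the Basic Multilingual Plane: 4 bytes
]


def utf8_length(ch: str) -> int:
    """Return the number of bytes the string ch occupies in UTF-8."""
    return len(ch.encode("utf-8"))


for ch in samples:
    encoded = ch.encode("utf-8")
    print(f"U+{ord(ch):04X} -> {utf8_length(ch)} byte(s): {encoded.hex()}")

# Pure-ASCII text is byte-for-byte identical in ASCII and in UTF-8.
assert "cvs status myfile".encode("ascii") == "cvs status myfile".encode("utf-8")
```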

Most IDEs and editors, such as Emacs, now let you switch the
default character encoding from ASCII to UTF-8.

More information on Unicode and UTF-8 is here -
http://www.unicode.org/faq/utf_bom.html#14

A source file encoded in UTF-8 can be handled by CVS (on Linux etc.)
without any problems. Most IDEs and editors that support CVS can be
configured to use UTF-8 by default, so that Unicode text works
seamlessly with CVS.
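
If you are unsure whether a file really is UTF-8 before committing it,
a quick check along these lines may help. This is only a sketch in
Python 3; the function name is_valid_utf8 is hypothetical and is not
part of CVS or of any tool mentioned here.

```python
from pathlib import Path


def is_valid_utf8(path: str) -> bool:
    """Return True if the file's raw bytes decode cleanly as UTF-8."""
    data = Path(path).read_bytes()
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        # Typical for files saved as Latin-1 or a Windows code page.
        return False
    return True
```

A file saved in Latin-1 that contains an accented letter will normally
fail this check, which is a hint that the editor's encoding setting
needs changing before the commit.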


With regard to CVSNT, the big difference is that CVSNT supports a
-ku keyword flag, which lets a file in UTF-16 be automatically
converted to UTF-8.

Internally, CVSNT also stores the files in UTF-8. Since most IDEs,
editors and tools can do UTF-8 encoding, this may be good enough.

For command-line cvs usage, tools like iconv (on UNIX, Cygwin)
can convert from any encoding to any other. So just use UTF-8
and you should be home free. You could have a script run iconv before
and after each check-in/checkout to make it seamless.
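
As a rough idea of what such a wrapper script would do, here is a
sketch that uses Python 3's built-in codecs in place of iconv itself.
The function name transcode and the file names in the usage note are
hypothetical; the behaviour is meant to mirror "iconv -f FROM -t TO".

```python
from pathlib import Path


def transcode(src: str, dst: str, from_enc: str, to_enc: str = "utf-8") -> None:
    """Re-encode the file src (in from_enc) into dst (in to_enc).

    Works on raw bytes so that line endings are left untouched.
    Raises UnicodeDecodeError if src is not valid from_enc.
    """
    text = Path(src).read_bytes().decode(from_enc)
    Path(dst).write_bytes(text.encode(to_enc))
```

For example, transcode("messages_utf16.txt", "messages.txt", "utf-16")
would produce a UTF-8 copy suitable for committing, and the reverse
call with to_enc="utf-16" could be run after a checkout.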

Rahul Bhargava,
CTO, WANdisco
Mountain View, CA
http://www.wandisco.com/cvs



Arthur Barrett wrote:
> Maninder,
>
> CVS does not support Unicode files, instead they must be committed as binary, 
> in which case you cannot do diff/merge etc.
>
> CVSNT on Mac/Linux/Unix/Windows (free, open source, GPL, just like CVS) does 
> support unicode files (-ku).
>
> Please supply more information (file type in repository, cvs version etc).
>
> To determine the "status" of the file use:
> cvs status myfile
>
> To determine your version of CVS use:
> cvs version
>
> If you are using CVSNT then the CVSNT newsgroup is the place that the 
> developers of CVSNT answer questions:
>
> news://news.cvsnt.org/support.cvsnt
>
> or
>
> http://www.cvsnt.org/cgi-bin/mailman/listinfo/cvsnt
>
>
> Regards,
>
>
> Arthur Barrett
>
>
>       -----Original Message-----
>       From: address@hidden [mailto:address@hidden On Behalf Of Maninder 
> Singh(SDG)
>       Sent: Thursday, 16 June 2005 1:51 AM
>       To: address@hidden
>       Subject: Regarding check-in of unicode characters.
>
>
>
>       Hi All,
>
>                   We are trying to check-in the following characters into CVS.
>
>       But while doing so, the highlighted characters get corrupted and, upon
> checking out, are replaced by some other characters.
>
>       Sí (means yes in Spanish)
>
>       Sì (means yes in Italian)
>
>
>
>       Can anybody provide a solution?
> 
>        
> 
>       Thanks and Regards,
> 
>       Maninder Singh


