/[gentoo]/xml/htdocs/doc/en/utf-8.xml
Gentoo

Diff of /xml/htdocs/doc/en/utf-8.xml

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

Revision 1.33 Revision 1.41
1<?xml version='1.0' encoding="UTF-8"?> 1<?xml version='1.0' encoding="UTF-8"?>
2<!-- $Header: /var/cvsroot/gentoo/xml/htdocs/doc/en/utf-8.xml,v 1.33 2005/08/05 12:58:11 swift Exp $ --> 2<!-- $Header: /var/cvsroot/gentoo/xml/htdocs/doc/en/utf-8.xml,v 1.41 2006/07/15 17:22:49 fox2mike Exp $ -->
3<!DOCTYPE guide SYSTEM "/dtd/guide.dtd"> 3<!DOCTYPE guide SYSTEM "/dtd/guide.dtd">
4 4
5<guide link="/doc/en/utf-8.xml"> 5<guide link="/doc/en/utf-8.xml">
6<title>Using UTF-8 with Gentoo</title> 6<title>Using UTF-8 with Gentoo</title>
7 7
23 23
24<!-- The content of this document is licensed under the CC-BY-SA license --> 24<!-- The content of this document is licensed under the CC-BY-SA license -->
25<!-- See http://creativecommons.org/licenses/by-sa/2.5 --> 25<!-- See http://creativecommons.org/licenses/by-sa/2.5 -->
26<license /> 26<license />
27 27
28<version>2.13</version> 28<version>2.20</version>
29<date>2005-08-05</date> 29<date>2006-07-15</date>
30 30
31<chapter> 31<chapter>
32<title>Character Encodings</title> 32<title>Character Encodings</title>
33<section> 33<section>
34<title>What is a Character Encoding?</title> 34<title>What is a Character Encoding?</title>
339 your FAT filesystems or Joilet CD-ROMs.)</comment> 339 your FAT filesystems or Joilet CD-ROMs.)</comment>
340</pre> 340</pre>
341 341
342<p> 342<p>
343If you plan on mounting NTFS partitions, you may need to specify an <c>nls=</c> 343If you plan on mounting NTFS partitions, you may need to specify an <c>nls=</c>
344option with mount. If you plan on mounting FAT partitions, you many need to 344option with mount. If you plan on mounting FAT partitions, you may need to
345specify a <c>codepage=</c> option with mount. Optionally, you can also set a 345specify a <c>codepage=</c> option with mount. Optionally, you can also set a
346default codepage for FAT in the kernel configuration. Note that the 346default codepage for FAT in the kernel configuration. Note that the
347<c>codepage</c> option with mount will override the kernel settings. 347<c>codepage</c> option with mount will override the kernel settings.
348</p> 348</p>
349 349
534 534
535<p> 535<p>
536Terminal emulators that use Xft and support Unicode are harder to come by. 536Terminal emulators that use Xft and support Unicode are harder to come by.
537Aside from Konsole and gnome-terminal, the best options in Portage are 537Aside from Konsole and gnome-terminal, the best options in Portage are
538<c>x11-terms/rxvt-unicode</c>, <c>xfce-extra/terminal</c>, 538<c>x11-terms/rxvt-unicode</c>, <c>xfce-extra/terminal</c>,
539<c>gnustep-apps/terminal</c>, <c>x11-terms/mlterm</c>, <c>x11-terms/mrxvt</c> or 539<c>gnustep-apps/terminal</c>, <c>x11-terms/mlterm</c>, or plain
540plain <c>x11-terms/xterm</c> when built with the <c>unicode</c> USE flag and 540<c>x11-terms/xterm</c> when built with the <c>unicode</c> USE flag and invoked
541invoked as <c>uxterm</c>. <c>app-misc/screen</c> supports UTF-8 too, when 541as <c>uxterm</c>. <c>app-misc/screen</c> supports UTF-8 too, when invoked as
542invoked as <c>screen -u</c> or the following is put into the 542<c>screen -U</c> or the following is put into the <path>~/.screenrc</path>:
543<path>~/.screenrc</path>:
544</p> 543</p>
545 544
546<pre caption="~/.screenrc for UTF-8"> 545<pre caption="~/.screenrc for UTF-8">
547defutf8 on 546defutf8 on
548</pre> 547</pre>
577(set-default-coding-systems 'utf-8) 576(set-default-coding-systems 'utf-8)
578(set-terminal-coding-system 'utf-8) 577(set-terminal-coding-system 'utf-8)
579</pre> 578</pre>
580 579
581<p> 580<p>
582Nano currently does not provide support for UTF-8, although it has been planned 581Nano has provided full UTF-8 support since version 1.3.6.
583for a long time. With luck, this will change in future. At the time of writing,
584UTF-8 support is in Nano's CVS, and should be included in the next release.
585</p> 582</p>
586 583
587</body> 584</body>
588</section> 585</section>
589<section> 586<section>
611Irssi has complete UTF-8 support, although it does require a user 608Irssi has complete UTF-8 support, although it does require a user
612to set an option. 609to set an option.
613</p> 610</p>
614 611
615<pre caption="Enabling UTF-8 in Irssi"> 612<pre caption="Enabling UTF-8 in Irssi">
616/set term_type UTF-8 613/set term_charset UTF-8
617</pre> 614</pre>
618 615
619<p> 616<p>
620For channels where non-ASCII characters are often exchanged in non-UTF-8 617For channels where non-ASCII characters are often exchanged in non-UTF-8
621charsets, the <c>/recode</c> command may be used to convert the characters. 618charsets, the <c>/recode</c> command may be used to convert the characters.
628<title>Mutt</title> 625<title>Mutt</title>
629<body> 626<body>
630 627
631<p> 628<p>
632The Mutt mail user agent has very good Unicode support. To use UTF-8 with Mutt, 629The Mutt mail user agent has very good Unicode support. To use UTF-8 with Mutt,
633put the following in your <path>~/.muttrc</path>: 630you don't need to put anything in your configuration files. Mutt will work
634</p> 631under unicode enviroment without modification if all your configuration files
635 632(signature included) are UTF-8 encoded.
636<pre caption="~/.muttrc for UTF-8">
637set send_charset="utf8" <comment>(outgoing character set)</comment>
638set charset="utf8" <comment>(display character set)</comment>
639</pre> 633</p>
640 634
641<note> 635<note>
642You may still see '?' in mail you read with Mutt. This is a result of people 636You may still see '?' in mail you read with Mutt. This is a result of people
643using a mail client which does not indicate the used charset. You can't do much 637using a mail client which does not indicate the used charset. You can't do much
644about this than to ask them to configure their client correctly. 638about this than to ask them to configure their client correctly.
713set terminal.linux.utf_8_io = 1 707set terminal.linux.utf_8_io = 1
714 708
715<comment>(For links, edit ~/.links/links.cfg and add the following 709<comment>(For links, edit ~/.links/links.cfg and add the following
716line)</comment> 710line)</comment>
717terminal "xterm" 0 1 0 us-ascii utf-8 711terminal "xterm" 0 1 0 us-ascii utf-8
712</pre>
713
714</body>
715</section>
716<section>
717<title>Samba</title>
718<body>
719
720<p>
721Samba is a software suite which implements the SMB (Server Message Block)
722protocol for UNIX systems such as Macs, Linux and FreeBSD. The protocol
723is also sometimes referred to as the Common Internet File System (CIFS). Samba
724also includes the NetBOIS system - used for file sharing over windows networks.
725</p>
726
727<pre caption="Enabling UTF-8 for Samba">
728<comment>(Edit /etc/samba/smb.conf and add the following under the [global] section)</comment>
729dos charset = 1255
730unix charset = UTF-8
731display charset = UTF-8
718</pre> 732</pre>
719 733
720</body> 734</body>
721</section> 735</section>
722<section> 736<section>
835<title>Resources</title> 849<title>Resources</title>
836<body> 850<body>
837 851
838<ul> 852<ul>
839 <li> 853 <li>
840 <uri link="http://www.wikipedia.com/wiki/Unicode">The Wikipedia entry for 854 <uri link="http://en.wikipedia.org/wiki/Unicode">The Wikipedia entry for
841 Unicode</uri> 855 Unicode</uri>
842 </li> 856 </li>
843 <li> 857 <li>
844 <uri link="http://www.wikipedia.com/wiki/UTF-8">The Wikipedia entry for 858 <uri link="http://en.wikipedia.org/wiki/UTF-8">The Wikipedia entry for
845 UTF-8</uri> 859 UTF-8</uri>
846 </li> 860 </li>
847 <li><uri link="http://www.unicode.org">Unicode.org</uri></li> 861 <li><uri link="http://www.unicode.org">Unicode.org</uri></li>
848 <li><uri link="http://www.utf-8.com">UTF-8.com</uri></li> 862 <li><uri link="http://www.utf-8.com">UTF-8.com</uri></li>
849 <li><uri link="http://www.ietf.org/rfc/rfc3629.txt">RFC 3629</uri></li> 863 <li><uri link="http://www.ietf.org/rfc/rfc3629.txt">RFC 3629</uri></li>

Legend:
Removed from v.1.33  
changed lines
  Added in v.1.41

  ViewVC Help
Powered by ViewVC 1.1.20