CLDR: The Common Locale Data Repository
Locales for the World

Lisa Moore, George Rhoten, Mark Davis, Steven Loomis

Agenda
- Why CLDR?
- CLDR data
- Tools and vetting
- Today and the future

Locales: is anything the same?
- Locales specify user preferences
- Linguistic and cultural differences
- Languages, scripts, writing systems, ordering, directionality, formatting, numbers, sizes
- Interoperability issues across platforms
- Global economics has increased the need for greater globalization support in computer systems

Add the Universal Character Encoding
- Unicode: Unique character codes for all languages

The Need for Common Locale Data
- Computing environments often contain a variety of operating systems and software.
- Historically, locale-sensitive data research has been done by individuals and/or companies.
- Because of political changes, it is easy for locale data to become out of date.
- It is difficult to get complete agreement on correctness.

Common Locale Data Project
- Began as Common XML Locale Repository (CXLR), developed by OpenI18N in 2003
- CLDR project began in 2004
- Hosted by Unicode Consortium: http://www.unicode.org/cldr/
- Goals:
  - Common, necessary software locale data for all world languages
  - Collect and maintain locale data
  - XML format for effective interchange
  - Freely available

CLDR in use (partial list)
- Libraries and Environments
  - ICU (International Components for Unicode)
  - JDK (Java Development Kit)
- Operating Systems
  - Solaris
  - AIX
  - MacOS X
- Applications
  - OpenOffice.org
  - Acrobat
  - ModernBill

Agenda
- Why CLDR?
- CLDR data
- Tools and vetting
- The future

What is a Locale?
- A locale is an identifier referring to linguistic and cultural preferences
  - en_US, en_GB, ja_JP
- These preferences can change over time due to cultural and political reasons
  - Introduction of new currencies, like the Euro
  - Standard sorting of Spanish changes
- Many of these preferences have varying degrees of standardization
  - 12 and 24 hour format in the United States
- This is a very broad topic
  - Scope of data limited to common system applications

Types of Locale Data
- Dates/time/calendar formats
- Number/currency formats
- Measurement system
- Collation specification
  - Sorting
  - Searching
  - Matching
- Translated names for language, territory, script, timezones, currencies, ...
- Script and characters used by a language

Locale Data Markup Language
- Locale data described using XML
- CLDR data uses LDML
- Structure of CLDR controlled by Locale Data Markup Language (LDML) specification: http://unicode.org/reports/tr35

LDML Data Categories

Names
- Provides translated display names for languages, territories, scripts, variants and keywords used in CLDR.
- Most of this information is at the language level, since it typically does not vary by territory, only language.

Languages, Scripts, Territories Example
From ga.xml (Irish):
    <localeDisplayNames>
      <languages>
        <language type="aa">Afar</language>
        <language type="ab">Abcáisis</language>
        ...
      <scripts>
        <script type="Arab">Araibis</script>
        ...
      <territories>
        <territory type="AD">Andóra</territory>
        <territory type="AE">Aontas na nÉimíríochtaí Arabacha</territory>
        ...

Characters
- Allows for creation of exemplar character sets.
- An exemplar set specifies the set of characters that must be present in order to properly render the language.
- Auxiliary exemplar set defines additional characters that may appear in foreign words or phrases.
- Lower case only

Date Formats
- Defines representation of calendars using various calendaring systems (Gregorian, Buddhist, Islamic, Japanese, etc.)
- Defines formatting for dates, times, eras and time zones
  - wide, abbreviated, or narrow
- Date and time formats use patterns of letters to define proper formatting
- Week information
- Relative day/time translations (for example, yesterday, tomorrow, etc.)

Characters / Dates Examples
From ga.xml (Irish):
    <characters>
      <exemplarCharacters>[a á b-e é f-i í j-o ó p-u ú v-z]</exemplarCharacters>
      <exemplarCharacters type="auxiliary">[ḃ ċ ḋ ḟ ġ ṁ ṗ ṡ ṫ]</exemplarCharacters>
    </characters>
    ...
    <dayContext type="format">
      <dayWidth type="abbreviated">
        <day type="sun">Domh</day>
        <day type="mon">Luan</day>
        ...

Specifying Time Zone Names
- Based on Olson time zone database
- Localized display names for standard, daylight, and generic representations of time zones.
- Short and long display names.

Numbers
- Specifies proper localized formatting of numeric quantities
  - Decimal
  - Scientific
  - Currency
  - Percentages
- Includes localized decimal, thousands separators, currency symbols, etc.

Time Zones / Currencies
From ga.xml (Irish) and root.xml:
    <timeZoneNames>
      <zone type="Europe/Dublin">
        <long>
          <standard>Meán-Am Greenwich</standard>
          <daylight>Am Samhraidh na hÉireann</daylight>
        </long>
        ...
    <numbers>
      <currencies>
        <currency type="EUR">
          <displayName>Euro</displayName>
          <symbol>€</symbol>
          ...

Delimiters
- Specifies a primary and a secondary pair of delimiter characters to be used for bracketing quotations in text

Delimiters Example
From fr.xml (French):
    <delimiters>
      <quotationStart>«</quotationStart>
      <quotationEnd>»</quotationEnd>
      <alternateQuotationStart> </alternateQuotationStart>
      <alternateQuotationEnd> </alternateQuotationEnd>
    </delimiters>

Text Layout Direction
- Specifies the proper vertical and horizontal directionality of text in the given language.

Layout Example
From ar.xml (Arabic):

Measurements
- Specifies which measurement system (metric vs. US) is used
- Specifies paper size (in mm) used for business letters

Measurements Example
From en_US.xml: 279 216
From ga.xml: Meiriceánach, Méadrach

Collation
- Information in collation directory, not main
- XML version of Java/ICU collation syntax
- Unicode collation algorithm is the base: http://unicode.org/reports/tr10
- Allows tailoring of the UCA on a per locale basis.

Collation Example
From collations/root.xml:
    <collations validSubLocales="ga ga_IE id id_ID ms ms_BN ms_MY nl nl_BE nl_NL pt pt_BR pt_PT">
      <collation type="standard">
        <rules>
          ...
          <s></s> <t></t>
          <s></s> <t></t>
          <s></s> <t></t>
          <s></s> <t></t>
          ...

Agenda
- Why CLDR?
- CLDR data
- Tools and vetting
- Today and the future

Latest Release: CLDR 1.4
- Released: July 17, 2006
- 360 locales: 121 languages, 142 territories
- 25% more data
- 17,000 new or modified data items
- Over 100 different contributors

Challenges
- Complex Formats
- Experts knowledgeable both in technology and a specific language
  - Collation
  - Exemplar characters
  - Etc.
- Require close interaction of CLDR experts with language experts

Getting Involved
- Simplest: anyone!
  - Use CLDR
  - Bug report / feature request
- More Involved
  - Vetting, Assessment, Tools, Policies, Decisions, ...
  - Any Unicode member eligible to name representatives, including country liaison members

Example Country Process (Finland)
- Finnish Ministry of Education made CLDR data a major goal, 2004-06
- Research Institute for the Languages of Finland (RILF, aka "Kotus") designated agency
- Two official languages (Finnish and Swedish) & four regional / minority languages (three Sámi & Romani as spoken in Finland) to be covered
- Over 30 different parties represented: commercial, non-commercial, individuals
- Results expected to lead to new/revised national standards

For More Information
- Unicode: http://www.unicode.org/
- CLDR: http://www.unicode.org/cldr/
- LDML specification: http://unicode.org/reports/tr35
- lisam@us.ibm.com

LRC XI: The Localisation Factory
Dublin, Ireland, October 2006

Speaker notes

There are some challenges for creating data for CLDR. Some of the information can be complex. Some items in CLDR have a very specific purpose and meaning, but a language expert may be unfamiliar with these purposes and meanings. Sometimes close interaction between experts can be difficult over the phone or face to face. Interacting over e-mail is easier.

Now we will look at some examples of conflicting data. These are items which turn up when data comparisons are made. Not everything is an either-or case.
Sometimes we find that a restructuring of the data is in order to accommodate both the old and new data, because both could be correct.
- Typographical errors: Sometimes this is due to data being entered by keyboard incorrectly. Other times it can be due to using one locale's translations as a template for another locale's data.
- Regional differences: Regional and sub-regional differences may require the decision to keep both sets of data in different locales rather than choosing one over another. For example, German as written in Germany and in Switzerland frequently differs in spelling, and sometimes American English is different from British English.
- Context of usage: There is more than one way to sort German text. There is normal German sorting, and there is German phonebook sorting. For example, "öf" and "of" sort differently between normal German sorting and German phonebook sorting.
- Parts of speech: Some languages make a distinction between the way month names are written when cited independently, and when written as part of a date. For example, "March 2004" at the heading of a calendar would be written as just the name March, but the date "3rd March, 2004" would require a different form meaning "of March". CLDR accommodates such languages using a type value of "standalone" or "format", respectively.
- Standards vs. common use: CLDR uses the commonly used translation or format for the default. However, alternates are allowed in CLDR. Sometimes there is more than one right answer.
- Misunderstanding: Sometimes translators don't have enough knowledge about how CLDR works. Sometimes a translator will try to translate the format and characters of a date format instead of just the format. The localized characters of a date format are in a separate field of CLDR.
- Uncommon cases: There are some items and concepts in CLDR that are not commonly known by all translators.
  For example, how does a translator translate the word "Interlingua" (a language) when the translator has never heard of the Interlingua language? Sometimes translators guess, and these guesses will appear during the vetting process.
- Individual preferences: Some people have different preferences, and this can vary between translators. For example, the US military usually uses 24 hour time, but the rest of the United States uses 12 hour time.

The Common Locale Data Repository (CLDR) was developed in response to the need for standardized locales based on Unicode. CLDR provides key building blocks for software to support the world's languages. This data is used by a wide spectrum of companies for their software internationalization and localization: adapting software to the conventions of different languages and locations for such common tasks as formatting of dates, times, time zones, numbers, and currency values; sorting text; and choosing languages or countries by name, among others. The CLDR project collects and maintains locale data and uses the Locale Data Markup Language (LDML) to describe the data.

A locale is a string identifier that refers to specific linguistic and cultural preferences. These preferences can include date/time formatting, number formatting, spelling of certain names and many other items. These preferences can change over time due to cultural and political reasons. For example, modern Spanish sorts differently from older Spanish from the 1990s. In another example, some countries mandate how specific regions are referred to (this can happen when ownership of a region is in dispute). Of course, these types of preferences are not absolute.
For example, most people in the United States use 12 hour time, but there are some people in the US that use 24 hour time. There are some languages, like French and Japanese, that have published standards for how to sort those languages. There are some other languages that may not have enough exposure to other cultures to have names for certain places or concepts. There are many things that locale data can cover. It could cover industry-specific topics, like shoe size. CLDR limits its scope to a few specific topics.

This is a list of some of the topics that CLDR has translations and formats for.

Who can participate in CLDR? Anyone can get involved! It can be as simple as suggesting a fix for a translation that is misspelled, or it can be as big as submitting data for a whole new locale. We also welcome vetters that can verify that data is correct, tool writers and many other people interested in the topic of locale data. When submitting data to the CLDR project, references to standards, dictionaries or actual examples of everyday use frequently help to get the locale data vetted correctly. Please see the CLDR project web site for how to submit locale data and how to participate in the project.

Designed for most effective participation from people around the world
- Meetings
  - By phone, never face to face
  - Short, frequent
  - Allows preparation between meetings
  - Resolves conflicts and new feature requests
- Written
  - Email
  - Bug database submissions

Here is a summary of the latest CLDR release.
- Complete POSIX-format data with POSIX conversion tool
- More timezone translations
- Data for UN M.49 regions, including continents and regions
- Addition of ISO 4217 currency code changeovers
- Additional number and data tests to verify CLDR implementations
- Mappings from language to script and territory
- Various other fixes, additions, and extensions
- Survey tool for improved collection of data (read only to non-members)

Will show a demo of Survey Tool.

Here is an example of what CLDR looks like. In this snippet of CLDR data, translations are provided for some language, country and script display names. The keys come from other standards, such as ISO 639 and ISO 3166. As you can see, CLDR is written in XML. This data can be used for web site preferences.
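Because the display-name data is plain XML keyed by standard codes, any XML parser can read it. The following is a minimal sketch in Python, assuming a small inline fragment modeled on the ga.xml (Irish) example rather than a real CLDR file; the helper name display_names is hypothetical and not part of any CLDR tool.

```python
import xml.etree.ElementTree as ET

# Inline LDML fragment modeled on the ga.xml (Irish) slide example.
# Assumption: real CLDR files are much larger; this is illustrative only.
LDML = """<ldml>
  <localeDisplayNames>
    <languages>
      <language type="aa">Afar</language>
    </languages>
    <scripts>
      <script type="Arab">Araibis</script>
    </scripts>
  </localeDisplayNames>
</ldml>"""


def display_names(ldml_text, group, item):
    """Hypothetical helper: map each type code to its display name."""
    root = ET.fromstring(ldml_text)
    path = f"localeDisplayNames/{group}/{item}"
    return {el.get("type"): (el.text or "").strip() for el in root.findall(path)}


languages = display_names(LDML, "languages", "language")
scripts = display_names(LDML, "scripts", "script")
print(languages["aa"], scripts["Arab"])
```

A group that is absent from the fragment (territories here) simply yields an empty mapping, so a caller such as a web site language picker could fall back to another locale's names.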
CLDR data Tools and vetting Today and the future0&  &T+Latest Release: CLDR 1.&4Released: July 17, 2006 360 locales: 121 languages 142 territories 25% more data 17,000 new or modified data items Over 100 different contributors f'$RR Challenges $fComplex Formats Experts knowledgeable both in technology and a specific language Collation Exemplar characters Etc& Require close interaction of CLDR experts with language experts:Q# @Q#@ Getting InvolvedSimplest  anyone! Use CLDR Bug report / feature request More Involved Vetting, Assessment, Tools, Policies, Decisions, & Any Unicode member eligible to name representatives including country liaison membersp&   %!Example Country Process (Finland)""(\Finnish Ministry of Education made CLDR data a major goal, 2004-06 Research Institute for the Languages of Finland ( RILF aka  Kotus ) designated agency Two official languages (Finnish and Swedish) & four regional / minority languages (three Smi & Romani as spoken in Finland) to be covered Over 30 different parties represented: commercial, non-commercial, individuals Results expected to lead to new/revised national standardsVCPl PC/(;>{uFor More InformationUnicode http://www.unicode.org/ CLDR http://www.unicode.org/cldr/ LDML specification http://unicode.org/reports/tr35 lisam@us.ibm.com Z ZZ Z Z! Z ZZ!J d 0d 0%Ad 0Ut/0+,: ; < = ?@FLMOZ!'()*+,-}  $(  r  S Sy'  `0  y r  S LRy+  ` y H  0޽h ? f;v3ff___PPT10i.\N+D=' = @B +   B : P(  P~ P s *Ly'  `0  y  P s * y+  <$@ 0 y b P  .0e0eA     A@  A5% 8c8c     ?+ A)BCD|E||# "0e@       @ABC DEEFGHIJK5%LMNOPQRSTUWYZ[ \]^_ `abN E5%  N E5%  N F   5%    !"?N@ABC DEFFGHIJK5%LMNOPQRSTUWYZ[ \]^_ `ab@pP H P 0޽h ? 
f;v3ffO G ___PPT10' .@U+ED ' = @B D ' = @BA?%,( < +O%,( < +D ' =%(D ' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*P%(D' =-u6Bdiamond(in)*<3<*PD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*P%(D' =-u6Bdiamond(in)*<3<*PD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*P%(D' =-u6Bdiamond(in)*<3<*PD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*P,%(D' =-u6Bdiamond(in)*<3<*P,D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*P,=%(D' =-u6Bdiamond(in)*<3<*P,=+  6(  ~  s *(y'  `0  y x  c $y+ 00V y H  0޽h ? f;v3ff___PPT10i.6s+D=' = @B +r q4C`AdzZ'eM]}(0 9 D x .http://www.unicode.org/.htt  !"#$%'()*+՜.+,D՜.+,`    On-screen ShowgIBMռ5$O ,ArialTimes New RomanVerdana WingdingsTimesMonotype Sans Duospace WT JLetter GothicGlobeD CLDR: The Common Locale Data Repository Locales for the WorldAgendaAgenda'Locales does anything stay the same?Locales the many differences%Add the Universal Character Encoding The Need for Common Locale DataCommon Locale Data ProjectCLDR in use (partial list)AgendaWhat is a Locale?Types of Locale DataLocale Data Markup LanguageLDML Data CategoriesNamesNames Examples Characters Date FormatsCharacters / Dates ExamplesTime Zone NamesNumbersTime Zones / Currencies DelimitersDelimiters Example CollationCollation ExampleAgenda CLDR ToolsVetting Process for DataCauses of Conflicting DataAgendaLatest Release: CLDR 1.4 ChallengesGetting Involved"Example Country Process (Finland)For More Information  Fonts UsedDesign Template Slide Titles$ 8@ _PID_HLINKSA6http://www.unicode.org/http://www.unicode.org/cldr/http://www.unicode.org/cldr/ http://unicode.org/reports/tr35 http://unicode.org/reports/tr10 http://unicode.org/reports/tr35+http://www.unicode.org/cgi-bin/cldr-survey@http://demo.icu-project.org/ic_㱼lisamlisamu-bin/locexp?_=ga&SHOWLanguages= *___PPT9 8   $c2[]\? %O =UC CLDR: The Common Locale Data Repository Locales for the World6D',,3Lisa Moore George Rhoten Mark Davis Steven Loomis$ P)0P48*Agenda:Why CLDR? 
CLDR data Tools and vetting Today and the futuregAgenda:Why CLDR? CLDR data Tools and vetting Today and the future, 1 1iLLocales  does anything stay the same?''( "Theatre Center News: The date of the last version of this document was 2003t^3g20e. A copy can be obtained for $50,0 or 1.234,57 3@=. We would like to acknowledge contributions by the following authors (in alphabetical order): Alaa Ghoneim, Behdad Esfahbod, Ahmed Talaat, Eric Mader, Asmus Freytag, Avery Bishop, and Doug Felt." MZ  -       C  ~  $_#`1<Locales  the many differencesWLocales specify user preferences Linguistic and cultural differences Languages, scripts, writing systems, ordering, directionality, formatting, numbers, sizes Even in the same locale, interoperability issues across platforms Global economics has increased the need for greater globalization support in computer systems Everyone expects more! tEPZ$PPPEZ$Add the Universal Character Encoding%%(1Unicode: Unique character codes for all languagesa4The Need for Common Locale Data (6Computing environments often contain a variety of operating systems and software. Historically locale sensitive data research has been done by individuals and/or companies. Because of political changes, it is easy for locale data to become out of date. It is difficult to get complete agreement on correctness.7Z74'Common Locale Data Project<Began as Common XML Locale Repository (CXLR) developed by OpenI18N in 2003 CLDR project began in 2004 Hosted by Unicode Consortium http://www.unicode.org/cldr/ Goals: Common, necessary software locale data for all world languages Collect and maintain locale data XML format for effective interchange Freely availablehZ$ZZ$ZEd 0d8CLDR in use (partial list)jLibraries and Environments ICU  International Components for Unicode JDK  Java Development Kit Operating Systems Solaris AIX MacOS X Applications OpenOffice.org Acrobat ModernBillZF ZZ Z Z" Z+     $$ ((,,> dAgenda0Why CLDR? 
CLDR data Tools and vetting The futureF   5%What is a Locale?A locale is an identifier referring to linguistic and cultural preferences en_US, en_GB, ja_JP These preferences can change over time due to cultural and political reasons Introduction of new currencies, like the Euro Standard sorting of Spanish changes Many of these preferences have varying degrees of standardization 12 and 24 hour format in the United States This is a very broad topic KP$PMPR$PBP+$PPKMRB+>K)3$Types of Locale DataDates/time/calendar formats Number/currency formats Measurement system Collation specification Sorting Searching Matching Translated names for language, territory, script, timezones, currencies,& Script and characters used by a languageT_" ds_s 8cLocale Data Markup LanguageLocale data described using XML CLDR data uses LDML Structure of CLDR controlled by Locale Data Markup Language (LDML) specification http://unicode.org/reports/tr35 Xd $d 0pALDML Data Categories $P>MsDNames Provides translated display names for languages, territories, scripts, variants and keywords used in CLDR. Most of this information is at the language level, since it typically does not vary by territory, only language.*eaNames Examples(From ga.xml (Irish): <localeDisplayNames> <languages> <language type="aa">Afar</language> <language type="ab">Abcisis</language>& <scripts> <script type="Arab">Araibis</script>& <territories> <territory type="AD">Andra </territory> <territory type="AE">Aontas na nimrochta Arabacha </territory>&  F:#03&!l03& xI Characters Allows for creation of exemplar character sets. An exemplar set specifies the set of characters that must be present in order to properly render the language. Auxiliary exemplar set defines additional characters that may appear in foreign words or phrases. Lower case onlyD  `~O Date Formatsv Defines representation of calendars using various calendaring systems (Gregorian, Buddhist, Islamic, Japanese, etc.) 
Defines formatting for dates, times, eras and time zones wide, abbreviated, or narrow Date and time formats use patterns of letters to define proper formatting Week information Relative day/time translations (for example, yesterday, tomorrow, etc. )bPPgPZPgZbCharacters / Dates ExamplesFrom ga.xml (Irish): <characters> <exemplarCharacters> [a b-e f-i j-o p-u v-z] </exemplarCharacters> <exemplarCharacters type="auxiliary"> [    ! A W a k] </exemplarCharacters> </characters>& <dayContext type="format"> <dayWidth type="abbreviated"> <day type="sun">Domh</day> <day type="mon">Luan </day>& P<@PF$!C_ >a*STime Zone Names Based on Olson time zone database Localized display names for standard, daylight, and generic representations of time zones. Short and long display names.* UNumbers Specifies proper localized formatting of numeric quantities Decimal Scientific Currency Percentages Includes localized decimal, thousands separators, currency symbols, etc.^ <( I <(ITime Zones / CurrenciesvFrom ga.xml (Irish) and root.xml: <timeZoneNames> <zone type="Europe/Dublin"> <long> <standard>Men-Am Greenwich</standard> <daylight>Am Samhraidh na hireann </daylight> </long>& <numbers> <currencies> <currency type= EUR"> <displayName>Euro</displayName> <symbol> </symbol>& $P0PPP!FJY  K+ S  zK Delimiterss Specifies a primary and secondary of delimiter characters to be used for bracketing quotations in text* g g{LDelimiters ExampleFrom fr.xml (French): <delimiters> <quotationStart></quotationStart> <quotationEnd></quotationEnd> <alternateQuotationStart> </alternateQuotationStart> <alternateQuotationEnd> </alternateQuotationEnd> </delimiters> Z 0ZPZFfbf!bf*bf3bf'bb  vGText Layout Directionc Specifies the proper vertical and horizontal directionality of text in the given language.* [ [wHLayout ExampleSFrom ar.xml (Arabic): .TZF>b$IZ Collation Information in collation directory, not main XML version of Java/ICU collation syntax Unicode collation algorithm is the base 
http://unicode.org/reports/tr10 Allows tailoring of the UCA on a per locale basis.2 ZZ d 0[Collation ExampleFrom collations/root.xml: <collations validSubLocales="ga ga_IE id id_ID ms ms_BN ms_MY nl nl_BE nl_NL pt pt_BR pt_PT"> <collation type="standard"> <rules> ... <s></s> <t></t> <s></s> <t></t> <s></s> <t></t> <s></s> <t></t>& PPb@B b & b  b  #   b F bF b F  $b$(F( ,b,0F0 4b48F8 hhAgenda:Why CLDR? CLDR data Tools and vetting Today and the future0&  &T+Latest Release: CLDR 1.4Released: July 17, 2006 360 locales: 121 languages 142 territories 25% more data 17,000 new or modified data items Over 100 different contributors f'$RR Challenges $fComplex Formats Experts knowledgeable both in technology and a specific language Collation Exemplar characters Etc& Require close interaction of CLDR experts with language experts:Q# @Q#@ Getting InvolvedSimplest  anyone! Use CLDR Bug report / feature request More Involved Vetting, Assessment, Tools, Policies, Decisions, & Any Unicode member eligible to name representatives including country liaison membersp&   %!Example Country Process (Finland)""(\Finnish Ministry of Education made CLDR data a major goal, 2004-06 Research Institute for the Languages of Finland ( RILF aka  Kotus ) designated agency Two official languages (Finnish and Swedish) & four regional / minority languages (three Smi & Romani as spoken in Finland) to be covered Over 30 different parties represented: commercial, non-commercial, individuals Results expected to lead to new/revised national standardsVCPl PC/(;>{uFor More InformationUnicode http://www.unicode.org/ CLDR http://www.unicode.org/cldr/ LDML specification http://unicode.org/reports/tr35 lisam@us.ibm.com Z ZZ Z Z! Z ZZ!J d 0d 0%Ad 0Ut/0+,: ; < = ?@FLMOZ!'()*+,-w   (  x  c $$=t'  `0  t x  c $r+  P t   B(~ ?"6@ NNN?N` ` 6"0 H  0޽h ? f;v3ff___PPT10i.\N+D=' = @B +}  $(  r  S Sy'  `0  y r  S LRy+ P& y H  0޽h ? 
f;v3ff___PPT10i.\N+D=' = @B +o     "$,\ (  ,x , c $ey'  `0  y x , c $dfy+  ` y  , Z 3f!|?P0,$D 0P , C (A U_IPA?0N , C &A U_AD?0 0 \ , C 4AU_AngloSaxonP K V  , C .AU_Arabic?p 0 X  , C 0AU_Armenian z. V  , C .AU_Bengali 0]P  , C (A U_UCAS kX  , C 0A U_Cherokee0 p Z , C 2A U_Chinese1?P \ Z , C 2A U_Chinese2?p \  V , C .A U_Deseret \ = X , C 0AU_Ethiopic i9Z , C 2AU_Georgian?`  T , C ,AU_Greek?0 `  X , C 0AU_GujaratiP  X , C 0AU_Gurmukhip 2 T , C ,AU_Hebrew ' T , C ,AU_Hindi? c  ]Z , C 2AU_Japanese? sV , C .AU_Kannada0 v V , C .AU_Korean?P s Z , C 2AU_Malayalamp t R , C *AU_Ogham O R , C *AU_Oriya  rX , C 0AU_Persian? X , C 0AU_Russian?0  R  , C *AU_TamilP  T !, C ,AU_Telegup  X ", C 0AU_Dzongkha =  #, Npy?P W @& "00H , 0޽h ? f;v3ffKC___PPT10#.`+ksD' = @B D' = @BA?%,( < +O%,( < +D' =%(D' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*,%(D' =-u6Bdiamond(in)*<3<*,+   <(   ~   s *4y'  `0  y ~   s *y+ F y H   0޽h ? f;v3ff___PPT10i.)pJH+D=' = @B +  H0(  Hx H c $8y'  `0  y x H c $ y+  `v y H H 0޽h ? f;v3ff___PPT10i. 0f+D=' = @B +  D\(  Dx D c $+B#style.visibility<*D%(D' =-u6Bdiamond(in)*<3<*DD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*D4%(D' =-u6Bdiamond(in)*<3<*D4D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*D4G%(D' =-u6Bdiamond(in)*<3<*D4GD ' =%(DT ' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*DG_%(D' =-u6Bdiamond(in)*<3<*DG_D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*D_g%(D' =-u6Bdiamond(in)*<3<*D_gD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*Dgq%(D' =-u6Bdiamond(in)*<3<*DgqD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*Dqz%(D' =-u6Bdiamond(in)*<3<*DqzD*' =%(D' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*Dz%(D' =-u6Bdiamond(in)*<3<*DzD9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*D%(D' =-u6Bdiamond(in)*<3<*D+ (  b(  ~  s *y'  `0  y   c $y+ <$@ 0 y H  0޽h ? 
f;v3ff___PPT10.cmC+ED_' = @B D' = @BA?%,( < +O%,( < +Dk' =%(D' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*+9%(D' =-u6Bdiamond(in)*<3<*+9D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*9`%(D' =-u6Bdiamond(in)*<3<*9`D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*`%(D' =-u6Bdiamond(in)*<3<*`D*' =%(D' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*%(D' =-u6Bdiamond(in)*<3<*D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*%(D' =-u6Bdiamond(in)*<3<*D ' =%(DT ' =%(D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*%(D' =-u6Bdiamond(in)*<3<*D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*%(D' =-u6Bdiamond(in)*<3<*D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*6%(D' =-u6Bdiamond(in)*<3<*6D9' =4@BBBB%(D' =1:Bvisible*o3>+B#style.visibility<*6G%(D' =-u6Bdiamond(in)*<3<*6G+   <(  ~  s *>'  `0   ~  s *0+  `  H  0޽h ? 3ff~___PPT10i.d+D=' = @B +  @ <(  ~  s *ij'  `0   ~  s *ų+  `  H  0޽h ? f;v3ff___PPT10i.@ {+D=' = @B +% 0 UM`x(  x^ x S LD   G x c $0 x   yA locale is a string identifier that refers to specific linguistic and cultural preferences. These preferences can include date/time formatting, number formatting, spelling of certain names and many other items. These preferences can change over time due to cultural and political reasons. For example, modern Spanish sorts differently from older Spanish from the 1990s. In another example, some countries mandate how specific regions are referred to (this can happen when ownership of a region is in dispute). Of course, these types of preferences are not absolute. For example, most people in the United States use 12 hour time, but there are some people in the US that use 24 hour time. There are some languages, like French and Japanese, that have published standards for how to sort those languages. There are some other languages that may not have enough exposure to other cultures to have names for certain places or concepts. There are many things that locale data can cover. 
It could cover industry specific topics, like shoe size. CLDR limits its scope to a few specific topics. Scope of data limited to common system applications $zE3 E3H x 0j ? 3380___PPT10.@rT]]|3!5?1` e+.xDe9]7l.|(0 9 & x .http://www.unicode.org/.http://www.unicode.org/8http://www.unicode.org/cldr/8http://www.unicode.org/cldr/E8http1 Languages@http://demo.icu-project.org/icu-bin/locexp?_=ga&SHOWLanguages=1 Languages://www.unicode.org/cldr/8http://www.unicode.org/cldr/>http://unicode.org/reports/tr35>http://unicode.org/reports/tr35@http://unicode.org/reports/tr10 >http://unicode.org/reports/tr10>http://unicode.org/reports/tr35>http://unicode.org/reports/tr35Thttp://www.unicode.org/cgi-bin/cldr-surveyThttp://www.unicode.org/cgi-bin/cldr-survey&ICU Locale Explorer~http://demo.icu-project.org/icu-bin/locexp?_=ga&SHOWLanguages=10Languages&ICU Locale Explorer~http://demo.icu-project.org/icu-bin/locexp?_=ga&SHOWLanguages=10Languages/ 0DArial GotXq0B 0"DTimes New Roman0B 0 DVerdanaw Roman0B 0"0DWingdingsRoman0B 0@DTimesngsRoman0B 0 PDMonotype Sans Duospace WT JB 01`DLetter GothicDuospace WT JB 02F 0.  @n?" dd@  @@`` 80< / , %)+-16 8:;IJLOPQRSTVW]cdfklw"x{Tb$M{'isT+)!b$7)XG"b$ s_F6!`Z)b$ߵB ]g[ * 0e0e     A@  A5% 8c8c     ?1 d0u0@Ty2 NP'p<'pA)BCD|E||s " 0e@        @ABC DEEFGHIJK5%LMNOPQRSTUWYZ[ \]^_ `abN E5%  N E5%  N F   5%    !"?N@ABC DEFFGHIJK5%LMNOPQRSTUWYZ[ \]^_ `ab MMM33ff̙@8 5}Y ʚ;?2ʚ;g4DdDd B 0ppp@ <4ddddLpC 00q<4!d!dLpC 00q g4<d<d B 0Uph p<4BdBdLpC 00qc ___PPT10 8\   c[\]( $ ___PPT9x8   $c2[]X\? %O =TC CLDR: The Common Locale Data Repository Locales for the World6D',,3Lisa Moore George Rhoten Mark Davis Steven Loomis$ P)0P48*Agenda:Why CLDR? CLDR data Tools and vetting Today and the futuregAgenda:Why CLDR? CLDR data Tools and vetting Today and the future, 1 1iLLocales  does anything stay the same?''( "Theatre Center News: The date of the last version of this document was 2003t^3g20e. A copy can be obtained for $50,0 or 1.234,57 3@=. 
We would like to acknowledge contributions by the following authors (in alphabetical order): Alaa Ghoneim, Behdad Esfahbod, Ahmed Talaat, Eric Mader, Asmus Freytag, Avery Bishop, and Doug Felt." MZ  -       C  ~  $_#`1<Locales  the many differencesWLocales specify user preferences Linguistic and cultural differences Languages, scripts, writing systems, ordering, directionality, formatting, numbers, sizes Even in the same locale, interoperability issues across platforms Global economics has increased the need for greater globalization support in computer systems Everyone expects more! tEPZ$PPPEZ$Add the Universal Character Encoding%%(1Unicode: Unique character codes for all languagesa4The Need for Common Locale Data (6Computing environments often contain a variety of operating systems and software. Historically locale sensitive data research has been done by individuals and/or companies. Because of political changes, it is easy for locale data to become out of date. It is difficult to get complete agreement on correctness.7Z74'Common Locale Data Project<Began as Common XML Locale Repository (CXLR) developed by OpenI18N in 2003 CLDR project began in 2004 Hosted by Unicode Consortium http://www.unicode.org/cldr/ Goals: Common, necessary software locale data for all world languages Collect and maintain locale data XML format for effective interchange Freely availablehZ$ZZ$ZEd 0d8CLDR in use (partial list)jLibraries and Environments ICU  International Components for Unicode JDK  Java Development Kit Operating Systems Solaris AIX MacOS X Applications OpenOffice.org Acrobat ModernBillZF ZZ Z Z" Z+     $$ ((,,> dAgenda0Why CLDR? 
CLDR data Tools and vetting The futureF   5%What is a Locale?A locale is an identifier referring to linguistic and cultural preferences en_US, en_GB, ja_JP These preferences can change over time due to cultural and political reasons Introduction of new currencies, like the Euro Standard sorting of Spanish changes Many of these preferences have varying degrees of standardization 12 and 24 hour format in the United States This is a very broad topic KP$PMPR$PBP+$PPKMRB+>K)3$Types of Locale DataDates/time/calendar formats Number/currency formats Measurement system Collation specification Sorting Searching Matching Translated names for language, territory, script, timezones, currencies,& Script and characters used by a languageT_" ds_s 8cLocale Data Markup LanguageLocale data described using XML CLDR data uses LDML Structure of CLDR controlled by Locale Data Markup Language (LDML) specification http://unicode.org/reports/tr35 Xd $d 0pALDML Data Categories $P>MsDNames Provides translated display names for languages, territories, scripts, variants and keywords used in CLDR. Most of this information is at the language level, since it typically does not vary by territory, only language. An example: ICU Locale Explorer 4!*d 0eaNames Examples(From ga.xml (Irish): <localeDisplayNames> <languages> <language type="aa">Afar</language> <language type="ab">Abcisis</language>& <scripts> <script type="Arab">Araibis</script>& <territories> <territory type="AD">Andra </territory> <territory type="AE">Aontas na nimrochta Arabacha </territory>&  F:#03&!l03& xI Characters Allows for creation of exemplar character sets. An exemplar set specifies the set of characters that must be present in order to properly render the language. Auxiliary exemplar set defines additional characters that may appear in foreign words or phrases. Lower case onlyD  `~O Date Formats Defines representation of calendars using various calendaring systems (Gregorian, Buddhist, Islamic, Japanese, etc.) 
Defines formatting for dates, times, eras and time zones wide, abbreviated, or narrow Date and time formats use patterns of letters to define proper formatting Week information Relative day/time translations (for example, yesterday, tomorrow, etc. ) An example: ICU Locale ExplorerdPPgPzPgzd 0bCharacters / Dates ExamplesFrom ga.xml (Irish): <characters> <exemplarCharacters> [a b-e f-i j-o p-u v-z] </exemplarCharacters> <exemplarCharacters type="auxiliary"> [    ! A W a k] </exemplarCharacters> </characters>& <dayContext type="format"> <dayWidth type="abbreviated"> <day type="sun">Domh</day> <day type="mon">Luan </day>& P<@PF$!C_ >a*STime Zone Names Based on Olson time zone database Localized display names for standard, daylight, and generic representations of time zones. Short and long display names.* UNumbers Specifies proper localized formatting of numeric quantities Decimal Scientific Currency Percentages Includes localized decimal, thousands separators, currency symbols, etc.^ <( I <(ITime Zones / CurrenciesvFrom ga.xml (Irish) and root.xml: <timeZoneNames> <zone type="Europe/Dublin"> <long> <standard>Men-Am Greenwich</standard> <daylight>Am Samhraidh na hireann </daylight> </long>& <numbers> <currencies> <currency type= EUR"> <displayName>Euro</displayName> <symbol> </symbol>& $P0PPP!FJY  K+ S  zK Delimiterss Specifies a primary and secondary of delimiter characters to be used for bracketing quotations in text* g g{LDelimiters ExampleFrom fr.xml (French): <delimiters> <quotationStart></quotationStart> <quotationEnd></quotationEnd> <alternateQuotationStart> </alternateQuotationStart> <alternateQuotationEnd> </alternateQuotationEnd> </delimiters> Z 0ZPZFfbf!bf*bf3bf'bb  Z Collation Information in collation directory, not main XML version of Java/ICU collation syntax Unicode collation algorithm is the base http://unicode.org/reports/tr10 Allows tailoring of the UCA on a per locale basis.2 ZZ d 0[Collation ExampleFrom collations/root.xml: <collations 
validSubLocales="ga ga_IE id id_ID ms ms_BN ms_MY nl nl_BE nl_NL pt pt_BR pt_PT"> <collation type="standard"> <rules> ... <s></s> <t></t> <s></s> <t></t> <s></s> <t></t> <s></s> <t></t>& PPb@B b & b  b  #   b F bF b F  $b$(F( ,b,0F0 4b48F8 hhAgenda:Why CLDR? CLDR data Tools and vetting Today and the future0&  &T+Latest Release: CLDR 1.4Released: July 17, 2006 360 locales: 121 languages 142 territories 25% more data 17,000 new or modified data items Over 100 different contributors f'$RR Challenges $fComplex Formats Experts knowledgeable both in technology and a specific language Collation Exemplar characters Etc& Require close interaction of CLDR experts with language experts:Q# @Q#@ Getting InvolvedSimplest  anyone! Use CLDR Bug report / feature request More Involved Vetting, Assessment, Tools, Policies, Decisions, & Any Unicode member eligible to name representatives including country liaison membersp&   %!Example Country Process (Finland)""(\Finnish Ministry of Education made CLDR data a major goal, 2004-06 Research Institute for the Languages of Finland ( RILF aka  Kotus ) designated agency Two official languages (Finnish and Swedish) & four regional / minority languages (three Smi & Romani as spoken in Finland) to be covered Over 30 different parties represented: commercial, non-commercial, individuals Results expected to lead to new/revised national standardsVCPl PC/(;>{uFor More InformationUnicode http://www.unicode.org/ CLDR http://www.unicode.org/cldr/ LDML specification http://unicode.org/reports/tr35 lisam@us.ibm.com Z ZZ Z Z! Z ZZ!J d 0d 0%Ad 0Ut/0+,: ; < = ?@FLMOZ!'()*+,-  L<(  L~ L s *y'  `0  y ~ L s *y+  ` y H L 0޽h ? 3ff~___PPT10i.d+D=' = @B +  ` x<(  x~ x s *'  `0   ~ x s *+  `  H x 0޽h ? 3ff~___PPT10i.d+D=' = @B +  @ <(  ~  s *ij'  `0   ~  s *ų+  `  H  0޽h ? 
f;v3ff___PPT10i.@ {+D=' = @B +r |8s~OeX8lRoot EntrydO) Pictures?Current User&>SummaryInformation(U      !"#$%&'()*+,-./01234567:&=      !"#$%'()*+,-./012345<=>?@ABDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxz{|}~  !"#$%'()*+ts/tr10 http://unicode.org/reports/tr35+http://www.unicode.org/cgi-bin/cldr-survey@http://demo.icu-project.org/ic"_㱼 Karl KellyKarl Kellyu-bin/locexp?_=ga&SHOWLanguages=
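
The Names example from ga.xml shows how LDML stores display names as XML elements keyed by a `type` code. As a rough illustration of how an application might read such data, here is a minimal sketch in Python using the standard-library `xml.etree.ElementTree`. It assumes the fragment is wrapped in an `ldml` root element; the sample string is an abbreviated stand-in for the real file, and the helper name `display_names` is hypothetical, not part of any CLDR tool.

```python
import xml.etree.ElementTree as ET

# Abbreviated, hypothetical LDML fragment modeled on the ga.xml (Irish)
# Names example in the slides; a real CLDR file holds far more data.
LDML_SAMPLE = """<ldml>
  <localeDisplayNames>
    <languages>
      <language type="aa">Afar</language>
    </languages>
    <scripts>
      <script type="Arab">Araibis</script>
    </scripts>
  </localeDisplayNames>
</ldml>"""


def display_names(ldml_text, category, element):
    """Return a {type code: display name} mapping for one names category.

    category is the container element (for example "languages") and
    element is the repeated child element inside it (for example "language").
    """
    root = ET.fromstring(ldml_text)
    path = f"localeDisplayNames/{category}/{element}"
    return {
        node.get("type"): (node.text or "").strip()
        for node in root.findall(path)
    }


languages = display_names(LDML_SAMPLE, "languages", "language")
scripts = display_names(LDML_SAMPLE, "scripts", "script")
print(languages)
print(scripts)
```

The same lookup pattern would apply to the `territories` container, since all three categories share the `type` attribute convention shown in the slides.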