Localisation Focus - The International Journal of Localisation Paper DOwnloads
Home About LRC LRC News Resources

Localisation Focus

The International Journal of Localisation

Localisation Focus–The International Journal of Localisation provides a forum for localisation professionals and researchers to discuss and present their localisation-related work, covering all aspects of this multi-disciplinary field, including software engineering and HCI, tools and technology development, cultural aspects, translation studies, human language technologies (including machine and machine assisted translation), project management, workflow and process automation, education and training, and details of new developments in the localisation industry. Proposed contributions are peer-reviewed thereby ensuring a high standard of published material. Localisation Focus–The International Journal of Localisation is distributed worldwide to libraries and localisation professionals, including engineers, managers, trainers, linguists, researchers and students. Indexed on a number of databases, this journal affords contributors increased recognition for their work. Localisation-related articles, book reviews, perspectives, insights and correspondence are all welcome.

The Sinhala Collation Sequence and its Representation in UNICODE

Volume 5 Issue 1


Authors: A.R. Weerasinghe, D.L Herath, K. Gamage


The alphabet of a language is perhaps the first thing we learn as users. The alphabet of our mother tongue would be the first alphabet we ever learn. And yet, a closer look reveals that there is much about such an alphabet that we have not explicitly specified anywhere. The Sinhala alphabet order is a prime example. We use it, recite it and yet would be hard pressed to define it explicitly.

Sinhala is spoken in all parts of Sri Lanka except some districts in the north, east and centre by approximately 20 million people. It is spoken by an additional 30,000 (1993) people in Canada, Maldives, Singapore, Thailand and United Arab Emirates. Sinhala is classified as an Indo-European language and used as an official language.

The UNICODE Collation Algorithm (UCA) is an attempt to make explicit the collation sequence of any language expressed in the UNICODE (or any other) coding system. In order to express the Sinhala collation sequence (alphabetical order) using UCA, the authors undertook the task of identifying unresolved issues facing the unambiguous definition of the order. This paper first describes the issues identified through this study, suggesting alternate solutions and recommending one of them. Finally, it sets out the recommended collation sequence for Sinhala in the form of the UNICODE collation specification. The outcome of this process is a unique and unambiguous expression of the Sinhala collation sequence which could be tested using existing tools and software environments.

Keywords: UNICODE, Sinhala, Collation, UNICODE Collation Algorithm, Localisation, Internationalisation

Click Here to Download



For more information on Localisation Focus–The International Journal of Localisation, contact LRC@ul.ie

Statement of Publication Ethics and Publication Malpractice

Localisation Focus – The International Journal of Localisation is committed to the academic community and the lay world in ensuring ethics in publication and quality of articles in publication. Plagiarism is strictly forbidden and papers found to be plagiarized will be removed or not published.

All authors submitting material to Localisation Focus – The International Journal of Localisation affirm that the content and associated materials are original and do not infringe the copyright of anyone. By submitting content, authors also confirm that the submission has full consensus of all the authors and it was neither submitted nor published previously.


© Copyright 2013 Localisation Research Centre (LRC). All rights reserved.