-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathalphabets.json
6902 lines (6902 loc) · 704 KB
/
alphabets.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
[
{
"script": "Adlm",
"name": "Adlam",
"unicode_pdf": null,
"family": "African",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": false,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "bottom",
"ligatures": "none",
"direction": "rtl",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Adlam script is used for writing the Fulani language in Guinea. Prior to the creation of this script, the Fulani language was written in the Latin and Arabic scripts. In the 1980s, two teenage brothers, Ibrahima and Abdoulaye Barry, wondered why their language did not have its own script and set about creating one. They named their script ‘Adlam’ because the first four characters in the repertoire represent A, D, L, and M.</p> <p>Following the creation of the script, the two brothers initially produced hand-written textbooks to aid in its dissemination. They first taught their other family members to read and write, then the women and children in their village. Along with the script, the books taught lessons in water hygiene and basic medical care. Gradually, the script grew in popularity and the brothers set up learning centres in Togo, Senegal, Benin, and even amongst Fulani speakers in New York. Eventually, the means were available to print the script, and a newspaper and a number of printed books were published. However, the rapid growth of Adlam attracted the attention of the Guinean government, who were opposed to the new script which they could not read, and Ibrahim was imprisoned for three months. After his release, in 2007, he moved to America and took a job with a software company. He used the expertise and financial savings he acquired there to create software for Adlam, and by 2012 three fonts and a keyboard had been created.</p> <p>The script is written from right to left and letters can be joined to one another (cursive) or separate. There are 28 letters in all, and each letter has four forms, including upper- and lower-case forms. The script is an alphabet, with both consonant and vowel characters. Adlam also contains a set of digits from 0-9 and a number of diacritics for marking vowel length, <a href=http://scriptsource.org/cms/scripts/page.php?item_id=glossary#gemination>gemination</a>, and foreign sounds. The glottal stop is also indicated with a diacritic.</p>"
},
{
"script": "Afak",
"name": "Afaka",
"unicode_pdf": null,
"family": "American",
"type": "syllabary",
"white_space": "discretionary",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Afaka script was a syllabary created around 1910 for writing the Nenge (also called Ndjuka, or Eastern Maroon Creole) language, an English <a href=http://scriptsource.org/cms/scripts/page.php?item_id=glossary#creole>creole</a> spoken by about 23,000 people in Suriname and French Guyana. It is the only known indigenous script for a creole language. The script was named for its inventor, a Nenge man called Afaka Atumisi. It is now believed to have fallen into disuse.</p> <p>Syllable structure in the Nenge language allows for initial consonant clusters containing a nasal and a stop, and for doubly articulated consonants (two consonants produced simultaneously at different places in the oral cavity). Vowels can also be complex or nasalized. If each of these is considered to be a single sound, as is the case for speakers of the language, syllables can be analyzed as exclusively consisting of a consonant + vowel. In the Afaka script, single letters were used to represent these complex consonant + vowel combinations. However, this resulted in many contrasts in the language being unrepresented in writing. The same letters were used to write bo bow and bṍ bone, for example. Syllables starting with the sounds b, d, dj,f, g, l, m, n, s, and j were written with the same letter regardless of the following vowel. The vowels i and e were not represented when following the consonants l, m, s, and w. Tone is also contrastive in the language, but was not written. na is and ná is not vary only in tone, so were written the same. Given the predictable subject matter for which the script was generally used - most texts repeated many fixed religious expressions - this underrepresentation did not necessarily hamper understanding when reading. However, it became more problematic when users needed to express novel or less predictable expressions, which the deficiencies in the script left ambiguous.</p> <p>One punctuation mark was used, a vertical line which marks the end of a clause or a sentence, roughly equivalent to a comma and a full stop. Inter-word spacing was used, but not by everyone.</p> <p>Afakas script came to the attention of the wider world in 1915, when Brother Bernard, a Catholic missionary in Suriname, observed a man reading a book full of unknown symbols. The man was Afakas brother-in-law Abena, one of about 30 people to whom Afaka had taught the script. Abena asked Br. Bernard to dictate a number of liturgical Catholic prayers in Nenge, which he then recorded in the script. Much of what is known about the script is attributed to Br. Bernards colleague, Father Morssink, who went to considerable lengths to study and propagate the script, with limited success.</p> <p>The limited success of the script is believed to have been largely due to its underrepresentation of many phonological contrasts. It was likely due in part to its associations with European missionaries; some Nenge speakers saw it as a means to convert them to a European faith. In addition, literacy had never been a part of the Nenge society, and many saw no reason to change their traditional way of life, especially at a perceived cost to their spiritual welfare. A more controversial view is that the symbols already existed prior to the creation of the script, but that they had religious, rather than phonetic, significance. Many of them resemble acrophonic rebuses - stylized pictures of something which begins with the sound the symbol stands for - from Africa, where they may have been used in religious ceremonies. Some Nenge may have felt that using the sacred symbols for mundane or secular purposes would be irreverent.</p>"
},
{
"script": "Aghb",
"name": "Caucasian Albanian",
"unicode_pdf": null,
"family": "European",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "unspecified",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Caucasian Albanian script was used to write the language of Caucasian Albania (which is not the same as present-day European Albania). The Caucasian Albanian language was a dialect of Old Udi, which is closely related to the Eastern Samur branch of Lezgian.</p> <p>The Caucasian Albanian script was an alphabet written from left to right with spaces between words. There were fifty-two letters in the alphabet. It is thought to have been based on Greek writing.</p>"
},
{
"script": "Ahom",
"name": "Ahom",
"unicode_pdf": null,
"family": "Southeast Asian",
"type": "abugida",
"white_space": "none",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": null,
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Ahom script was used by members of the Tai Ahom community in India for writing the Ahom language, an extinct member of the Tai-Kadai language family. Ahom has been written for at least 500 years, and possibly much longer. The Ahom script is derived from Old Mon, ultimately of Brahmi origin. The Ahom people descended from the Shan ethnic group of Myanmar but migrated to what is now Assam, India, in the 13th century, taking their language, script and cultural traditions with them. There they established the Ahom Kingdom, where Ahom was the official language until the absorption of the region by the British Empire in 1826. After this, the Assamese language and script were promoted and have now almost entirely replaced Ahom. The Ahom language is occasionally used in religious rituals, and there have been some recent revival efforts by the ethnic Ahom community in Assam; however, there were a number of irregularities in the script which have hampered efforts to decipher manuscripts or attain knowledge of the spoken language.</p> <p>Ahom is an abugida; each consonant letter represents a CV syllable where the vowel is a unless indicated otherwise by means of vowel diacritics. Vowel diacritics were written above, below, to the right of, or flanking the preceding consonant. Initial vowels having no preceding consonant were written using a silent dummy consonant letter, to which vowel diacritics could be attached as appropriate.</p> <p>The script originally contained 19 consonant letters, but during the 18th century a further 5 were added for writing words borrowed from other languages. There are 14 vowel diacritics, two which represent diphthongs and one which represents the sound am or a:m.</p> <p>Although the Ahom language was tonal, tones were not written, so it is no longer known how many tones were used or how they were arranged. Consonant clusters were often not written in full; rather, only the first element in the cluster was written, and the reader was to supply the missing parts. The initial consonant of a word was often not written if it was the same as the final consonant of the previous word. In a sequence of words beginning with the same letter, the words were contracted so that the initial letter was only written once.</p> <p>The first Ahom font was created in 1920 for the Ahom-Assamese-English dictionary compiled by Golap Chandra Barua. Although the accuracy of the dictionary itself has been called into question, as Baruas claimed fluency in Ahom was later disproved, the font was used in a number of other influential publications and has since become an authoritative model for letter shapes. The origin of the canonical order according to which the dictionary is organized is not known.</p>"
},
{
"script": "Arab",
"name": "Arabic",
"unicode_pdf": null,
"family": "Middle Eastern",
"type": "abjad",
"white_space": "between words",
"open_type_tag": "arab",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": null,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "bottom",
"ligatures": "required",
"direction": "rtl",
"direction_notes": "RTL bidirectional",
"sample": null,
"sample_img": null,
"description": "<p>Arabic writing is the second most broadly-used script in the world, after the Latin alphabet. It descended from the Nabataean abjad, itself a descendant of the Phoenician script, and has been used since the 4th century for writing the Arabic language. Since the words of the Prophet Muhammed can only be written in Arabic, the Arabic script has traveled far and wide with the spread of Islam and came to be used for a number of languages throughout Asia, Africa and the Middle East. Many of these are non-Semitic languages, so employ very different sound systems from spoken Arabic, and as a result the script has had to be adapted and is used slightly differently by speakers of different languages. Many African languages use an Arabic-based transcription system called <span class=Em>Ajami</span>, which is different from the original Arabic script. Romance languages such as Mozarabic or Ladino are also sometimes written in a modified Arabic script, called <span class=Em>Aljamiado</span>.</p> <p>Many variations on the script have developed over time and space, but these can be broadly classified into two groups; an angular <span class=Em>kufic</span> style which was originally used for stone inscriptions and which commonly employs no diacritics, and the <span class=Em>naskh</span> style which is more commonly used, more rounded in form, and governed by a set of principles regulating the proportions between the letters. There are a number of variant styles included in this group, including those used in Arabic calligraphy.</p> <p>Arabic letters are read from right to left. The script is an abjad; only the consonants are required to be written. The basic set of letters consists of twenty-eight consonants, although some languages use many more letters than this. Some letters can represent a consonant or a long vowel, depending on the context. An additional set of diacritics exists for writing short vowels, but its use is optional. Conventionally, for writing the Arabic language, long vowels are written and short vowels are omitted. Where a vowel is not written, readers of the script must use their knowledge of the language and its phonology to insert the appropriate vowel sound. Diacritics also exist for marking <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#gemination target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> gemination</a> (consonant lengthening).</p> <p>Arabic is a unicameral script; there is no upper and lower case. It is also obligatorily cursive, that is, all the letters in a word must be connected wherever possible. Some letters (a ا, d ﺩ, ð ﺫ, r ﺭ, z ﺯ and w ﻭ) cannot always be joined. Each letter has three contextual forms depending on whether it appears at the beginning, in the middle, or at the end of a word, as well as a basic stand-alone shape which is used when it appears at the end of a word <span class=Em>and</span> is preceded by one of the letters listed above which cannot join on the left side. Many letters look broadly similar to one another, differing only in the placement of one or more dots above or below the letter. For example, the letters representing ħ, g and x are all the same, except that ħ is unmarked, g has a dot in the loop of the letter and x has a dot above it. These dots are called <span class=Em>ijam</span> and form an integral part of the letter.</p> <p>A number of ligatures are used in handwriting, but only one l + a is compulsory ﻻ. A ligature is commonly used for writing the word <span class=Em>Allah</span> God .ﷲ</p> <p>Latin punctuation is commonly used, with a few exceptions. The Arabic comma, question mark, and percent sign are script-specific, and there is also an Arabic triple-dot mark. In addition, there are script-specific honorific marks which may be placed above a persons name in order to confer honor or a blessing upon them. There are also Koranic annotation signs, mostly to provide guidance in chanting and singing sacred text.</p> <p>The Arabic script employs two sets of numbers, <span class=Em>Standard</span> and <span class=Em>Eastern Arabic</span>. Latin numbers, which derive from a medieval set of Arabic numbers, are also used, particularly in North Africa. Like Arabic letters, numbers are written from right to left, but with the highest value on the left, as with Latin numbers, so they must be read from left to right. There is also a numeral system known as <span class=Em>abjad numerals</span>, in which each of the 28 letters of the Arabic abjad is assigned a numeric value of units, tens or hundreds. These are combined to create larger numbers.</p>"
},
{
"script": "Aran",
"name": "Arabic (Nastaliq variant)",
"unicode_pdf": null,
"family": "Middle Eastern",
"type": "abjad",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": false,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "bottom",
"ligatures": "required",
"direction": "rtl",
"direction_notes": "RTL bidirectional",
"sample": null,
"sample_img": null,
"description": "<p>The Nastaliq variant of Arabic script is a cursive script, written horizontally but at a sloping angle where the longer the sequence of characters is in a calligraphically-connected cluster, the taller it becomes. It has short verticals and long horizontal strokes. Due to the slope and calligraphic style, placement of nuqtas (dots) on the base characters is forced to vary quite a bit among different letters and even different forms of the same letter. Words are separated by a space. </p> <p>The name (Nastaliq) is a composite of the names for the \\\\\\Naskh\\\\\\\" and \\\\\\\"Taliq\\\\\\\" styles of writing. Nastaliq is used in Afghanistan"
},
{
"script": "Armi",
"name": "Imperial Aramaic",
"unicode_pdf": null,
"family": "Middle Eastern",
"type": "abjad",
"white_space": "discretionary",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "rtl",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Aramaic script was used for writing the Aramaic language, which was the trade language of the Middle East from about 1000 BC to about 1000 AD. Aramaic writing is derived from the Phoenician script. Because the evolution from one to the other was a continuous process over about 2000 years it is difficult to divide it neatly into uniquely Phoenician and uniquely Aramaic blocks; however, it is generally agreed that a divergence into two distinct scripts was evident by about the 8th century BC. Both the Phoenician and Aramaic scripts were the antecedents of a large and geographically diverse family of writing systems. Many of the scripts used today for writing Indo-European languages are part of this family. Again, because the two scripts were so closely related it is not clear exactly which writings systems descended from each. Conventionally, the scripts used in Western Europe and the Mediterranean are called Phoenician-derived and those used in the Middle East and Central/South Asia are called Aramaic-derived.</p> <p>Imperial Aramaic was both the official language of the Persian Empire from the 5th to the 3rd centuries BC, and the script used for writing this language. It was used throughout modern-day Iran, Afghanistan, Pakistan, Thrace and Macedonia, Iraq, northern Saudi Arabia, Jordan, Palestine, Israel, Lebanon, Syria, and parts of Egypt as far as Libya. Imperial Aramaic writing was so influential that it survived the collapse of the Persian Empire which had initially disseminated it, and continued to be used until the 2nd century AD. By the end of the 3rd century, variant forms of the script had diverged into distinct scripts such as Syriac, Nabataean and Palmyran. The form of Imperial Aramaic which changed the least is now used for writing the Hebrew language.</p> <p>Imperial Aramaic was written from right to left, with or without spaces between words. The script was an abjad; each of the twenty-two letters represented a consonant. Because the interpretation of some words was ambiguous when the vowels were not written, Aramaic scribes began using a select few of the existing consonantal letters to indicate long vowels, first at the end of words, then inside words. Letters having this dual consonant/vowel function are (called <span class=Em>matres lectionis</span>). The letters <span class=Em>waw</span> and <span class=Em>yudh</span> could represent either the consonants w and j respectively, or the long vowels u/o and i/e respectively. Similarly, the letter <span class=Em>aleph</span> represented the consonant ʔ at the beginning of a word, or the long vowel a/e elsewhere. One script-specific punctuation mark is used in writing Imperial Aramaic, a section sign to mark topic divisions in texts.</p> <p>Imperial Aramaic orthography was highly regular. Often the spellings of words more closely reflected their etymology than their pronunciation.</p>"
},
{
"script": "Armn",
"name": "Armenian",
"unicode_pdf": null,
"family": "European",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "armn",
"complex_positioning": "no",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": false,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "required",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Armenian alphabet was created around 405AD by Mesrop Mashtots. He reportedly studied a number of scripts, including Greek and Syriac, before having a dream in which he saw elements of these scripts integrated into one system. The Geez script has also been suggested as a possible influence. Armenia adopted Christianity in the early fourth century, but later began reverting to paganism. Mashtots believed that if the Bible could be written in Armenian the country would have the means to maintain a Christian faith and culture at a time when the Persian empire was attempting to assimilate the country. It also enabled Armenian educational institutions to be established, as well as a large body of early Armenian literature. The alphabet has been used in the country since its inception, with some modifications.</p> <p>There were originally 30 consonant and 6 vowel letters in the alphabet, but in the 12th century two more letters were added for representing aw and f, bringing the total to 38. A third symbol was also introduced to represent the word and in lower case form. Much later, in the early 20th century, orthographic reform under the Soviet government led to two spelling conventions being used. Broadly speaking, Mesropian orthography is used throughout the Armenian diaspora (which makes up the majority of Armenian speakers), and Reformed orthography is used within Armenia. The difference between the two has been likened to that between British and American English, though with stronger political connotations.</p> <p>Early Armenian writers relied heavily on ligated form of letters in order to save space. Five of these are still in common use, to combine the symbol for m with n, ɛ, i and x, and to combine the symbol for ɛ with v. </p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/r3/2r/r32rqthawp_ArmeLigas.png height=159 width=546 /></p> <p>A ligature also used to exist which combined the symbols representing ɛ + w/v, but it is used so commonly that followers of the Reformed orthography consider it a distinct letter rather than a ligature, bringing their total number of symbols to 39.</p> <p>The script is an alphabet; consonants and vowels are written with independent letters. It is read from left to right. The traditional order of the letters composes an acrostic prayer. It is generally believed that this order follows the order of the Greek alphabet, although it has also been suggested that Mashtot followed a number symbolism found in early kabbalistic traditions when ordering the letters. </p> <p>Armenian is written with spaces between words. There are four punctuation marks unique to Armenian, two of which follow a word to indicate that it has been abbreviated, or to add emphasis, and two marks which are placed after and above the last vowel, respectively, as a question mark and an exclamation mark. Non-Latin marks are used to perform the functions of full stop, semicolon, word-joining and word-splitting hyphen. The script employs upper and lower case.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/r3/2r/r32rqthawp_ArmenianPunctuation.png height=116 width=190 /></p> <p>Written Armenian contains no script-specific numbers, rather, each letter was assigned a numeric value. In recent years and particularly in print form, Latin numbers have increasingly been used.</p>"
},
{
"script": "Avst",
"name": "Avestan",
"unicode_pdf": null,
"family": "Middle Eastern",
"type": "alphabet",
"white_space": "discretionary",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "optional",
"direction": "rtl",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Avestan script was used from the 5th to the 13th century AD for writing the Avestan language, an Eastern Iranian language which is now only known from its use as the language of Zoroastrian religious texts called <span class=Em>Avesta</span>, although it is thought that at one time it was probably a natural language in everyday use. There are no surviving examples of written Avestan prior to its use as a liturgical language, and it is thought that the Avestan script was created particularly for the purpose of writing religious texts. At that time, many of the Iranian languages were written in the Pahlavi script, but this contained a number of ambiguous symbols and did not represent vowels, so was unsuitable for representing a religious language, the pronunciation of which was important. The script was also used to write religious commentaries in the Middle Persian language; these are called <span class=Em>Pazend</span>.</p> <p>Avestan is written from right to left using thirty-seven consonant and sixteen vowel letters. Many of the consonant letters were taken from the various forms of the Pahlavi script, and some of the vowels were taken from the Greek. Some new letters were created by the addition of diacritics to existing forms, and some appear to be original creations. One letter is used only in Pazend texts. There are four optional ligatures in use, for representing sk, šc, št, and ša. Aside from these, letters are not normally joined in a cursive way, although they may be written close enough together to touch.</p> <p>Punctuation was rarely used in Avestan texts, although words were separated by a dot. Scholars of the script in the 19th century created a punctuation system for use in transcribing texts, including symbols functioning as a colon, semicolon and full stop, and abbreviation and repetition marks. These have been encoded in the Avestan block in Unicode; the Unicode proposal document anticipates that modern users of the script will use them in Avestan texts as well as in transcriptions.</p>"
},
{
"script": "Bali",
"name": "Balinese",
"unicode_pdf": null,
"family": "Insular Southeast Asian",
"type": "abugida",
"white_space": "between words",
"open_type_tag": "bali",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": true,
"case": null,
"split_graphs": true,
"status": "Current",
"baseline": "hanging",
"ligatures": null,
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Balinese script is used for writing the Balinese language spoken on the Indonesian islands of Java and Bali. It is derived from the Old Kawi script, and is ultimately of Brahmic descent. It is very similar to the Javanese script in form and behaviour; some consider them to be typological variants of one another. The Balinese script is an abugida in which each of the consonants has an inherent a vowel. These can be modified by the addition of diacritics above, below or alongside the consonant, to change the inherent vowel to one of the other 15 vowels in the Balinese inventory. Some of these vowels are split vowels, that is, they are written with a combination of marks in more than one position relative to the consonant. Syllable-initial vowels are represented by independent vowel characters. At least two of these can function as consonants, in that they can be followed by the virama to represent a syllable-final glottal stop.</p> <p>There are 34 reported Balinese consonant characters, although some of these only enjoy very limited use, even to the extent that only a single word attests to them. To some degree this is attributed to the large number of loan words from Sanskrit or Old Javanese, which possessed larger consonant inventories than modern spoken Balinese. Each consonant has an alternate <span class=Em>pangangge</span> form for use in consonant clusters; a virama is also used in consonant clusters or to represent a syllable-final consonant. </p> <p>Balinese punctuation is used for beginning and ending a section of text, for roles corresponding roughly to those of the Latin comma, colon and full-stop, and, in some texts, for marking holy letters, particularly when using Sanskrit words in payers. A linebreaking hyphen is also used. Also sometimes included in discussions of Balinese characters is the notation system used for writing music, which is based on the vowel characters.</p> <p>Historically, Balinese has been inscribed into stone, or written on palm leaves. Traditionally, the religious texts written on palm leaves were considered to be sacred and could not be read by everyone; now, the palm leaves themselves are still considered sacred, although the contents is not, and can be read by anyone who is able, through the media of printed books. New literature in the script is uncommon, however traditional literature is published on a limited scale. Additionally, it is used for public signage on roads, at entrances to villages, and on government buildings. Community reading groups called <span class=Em>Sekaha Pesantian</span> also exist for the purpose of reading the Balinese script in a social context, commonly in song form.</p>"
},
{
"script": "Bamu",
"name": "Bamum",
"unicode_pdf": null,
"family": "African",
"type": "syllabary",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": null,
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Bamum script of Cameroon was devised at the end of the C19th following a dream in which the Bamum King Njoya was inspired to provide a writing system for his people by creating a series of pictographs. The script was something of a communal production; the Bamum people were reportedly invited to supply the King with a number of simple symbols, from which he chose more than 500 to use in the Bamum script. Initially, the script was logographic, but it underwent a series of changes influenced by the rebus principle. After about 30 years, full phonetic representation had been realized, and the script had developed into a syllabary composed of just 80 base characters.</p> <p>Both spoken and written Bamum are based around the (C)V syllable although syllable-final consonants, limited to p, t, m, n, ŋ, can also occur. The script is written from left to right, allegedly to remove any Arabic associations. It also employs 5 punctuation marks, roughly corresponding to the Latin full stop, comma, colon, semi-colon and question mark, as well as a differentiating mark, <span class=Em>nʒɛmli</span>, which differentiates two homonyms. <span class=Em>Nʒɛmli</span> is also placed in front of proper nouns by way of identification.</p> <p>Two diacritics, <span class=Em>ko<sup>ʔ</sup>ndon</span> and <span class=Em>tukwentis</span>, are also used. <span class=Em>Ko<sup>ʔ</sup>ndon</span> can be added to any of the 80 base characters. Most commonly, it serves to reduce the number of heteronyms in the language by adding a glottal stop - which is distinctive in spoken Bamum - to the syllable. Sometimes it can be used to change the initial consonant of the syllable, or, less commonly, to indicate a semantic rather than phonetic difference. <span class=Em>Tukwentis</span> can be added to 13 of the base characters, in most cases to mute the inherent vowel in a syllable. It is also written above the vowel o to change it to ə.</p> <p>The Bamum script was used for administrative purposes, documentation and education, until the 1930s saw King Njoya exiled by German and French colonial powers who also closed the schools and attempted to destroy documents written in the script. It has now fallen into disuse, but the <a href=http://www.bamumscript.org target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> Bamum Scripts and Archives Project</a> exists to preserve and digitally archive manuscripts written in the script and prevent its total loss.</p>"
},
{
"script": "Bass",
"name": "Bassa Vah",
"unicode_pdf": null,
"family": "African",
"type": "alphabet",
"white_space": "unspecified",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Bassa Vah script is used to write the Bassa language spoken in Liberia and Sierra Leone, and by Bassa speakers in Brazil and the Caribbean. The name originated from the Bassa word Vah, meaning to throw a sign; the script developed from the earlier Bassa communication system of signs made from chewed leaves or carved into trees and left in set locations where they would be found and interpreted. As this system developed, it was employed by the Bassa people to avoid slave traders, so was suppressed by colonial powers and became almost extinct.</p> <p>Widely hailed as the only indigenous African alphabet, the script consists of 23 consonants and 7 vowels. Bassa Vah is the only indigenous Liberian script to represent tones. This is done by means of 5 tone marks, which are written inside the vowel characters. </p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/66/9c/669cd3g9f3_bassatones.jpg height=170 width=250 /></p> <p>Characters are sorted in an order called <span class=Em>enni-ka-se-fa</span> - the names of the first four consonants. Amongst some users, this sequence is also the name of the script itself. Vowels are sorted at the end of the alphabet.</p> <p>The alphabet is <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#bicam target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> bicameral</a>, although the lower case has only been developed in recent years, as demonstrated by their absence in <a href=http://www.evertype.com/standards/iso10646/pdf/bassa.pdf target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> Michael Eversons draft chart</a>, and their presence in the subsequent <a href=http://www.uniboa.org/bassalanguage.html target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> Bassa Vah Association chart</a>.</p> <p>There are no unique Bassa Vah digits; Latin digits are used. Similarly, the Latin full stop and colon are used, in addition to a + symbol, which also performs a full stop function.</p>"
},
{
"script": "Batk",
"name": "Batak",
"unicode_pdf": null,
"family": "Insular Southeast Asian",
"type": "abugida",
"white_space": "none",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": false,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": "vertical (LTR) and horizontal (LTR)",
"sample": null,
"sample_img": null,
"description": "<p>The Batak script is used to write the six Batak languages (Toba, Karo, Dairi, Mandailing, Simalungun and Angkola) spoken collectively by approximately 3 million people on the Indonesian island of Sumatra. It is one of several scripts indigenous to the Indonesian archipelago, descended from the Old Kawi script, which in turn is derived from the Pallava, and ultimately the Brahmi, script.</p> <p>It is an abugida; its 18 consonant letters each contain an inherent a vowel, which is modified by means of diacritics for other vowels. Syllable-final consonants are indicated by means of a pangolat (virama). Consonants do not form conjuncts. Punctuation is not normally used, but a number of bindu characters are used to disambiguate similar words or phrases or to separate sections of text. These marks may each have a number of variants and may perform more of an ornamental than a grammatical function. A further sign (pustaha) exists to separate a title from the following body of text, which normally begins on the same line.</p> <p>Batak is read from left-to-right. Some have observed it being written vertically bottom-to-top along the length of a piece of bamboo. These bamboo stalks were placed next to each other and bound together with string in such a way as to form a bundle of plates inscribed with horizontal lines of right-running text.</p> <p>The script was standardised in the 1850s, and was used in education until the declaration of Bahasa Indonesia as the national language in 1945, since which time literacy and usage has declined. However, something of a Batak renaissance in recent years has led to teaching materials once again being made available for use in schools.</p>"
},
{
"script": "Beng",
"name": "Bengali (Bangla)",
"unicode_pdf": null,
"family": "Indic",
"type": "abugida",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": true,
"case": null,
"split_graphs": true,
"status": "Current",
"baseline": "hanging",
"ligatures": "required",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Bengali (also called Bangla) script is used for writing the Bengali language, spoken by over 180,000,000 people mostly in Bangladesh and India. It is also used for a number of other Indian languages including Sylheti and, with one or two modifications, Assamese. It is a Brahmic script although its exact derivation is disputed. Bengali writing shares some similarities with the Dravidian-language scripts, particularly in the shapes of some vowel letters, but it is generally more similar to the Aryan-language scripts, in particular Devanagari.</p> <p>Bengali is an abugida - that is, every consonant letter represents a syllable containing an inherent vowel - written from left to right. There is no upper- or lower-case. There are thirty-five consonant letters and eleven independent vowel letters, two of which represent the diphthongs oj and ow. Each vowel letter also has a diacritic form which combines with a consonant to modify the inherent vowel. These can be written to the left or the right of, or above or below the consonant. Some are <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#multig target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> digraphs</a>, written with part of the letter before and part after the consonant. Some vowel diacritics take contextual forms, for example <span class=Em>u</span> changes shape following <span class=Em>g</span> or <span class=Em>ʃ</span>. Some consonant-vowel combinations have a traditional ligated form which is often visually dissimilar to the base consonant+vowel diacritic form. These are more commonly found in handwriting, but occasionally in printed text also.</p> <p>There are three modifier signs, <span class=Em>khanda-ta</span> (represents a final dental stop), <span class=Em>anusvara</span> (nasalizes a final velar), <span class=Em>visarga</span> (represents a voiceless breath after a vowel), and <span class=Em>chandra bindu</span> (nasalizes a vowel). There is also a symbol, <span class=Em>hasanta</span>, to mute the inherent vowel.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/j6/sp/j6spskdfbr_BengaliSigns.png height=192 width=249 /></p> <p>Bengali writing differs from most Indian abugidas in that the inherent vowel in a syllable is either ɔ or o. In general, it is pronounced ɔ except when followed by i, j, u or w, in which contexts it is pronounted o. However this is not always predictable. In addition, at the end of a syllable (at the end of or within a word), the vowel is often not pronounced even when this is not indicated with <span class=Em>hasanta</span>. So the letters <span class=Em>m</span>+<span class=Em>t</span> can represent either mɔt opinion or moto approximately, but <span class=Em>g</span>+<span class=Em>t</span> always represents gɔto past.</p> <p>Consonants regularly form conjuncts, normally of two letters, more rarely of three or four. Often these involve little or no change in letterform, but in some cases one letter changes unrecognizably from its isolated form and in a few cases both letters do so. Conjuncts are written with the parts alongside each other or in stacks. Pronunciation of conjuncts is not always what would be expected from the sum of their parts, due to sound changes in spoken Bengali which are not represented in writing. The <span class=Em>j</span>+<span class=Em>ñ</span> conjunct is pronounced g+nasalization at the start of a word or gg elsewhere, for example <span class=Em>bijña</span>, pronounced biggo/õ wise. Combinations of C+<span class=Em>m</span> are pronounced at the start of a word as C+nasalization, or as CC elsewhere. The conjunct <span class=Em>kʃa</span> represents khɔ at the start of a word and khhɔ within a word, as in the case of the womans name written <span class=Em>lakʃmi</span> but pronounced lokkhi.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/j6/sp/j6spskdfbr_BengaliConjuncts.png height=170 width=360 /></p> <p>Some other spelling conventions also reflect archaic pronunciation. The three sibilant letters which represented three separate sounds ʂ, ʃ and s in Brahmi writing have been retained in Bengali orthography, all representing ʃ except before t, n, r or l, when they all assimilate to s. </p> <p>Bengali employs a set of script-specific digits 0-9.</p> <p>The Devanagari full stop, a vertical line, is used in Bengali alongside Latin punctuation. </p>"
},
{
"script": "Bhks",
"name": "Bhaiksuki",
"unicode_pdf": null,
"family": "Indic",
"type": "abugida",
"white_space": "unspecified",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "unspecified",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": "unspecified",
"sample": null,
"sample_img": null,
"description": "<p>Bhaiksuki is an extinct script used for writing Buddhist texts in the Indian state of Bihar. It is also known as the Arrow-Headed script due to the shape of the letters, many of which are capped with one or more triangular “arrows”. Little is known about this script, but the <a href=http://www.uni-marburg.de/fb10/iksl/indologie/forschung/drittmittelprojekte/bhaiksuki target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> Arrow-Headed Script Project</a> at Philipps-Universität Marburg, Germany is working to change that.</p>"
},
{
"script": "Blis",
"name": "Blissymbols",
"unicode_pdf": null,
"family": "Artificial",
"type": "logo-syllabary",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": false,
"reordering": false,
"case": null,
"split_graphs": false,
"status": "Historical",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Blissymbols was developed after the Second World War by Charles Bliss, to facilitate communication between speakers of different languages. It has also been used to help people with severe speech impediments to communicate without the need to speak, although Bliss did not approve of this usage. It is used in particular by people who struggle both to speak and to read or spell. The system consists of 4,000 symbols representing various abstract and concrete concepts. The symbols are not related in any way to the pronunciation of a word, so can be used in conjunction with any spoken language.</p> <p>The symbols are derived from standard geometric shapes and are either pictographs, such as a jagged line to represent the concept <span class=Em>electricity</span>, or ideographs, such as a triangle to represent the concept <span class=Em>creation</span>.</p> <p>Symbols can be combined to represent additional concepts, for example, the symbols for <span class=Em>house</span> and <span class=Em>money</span> can be combined to represent <span class=Em>bank</span>. In these cases, there is a correct, standardized order in which to write the combinations of symbols.</p> <p>Symbols can also be superimposed on top of one another, for example, the symbol for <span class=Em>mouth</span> can be superimposed onto <span class=Em>nose</span> to represent <span class=Em>taste</span>. Some symbols have an arbitrary shape, such as the articles <span class=Em>a/an</span> and <span class=Em>this</span>.</p> <p>The meaning of a symbol is determined not only by its shape but by its size. For example, a full-size circle represents the concept <span class=Em>sun</span>, and a half-size circle represents <span class=Em>mouth</span>. Sometimes visually similar symbols are also semantically related, for example a full-size upside-down ‘v’ represents <span class=Em>action</span>, but the same symbol half the size represents <span class=Em>activity</span>.</p> <p>The position and orientation of a symbol can also affect its meaning. A horizontal dash at the <a href=page.php?item_id=Glossary#baseline>baseline</a> represents <span class=Em>ground</span>, but slightly higher it represents <span class=Em>sky</span>. A single symbol is also used to represent both <span class=Em>steam</span> (pointing upwards) and <span class=Em>rain</span> (pointing downwards).</p> <p>Punctuation marks, similar to those used in the Latin script but sometimes differing in proportion, are also used. A space half the width of a symbol is left between a word and any following punctuation mark. A space the width of a symbol is left between two words.</p> <p>Blissymbols does not benefit from the use of a wide variety of fonts, as consistency is important in order for the symbols to be understood.</p>"
},
{
"script": "Bopo",
"name": "Bopomofo",
"unicode_pdf": null,
"family": "East Asian",
"type": "syllabary",
"white_space": "between words",
"open_type_tag": "bopo",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Bopomofo is the colloquial name given to the Zhuyin Fuhao phonetic system for transcribing Chinese. The script is used particularly often by students of the Mandarin language. It was developed in 1913 by the Commission on the Unification of Pronunciation (讀音統一會) in China to extend the use of Mandarin. It is still used by many books, newspapers and journals in Taiwan, to annotate the Taiwanese pronunciation of Chinese Han characters or spell out Taiwanese words for which no character exists, but has been largely replaced by Pinyin romanization in mainland China. The script is also used as the primary script for writing some of Taiwans minority languages including Atayal, Taroko, Paiwan and Yami.</p> <p>The script is a semi-syllabary based on syllable rhymes (normally the simple or complex vowel and optional closing consonant in a syllable) and onsets (the initial consonant preceding the rhyme). Of the 37 signs employed in Bopomofo writing, 21 represent onset consonants and 16 represent rhymes. Tones are written using diacritics.</p> <p>The shapes of the symbols are taken from parts of Chinese characters containing the relevant sound. The stroke order with which the signs are written is also based on that of Chinese characters.</p>"
},
{
"script": "Brah",
"name": "Brahmi",
"unicode_pdf": null,
"family": "Indic",
"type": "abugida",
"white_space": "discretionary",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": true,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "hanging",
"ligatures": "required",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Brahmi script is ancestral to most of the scripts of South Asia, Southeast Asia, and to some Central Asian scripts. The name Brahmi actually does not refer to a single, discrete script as such; general practise is to use the term to refer to any script in the family now known as Brahmic, up until approximately 400 AD, at which point they became differentiated enough to be given their own names. Brahmic writing was originally used for writing early dialects of the Prakrit language, but spread widely during the period of Indian cultural expansion in the 1st millenium AD and has since provided the underlying design for over sixty scripts used by languages from the Indo-Aryan, Dravidian, Austro-Asiatic and Tibeto-Burman language families. By about 1000 AD the various varieties of Brahmic writing had diverged to the point that they were all independent scripts whose common ancestry was not always immediately apparent.</p> <p>The earliest writing system in India was the Indus (Proto-Indic) script used by the Harrappan civilization of the Indus valley around 2500-1500BC. After this system fell from use, there appears to have been a long hiatus from which period we do not have any evidence of writing, until approximately 500 BC, at which time two writing systems appeared. One of these was Kharoshthi; the other was Brahmi. The origins of Brahmi are not known for certain, but it is thought to have been influenced by the Imperial Aramaic script.</p> <p>Brahmi writing has been attested primarily in inscriptions on graves and buildings, most famously the edicts made by Emperor Ashoka around 200 BC. It was not until 1836 that these were decoded, by James Prinsep, which enabled the deciphering of other inscriptions.</p> <p>The Brahmi script was generally written from left to right, with some rare exceptions. The script was an abugida; there were thirty-three consonant letters each representing a CV syllable containing the inherent vowel a. Other post-consonantal vowels were written using one of nine vowel diacritics (together with the inherent vowel, five pairs of long and short vowels could be represented) around the consonant letter. There were also nine letters for writing vowels which were not preceded by a consonant. Special conjunct letters existed for writing consonant clusters. Many of these features have been retained by descendant scripts from Brahmi. It is notable that the head-mark, which is characteristic in many varieties of a number of scripts derived from Brahmi, is missing from Brahmi writing itself. It is thought that this head-mark arose incidentally as a result of writing with a reed pen, and later became an integral part of the script.</p>"
},
{
"script": "Brai",
"name": "Braille",
"unicode_pdf": null,
"family": "Artificial",
"type": "alphabet",
"white_space": "discretionary",
"open_type_tag": "brai",
"complex_positioning": "no",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": false,
"reordering": false,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Braille script was created in 1821 by the Frenchman Louis Braille, who had become blind at age three. The system is based on a code called night writing, which had been developed by Charles Barbier to enable soldiers to communicate silently and without the need for a light. Barbier’s system used letters comprised of up to twelve raised dots, and proved too complex to be useful to the military.</p> <p>Louis Braille modified this concept to produce a script which used letters comprised of six dots. A dot may be raised at any or all of six positions. The script is not simply a letter by letter transcription of English words into Braille; all sixty-three possible combinations of dot positions are used, representing the twenty-six letters of the Latin alphabet, plus some common letter combinations and abbreviations. Some symbols also represent commonly used words, such as <span class=Em>but</span>, <span class=Em>this</span>, <span class=Em>day</span> etc.</p> <p>The sixty-fourth configuration, in which no dots are raised, leaves a space between words.</p> <p>Braille is usually written using larger letters than standard printed text, to enable the reader to more easily make the distinction between letters.</p> <p>Braille has also been adapted for use with languages that do not use the Latin script, generally following the Latin transcription conventions of that script. </p>"
},
{
"script": "Bugi",
"name": "Buginese",
"unicode_pdf": null,
"family": "Insular Southeast Asian",
"type": "abugida",
"white_space": "between phrases",
"open_type_tag": "bugi",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Buginese (also known as the Lontara) script is used for writing the Bugis, Makasar, and Mandar languages of Sulawesi in Indonesia. It is related to the other Brahmic scripts indigenous to the Indonesian archipelago. Like many in this family it is read from left to right. It is an abugida consisting of 18 consonants which can be marked with diacritics above, below or alongside to change the quality of the inherent a vowel. There is also a letter to represent a word-initial vowel, which can be modified by means of the same diacritics; unmarked it represents a. One ligature exists, to represent iya. Buginese writing employs one punctuation mark, roughly corresponding in usage to the Latin full-stop and comma combined. Unique Buginese digits are unknown; Latin digits are preferred. As is the case with many Brahmi scripts, letters are conventionally sorted according to place of articulation.</p> <p>The Buginese script is <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#defect target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> defective</a> in some ways; syllable-final nasals and glottal stops are never written; nor is <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#gemination target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> gemination</a>, which in speech is distinctive, marked. In many cases prenasalisation is likewise unwritten. The result is that any one letter may theoretically be interpreted in 9 different ways, although not all of these are necessarily legitimate. The Buginese are famous for creating word games and riddles exploiting the scripts defective aspects.</p> <p>Buginese literature was studied extensively in the C19th by the Dutch missionary B. F. Matthes who published a Buginese-Dutch dictionary in 1874 with a supplement in 1889, as well as a grammar. Although in the wake of European colonisation the Buginese script has largely been replaced by the Latin script, it was reportedly in some use in 1983. Today the script is used in Bugis and Makasar for ceremonial purposes, such as weddings, and for writing personal documents such as letters and notes. It is also used for printing traditional Buginese literature. Nevertheless it is considered to be under increasing threat as a living script.</p>"
},
{
"script": "Buhd",
"name": "Buhid",
"unicode_pdf": null,
"family": "Insular Southeast Asian",
"type": "abugida",
"white_space": "unspecified",
"open_type_tag": "buhd",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "optional",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Buhid script is used to write the Buhid language, spoken by about 8,000 people in the Mindoro region of the Philippines. It is an indigenous abugida script of Brahmic origin. It is proposed that the Buhid, Hanunoo and Tagbanwa scripts share common origins with the Tagalog script, an extinct script from the same region, because of the many features they have in common. All four scripts are of the abugida type, with an a vowel inherent in each consonant. The inherent vowel can be modified by the addition of a diacritic, called a <span class=Em>kulit</span>. In the Buhid script the <span class=Em>kulit</span> takes the form of a horizontal line, added above (to produce i) or below (to produce u) the syllable, often forming a ligature. It is read horizontally, from left to right, and top to bottom.</p>"
},
{
"script": "Cakm",
"name": "Chakma",
"unicode_pdf": null,
"family": "Indic",
"type": "abugida",
"white_space": "unspecified",
"open_type_tag": "none",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "optional",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Chakma script (also called Ojhapath, Ojhopath, or Ajhapath) is used for writing the Chakma language spoken in the Chittagong Hill Tracts of Bangladesh and in the Seven Sister States of Northeastern India. There are slight variations in the forms of the letters used in the two countries. The script is related to Mon Khmer and Myanmar, and many of the letters closely resemble Myanmar letters. It is also being adapted and extended for writing Tanchangya, a related language spoken in Bangladesh.</p> <p>The script is descended from Brahmi, and is an abugida. Each consonant contains an inherent vowel, which can be modified by attaching one of thirteen vowel diacritics to the consonant. Vowels which are pronounced at the start of a syllable are written either with four independent vowel letters (representing a, i, u, e) or by adding the appropriate vowel diacritic to the independent letter a.</p> <p>A virama symbol called <span class=Em>maayaa</span> is used to mute the inherent vowel so that consonant clusters can be represented. Consonant clusters which combine the letters ya, ra, la, wa, na, ba, ma and ha can also be written using conjuncts. In the past there was a wider range of letters which could be written using conjuncts but these are no longer so common.</p> <p>There are three symbols to indicate phonological processes; these are commonly found in scripts derived from Sanskrit. <span class=Em>Candrabindu</span> and <span class=Em>anusvara</span> indicate nasalization of a vowel. <span class=Em>Visarga</span> indicates aspiration of a preceding consonant. These can sometimes be used in combination.</p> <p>The Chakma script uses two Brahmic punctuation marks: <span class=Em>danda</span> and <span class=Em>double danda</span>, which indicate the end of a sentence and (in poetry) the end of a verse, respectively. There are also two script-specific punctuation marks: a question mark and a mark to indicate the end of a section. Script-specific numerals from 0-9 are used, alongside Bengali and, to a lesser extent, Myanmar, numerals.</p> <p>Historically, the script was used to write folk music, epic poems, Buddhist teachings, and for documenting traditional medicinal practices. Today, literacy in Chakma is low, although it is taught up to primary level in the Chakma Autonomous District in Mizoram, India. Most young Chakma speakers do not use the script; rather, they read and write their language using Bengali or Latin letters.</p>"
},
{
"script": "Cans",
"name": "Unified Canadian Aboriginal Syllabics",
"unicode_pdf": null,
"family": "American",
"type": "abugida",
"white_space": "between words",
"open_type_tag": "cans",
"complex_positioning": "no",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": false,
"reordering": null,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Unified Canadian Aboriginal Syllabics are not, strictly speaking, a single script, but a family of related scripts used for writing a number of Algonquin and Inuit languages (previously also the Athabaskan languages) indigenous to Canada. However, they are encoded in Unicode as a unified block. They are used almost exclusively in Canada, for writing the Cree, Ojibwe, Inuktitut, and occasionally Blackfoot languages.</p> <p>A Canadian syllabic script was first created in 1840 by the British missionary James Evans for writing the Swampy Cree dialect. He was influenced in part by two scripts intended to be read by the blind; James Freres alphabet and Thomas Lucass embossed shorthand system. Evans had also learned Pitman shorthand during his previous work as a merchant, and it is believed that he based the idea of modifying each letter by rotation on this system. Some scholars also believe he was influenced by the shapes of Devanagari letters.</p> <p>Canadian syllabics are abugidas; each symbol represents a consonant+vowel combination, with the exception of the bare vowel letter. Vowels which occur at the beginning of a syllable are written with a triangular symbol, rotated through 4 series of 90°, normally to represent the vowels a, o, i and e. Some languages which use the script have a different vowel inventory so assign different values to the rotations.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/kh/2l/kh2lafw9dz_UCASVowels.png height=63 width=291 /></p> <p>Each consonant+vowel symbol can be modified in four ways, again, generally to represent the vowels a, o, i and e. Unlike most other abugidas, vowel modification is not indicated by the addition of diacritics but by rotating the whole symbol. The way in which letters are modified depends on whether they have a symmetric shape or not. Letters which have a symmetric shape are rotated anti-clockwise 90° to represent i when upright, and a, e, o respectively with each subsequent rotation. Letters which are not symmetric also represent i when upright, but are inverted 180° to represent o. e and a are represented using the mirror images of i and o respectively.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/kh/2l/kh2lafw9dz_UCASConsonants.png height=65 width=360 /></p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/kh/2l/kh2lafw9dz_UCASConsonants2.png height=69 width=367 /></p> <p>A fifth modification can be performed on letters to make them superscript. In this form they represent a syllable-final consonant with no following vowel. Some languages do not use the superscript letters but instead have an additional set of symbols, unrelated to the consonant+vowel symbols, to represent syllable-final consonants.</p> <p>Additional marks are used to modify the length of a vowel or to indicate phonological processes such as labialization and aspiration. However, these marks are not used consistently from one language to another - any given combination of letter + combining mark may represent two different sounds in two different languages, so in Unicode each combination is encoded separately for each language in which it is used, rather than simply as a base character and diacritic. The symbol to represent vowel length is called a point; as this symbol is not used consistently even within one language, texts are sometimes referred to as pointed or unpointed depending on whether the writer has used the symbol or not.</p> <p>Spaces are used between words, and there is a symbol for a full stop. Latin punctuation (with the exception of full stop) is also sometimes used.</p> <p>The extent to which Canadian syllabics are used varies between languages. It is considered obsolete or seriously endangered among the Athabaskan group of languages and Blackfoot. However, the Canadian government generally encourages the use of the script, especially in areas with large Inuit populations. In Nunavut and Nunavik, syllabics are the official script alongside Latin, are used at the municipal, territorial / provincial, and federal levels, and are taught in schools. They are also used by businesses, and in the media and communications sector. Some languages have established standardization committees to standardize the syllabics as used for their language.</p>"
},
{
"script": "Cari",
"name": "Carian",
"unicode_pdf": null,
"family": "Middle Eastern",
"type": "alphabet",
"white_space": "none",
"open_type_tag": "cari",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": null,
"reordering": false,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Carian script was used between the 7th and 1st centuries BC in what is now the Aegean region of Turkey, for writing the Carian language. It was also used in the Nile delta, where Carians were fighting for the Egyptian pharoahs.</p> <p>The script was an alphabet comprising forty-five letters. Of these, phonetic values have been assigned to nine vowels and seventeen consonants. There is some speculation as to what sounds the remaining nineteen letters represented. Visually, the letters bear a close resemblance to Greek letters. Decipherment was initially attempted on the assumption that those letters which looked like Greek represented the same sounds as their closest visual Greek equivalents. However it has since been established that the phonetic values of the two scripts are very different. For example the <span class=Em>theta</span> θ symbol represents ‘th’ in Greek but ‘q’ in Carian.</p> <p>Carian was generally written from left to right, although Egyptian writers wrote primarily from right to left. It was written without spaces between words. </p>"
},
{
"script": "Cham",
"name": "Cham",
"unicode_pdf": null,
"family": "Southeast Asian",
"type": "abugida",
"white_space": "unspecified",
"open_type_tag": "cham",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "hanging",
"ligatures": "required",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Cham script is a Brahmi-derived abugida used for writing the Cham language. There are two major dialects of Cham, spoken collectively by about 230,000 people in two isolated groups in Vietnam and Cambodia, both of which once had a thriving literary tradition dating from the 8th century. The Cambodian Cham population used to be much larger, but it is estimated that up to 500,000 died as a result of the Khmer Rouges policies in the 1970s. Under this regime the Cham were also forced to use the Cambodian language exclusively.</p> <p>The script is written from left to right. The letters hang from the <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#baseline target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> baseline</a>. Syllables in the Cham language follow a strict CV(C) pattern. Most syllable-final consonants are represented by extending the right-most stroke of the symbol as shown in the example below, but some are modified by a diacritic. One consonant character can be used at the beginning or end of the syllable without any change in form.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/ng/nq/ngnq875xj9_ChamGa.png height=78 width=162 /></p> <p>With the exception of nasals, each consonant contains an inherent a vowel. Nasal consonants inherently contain the vowel ɨ. In each case, the inherent vowel can be overridden by the addition of 17 vowel diacritics. Of these vowels, 6 can also take a full (non-diacritical) form when they occur at the start of a syllable. The remaining 11 are represented syllable-initially by means of the independent a vowel modified with the appropriate vowel diacritic.</p> <p>There are four glides in the Cham phonemic inventory; these are each represented in writing with an independent symbol when they occur at the start of a syllable, but they also have a ligating form which can be added to another consonant to produce kja, pla, swa etc.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/ng/nq/ngnq875xj9_ChamGlide.png height=87 width=428 /></p> <p>Numerals 0-9 are also represented in the Cham script.</p> <p>Speakers of the Eastern and Western dialects use distinct styles of the script, with the Eastern dialect being written in a more rounded style (known as <span class=Em>akhăr thrah</span>) and the Western dialect in a more fluid style. Currently, the Cham script is used much more widely by speakers of Eastern Cham; the majority of Western Cham speakers are Muslims and use an adaptation of the Arabic script alongside Cham. In Vietnam, where the Eastern dialect is predominantly spoken, bilingual education in Vietnamese and Cham exists at primary school level, and textbooks have been published using the script.</p>"
},
{
"script": "Cher",
"name": "Cherokee",
"unicode_pdf": null,
"family": "American",
"type": "syllabary",
"white_space": "between words",
"open_type_tag": "cher",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Cherokee script was created by a Cherokee man named Sequoyah (also known as George Guess or George Gist) who believed that the key to the colonialists success and power lay within their talking leaves, the written correspondence they used to exchange information and ideas. Although Sequoyah was illiterate, he noted the shapes of the letters in an English Bible and based the shapes of the Cherokee letters on them. For this reason, many of the letters resemble Latin letters and numbers, although there is no relationship between their sounds in English and in Cherokee. Sequoyah spent 12 years devising the Cherokee syllabary, and presented it formally in 1821. It achieved almost instant popularity and by 1824 most Cherokee were literate in the script.</p> <p>In 1828 Sequoyah collaborated with Rev. Samuel A. Worcester to modify the script to facilitate the creation of a printing press. The letters they created together are somewhat different from Sequoyahs original set, and are the letters in use today.</p> <p>The script is written from left to right. There are 85 letters in the syllabary. Apart from one letter representing s, each symbol represents one syllable consisting of a vowel only, or of a consonant / consonant cluster followed by a (long or short) vowel, or of a consonant followed by a vowel and a final h or ʔ. The syllabary is normally ordered in a chart with the first six symbols representing vowels, followed by consonant + vowel combinations loosely based on the order of the English alphabet. Sequoyah arranged the letters in a different order, which is no longer widely used. Letters cannot be combined to form ligatures; consonant clusters which cannot be written with a single letter are written using a combination of syllables, in which the vowels are considered to be dummy vowels and are not pronounced. Words are separated by spaces or by a dot just above the <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#baseline target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> baseline</a>, and Latin punctuation is used with the exception of the full stop. There is a script-specific symbol to indicate the end of sentences. Case is not used in the script; although certain letters are sometimes written larger than others this is for typographic effect rather than casing.</p> <p>There are four tones in spoken Cherokee, and a single syllable can slide from one tone to another. The tone of a word does not necessarily affect its meaning, so there is no method for marking tone in writing. In cases where the tone does affect the meaning of a word, it is determined from context which meaning is intended.</p> <p>Sequoyah also invented a system of writing numerals. However, as Latin numerals were already widely used by the Cherokee community these were never popular and have since dropped entirely from use.</p> <p>From the 1870s until the early 20th century, the US government implemented formal assimilation policies with the intention of civilizing Native Americans. Native American children were forcibly removed from their families and sent to mandatory boarding schools where they were not allowed to speak their language, practice traditional ways or visit their homes, sometimes for three or four years at a time. As a result, a generation of Cherokee people grew up unable to speak the Cherokee language, and it is now estimated to be spoken by fewer than 10% of Cherokee people. Among those who do speak the language, the script is widely used for writing letters, recipes, folktales, diaries, and for personal record-keeping. It is also used in some legal, governmental and religious documents and, in some areas, public signage. Efforts are being made to revive both the language and the script; to that end it is used in a limited capacity in education. Knowledge of the script is considered a prerequisite for full Cherokee citizenship. Two widely used publications in the script are a Cherokee New Testament and a hymnal. Although the orthography has never officially been standardized, many regard the spellings in these books as a standard for formal language.</p>"
},
{
"script": "Cirt",
"name": "Cirth",
"unicode_pdf": null,
"family": "Artificial",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": true,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Fictional",
"baseline": "bottom",
"ligatures": null,
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Cirth script was created by J. R. R. Tolkien for writing the elvish and dwarvish languages spoken in the mythological world of Middle-earth. The shapes of the Cirth are based on the Futhark runes, and they are used in Middle-earth for writing inscriptions on wood and stone, in the same way that runes have been used in the real world.</p> <p>Each rune generally represents one sound, and each sound is represented by one rune. Cirth was used for the Khuzdûl, Sindarin, and Quenya languages, but some signs represent different sounds in different languages, and other signs are only used in one or two of the three languages. Some of the Cirth also had two forms, which could be glyph variants in one language, but represent two different sounds in another.</p> <p>The Cirth script was written from left to right with no punctuation.</p>"
},
{
"script": "Copt",
"name": "Coptic",
"unicode_pdf": null,
"family": "European",
"type": "alphabet",
"white_space": "none",
"open_type_tag": "copt",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": false,
"reordering": false,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": null,
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Coptic alphabet is closely derived from the Greek alphabet. It is used for writing the Coptic language, which was the latest descendant of the Egyptian language and is now the liturgical language of the Coptic church. Prior to the creation of the Coptic alphabet, Egyptian was written in the Greek alphabet; prior to that, in the Demotic script. By the 1st century, spoken Egyptian had become quite removed from Demotic writing. It was also around that time that the Christian religion began spreading from Greece to Egypt. Egyptian Christians wanted a visible symbol of their new faith, and in the 3rd century the Bible was translated into the Coptic language. This translation used the Greek alphabet supplemented by Demotic symbols for representing non-Greek sounds.</p> <p>The Standard Coptic script contained 32 letters, 25 from the Greek Script and 7 from the Demotic. Some dialects borrowed more or fewer letters from Demotic, to better represent their phonology. Early Coptic writing was from right to left, perhaps influenced by surrounding Semitic scripts, but later Standard Coptic was written from left to right. Vowels were not always written, although some orthographies used a small mark above the preceding consonant to indicate that the following vowel had been omitted. A small number of diacritics were used, for example to show where a syllable break occurred. Punctuation and inter-word spacing were not used.</p> <p>There were several dialects of the Coptic language. The language is generally believed to have fallen from everyday use in the 16th century, and is officially classified as extinct, although there have been some reports of scattered use of spoken Coptic in ethnic Egyptian families, but these few users are not passing the language onto the next generation. Coptic is one of the two liturgical languages of the Coptic Orthodox and Coptic Catholic churches, along with Arabic, and the script is still used in this context.</p>"
},
{
"script": "Cprt",
"name": "Cypriot",
"unicode_pdf": null,
"family": "European",
"type": "syllabary",
"white_space": "none",
"open_type_tag": "cprt",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "rtl",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Cypriot script was used from approximately the 11th to the 4th centuries BC, for writing the Greek language in Cyprus. It descended from the Linear A script, and is closely related to the Linear B script, but has no visual or structural relationship to the Greek alphabet. The script was used primarily for record keeping, not for literary purposes.</p> <p>The Cypriot script was a syllabary, with each of fifty-six letters representing an open (CV) syllable. It was written from right to left without interword spacing. Voiced sounds were under-represented in the script, despite being distinctive in speech. For example, the <span class=Em>p</span> series of letters (pa, pe, pi, po, pu) represented syllables beginning with both p and b.</p> <p>In order to represent syllables with a structure other than CV, for example, CCV, CVC or CVV (diphthongs), certain spelling conventions were implemented. Consonant clusters were written using CV signs whose vowels agreed with the vowel of the whole syllable. So <span class=Em>bre</span> was written using the signs be+re. In consonant clusters in which a nasal preceded another consonant, the nasal was not written. Closed CVC syllables were written using the <span class=Em>e</span> series of sounds. In spoken Greek, only the consonants n, r, and s were used at the end of a word, so only the signs representing <span class=Em>ne</span>, <span class=Em>re</span> and <span class=Em>se</span> were written to close a syllable. Diphthongs were always written out, with the independent vowel signs representing the second part of the diphthong.</p> <p>Word breaks were not indicated by spaces or any other means. </p>"
},
{
"script": "Cyrl",
"name": "Cyrillic",
"unicode_pdf": null,
"family": "European",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "cyrl",
"complex_positioning": "no",
"requires_font": false,
"unicode": true,
"diacritics": false,
"contextual_forms": null,
"reordering": false,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The creation of the Cyrillic script is traditionally attributed to Saint Cyril, a missionary working in Bulgaria during the 9th century. He and his brother are also credited with the invention of the Glagolitic script, a derivation of the Greek cursive alphabet which was modified to fit the sound systems of Slavic languages. Some historians credit Clement of Ohrid, a student of Saint Cyrils, with creating the Cyrillic script as a more readable writing system based on Glagolitic. The Cyrillic script was initially used for writing Old Church Slavonic (also called Old Bulgarian), but it has undergone a number of changes since that time, so much so that the old and modern variants are considered by many to be two different but related scripts. Many of the modern letterforms differ from those used in early Cyrillic writing, some letters have been dropped, and new letters have been added. An orthographic reform was implemented by the Russian tsar Peter the Great in 1708 which removed a number of obsolete letters so that Russian writing is now almost perfectly phonetic.</p> <p>The script has traditionally been used for writing the Slavic languages, of which Russian is the most widely spoken. During the nineteenth and twentieth centuries, particularly under Soviet rule, it was extended to write over 50 languages throughout Eastern Europe and Asia.</p> <p>Cyrillic is written from left to right with spaces between words. Upper- and lowercase letters are used; most printed lowercase letters are simply smaller versions of their uppercase equivalents. However, the shapes of handwritten and italic letters can vary greatly from the shapes of printed letters. With the conversion of the script to non-Slavic languages in which new sounds needed to be written, a number of <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#multig target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> digraphs, trigraphs and tetragraphs</a> were employed. Russian and the Slavic languages use only a few digraphs. Many new letters were also created for writing the non-Slavic languages by adding hooks, breves or diaereses to existing letters. These marks are not considered diacritics, instead, a letter so modified is considered a separate letter with the mark an intrinsic part.</p> <p>After the dissolution of the Soviet Union in 1991, some of the languages which had been forced to adopt the Cyrillic script opted instead to give the Latin or Arabic script official status. However, at the end of 2002, President Putin signed into law an addition to Russian Federation law stipulating that the alphabets of the state language of the Russian Federation and the state languages of its republics must be based on the Cyrillic script. Other scripts for the alphabets of the state language of the Russian Federation and the state languages of its republics can only be adopted under federal laws. The law was received very negatively in some states.</p>"
},
{
"script": "Cyrs",
"name": "Cyrillic (Old Church Slavonic variant)",
"unicode_pdf": null,
"family": "European",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Old Church Slavonic (also called Old Bulgarian) was the earliest Slavic language to be written. it was first written in the Glagolitic script, and later in the early Cyrillic script. Cyrillic writing has undergone a number of changes since that time, so much so that the old and modern variants are considered by many to be two different but related scripts.</p> <p>Notable differences between the Cyrillic as it was used for writing Old Church Slavonic and as it is used today include the following: Obsolete letters have been dropped, and some additional letters have been added. The repertoire of letters in the early Cyrillic alphabet is cited in documents from the time as being as few as 27 or as many as 43. This variation may be due to the writers conflicting definitions as to what constitutes a letter. Modern Cyrillic as it is used for writing the Russian language uses 33 letters.</p> <p>Early Cyrillic was unicameral, that is, there was only one case, although some letters were written larger than others for emphasis or decorative purposes. Modern Cyrillic is written using upper- and lower case. Early Cyrillic writing did not use separate symbols for numbers, rather, each letter was assigned a numeric value, as in the Greek script. A diacritic mark was written above the symbol to indicate that it was to be interpreted as a number. Modern Cyrillic has used Latin numbers since the early 18th century.</p> <p>Diacritics were also used in early Cyrillic writing to convey phonological information such as palatalization, stress and breathing, as well as abbreviation of a word. Many of these marks have been dropped from current usage, and other hookes, breves and diaereses have been added to facilitate the representation of non-Slavic languages in the script.</p> <p>Early Cyrillic punctuation was borrowed from Latin, Greek and Armenian.</p>"
},
{
"script": "Deva",
"name": "Devanagari (Nagari)",
"unicode_pdf": null,
"family": "Indic",
"type": "abugida",
"white_space": "between words",
"open_type_tag": "tagb",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": true,
"case": null,
"split_graphs": false,
"status": "Current",
"baseline": "hanging",
"ligatures": "required",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Devanagari is a Northern Brahmic script related to many other South Asian scripts including Gujarati, Bengali, and Gurmukhi, and, more distantly, to a number of South-East Asian scripts including Thai, Balinese, and Baybayin. The script is used for over 120 spoken Indo-Aryan languages, including Hindi, Nepali, Marathi, Maithili, Awadhi, Newari and Bhojpuri. It is also used for writing Classical Sanskrit texts. Generally the orthography of the script reflects the pronunciation of the language.</p> <p>The script is written from left to right. Letters hang from a headstroke, which is generally continuous throughout the length of the word, except when writing the letters <span class=Em>jha</span>, <span class=Em>tha</span>, <span class=Em>dha</span>, <span class=Em>bha</span>, <span class=Em>a</span> and <span class=Em>ā</span>, which all have a break in the headstroke. In handwriting, the headstroke is sometimes omitted.</p> <p>Devanagari is an abugida; each letter represents a consonant with an inherent ə vowel, which can be modified using vowel diacritics. Vowel diacritics can be written above, below, to the left or to the right of the consonant. There are thirty-two consonant and ten vowel letters, plus ten vowel diacritics. The vowel signs represent long and short forms of five vowel sounds. Vowel sounds which are not preceded by a consonant are written with a vowel letter; otherwise they are indicated by a vowel diacritic, or, in the case of ə, the lack thereof. There are also two letters for the long and short forms of the syllabic consonant r̩, which are ordered with the vowel letters.</p> <p>The Brahmic scripts are known for ordering the letters according to the principles of articulatory phonetics, that is, according to the place and manner in which the sound is produced. In Devanagari, the vowels are ordered first, with each short vowel followed by its long equivalent. The consonants are ordered by place, and within place, manner of articulation. Consonants are ordered in rows; velar, palatal, retroflex, dental, labial. Within each row, they follow the order unaspirated and aspirated voiceless, unaspirated and aspirated voiced, and nasal. Sonorants, fricatives and sibilants are ordered last.</p> <p>Some letters take variant forms depending on their context. Notably, the letter <span class=Em>ra</span> takes different forms when in initial/final position, when preceding the letter <span class=Em>ya</span>, when preceding a consonant other than <span class=Em>ya</span>, when following a consonant with a vertical stem, and when following a letter with a rounded bottom.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/m4/8t/m48txu8ufx_DevanagariRA.png height=242 width=270 /></p> <p>Most consonants have two forms; a full form and a half form, which often visually resembles the full form but is missing the vertical stem. The half forms can combine with the full forms of other letters to represent consonant clusters. Those letters which dont have a half form make use of a <span class=Em>halant</span> symbol which silences the inherent vowel so that the two consonants can be pronounced in sequence. Some combinations join horizontally and some stack vertically. This is generally governed by the shape of the letter. A number of ligatures (commonly called <span class=Em>conjuncts</span> when referring to Indic scripts) are also used which do not bear visual similarity to the individual letters of which they are comprised.</p> <p>Devanagari writing employs three non-alphabetic symbols for modifying the quality of a vowel. <span class=Em>Anusvara</span> is written with a dot above the vowel and either nasalizes the vowel or represents a nasal consonant articulated at the same place as the following consonant. A variant form of this symbol is called <span class=Em>candrabindu</span> and is written with a moon dot above the letter to represent nasalization of the vowel exclusively. The symbol <span class=Em>visarga</span> follows <span class=Em>anusvara</span>; this symbol represents a voiceless breath following the vowel, and is written with two dots stacked vertically.</p>"
},
{
"script": "Dogr",
"name": "Dogra",
"unicode_pdf": null,
"family": "Indic",
"type": "abugida",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "yes",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": true,
"case": null,
"split_graphs": false,
"status": "Historical",
"baseline": "hanging",
"ligatures": "required",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Dogra or Dogri script is a Brahmi-based alphasyllabary that is written from left to right. Independent and initial vowels are written using letters, while dependent vowels are expressed using combining signs. Consonant letters possess the inherent vowel <span class=Em>a</span>, which is changed by the attachment of a vowel sign. Consonant clusters are written in various ways, such as with visible <span class=Em>halanta</span>, as an atomic ligature, and with half-forms; <span class=Em>repha</span> is attested, but rarely used. There are no formal conventions regarding representation of conjuncts.</p> "
},
{
"script": "Dsrt",
"name": "Deseret (Mormon)",
"unicode_pdf": null,
"family": "American",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "dsrt",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "rtl",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Deseret alphabet was used primarily by members of the Church of Latter-Day Saints (Mormons) in Utah for writing the English language. It was created in 1854 by George D. Watt, to whom the task had been given by Brigham Young, the church president at the time. Watt had studied Pitman shorthand prior to joining the Mormon church, and this may have influenced the creation of the Deseret letters. The alphabet was short-lived; by the time Young died in 1877 it had been abandoned. During its use, it underwent a number of revisions both in the number and the shape of the letters. The standardized version approved by the University of Deseret contained thirty-eight letters. Two additional letters have since been discovered in hand-written materials. Four books were published in the standardized version: two reading primers, the Book of Mormon, and an excerpt from it called the Book of Nephi. In addition, the script was used on public signage in Salt Lake City, in scripture quotations in the Deseret News, and in personal correspondence and journals. The development of the alphabet was somewhat controversial; critics suspected it was intended to further isolate members of the church from the outside world, but proponents of the script claimed that it was created to facilitate English language acquisition by non-English converts to Mormonism.</p> <p>The alphabet contains upper and lower case letters, which are identical apart from in size. It is written from right to left. Spelling is phonetic, with the exception of eight words which have the same pronunciation as individual letter names. That is, the letter pronounced ai is used to write the words eye and I"
},
{
"script": "Dupl",
"name": "Duployan shorthand, Duployan stenography",
"unicode_pdf": null,
"family": "American",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": false,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Duployan Shorthand (also called Sloan-Duployan Shorthand, or Duployan Stenography) was first created in 1860 by Emile Duployé, as a stenographic shorthand for rapidly writing French. It was also adapted for writing English, German, Spanish, and Romanian.</p> <p>Around 1879, a Catholic priest, Father Jean-Marie Raphaël Le Jeune, who had learned Duployan Stenography in France, went to British Columbia to work as a missionary among the Chinook. The Chinookan language family is made up of several closely related languages, many of which are now extinct or severely endangered. One of these, a creole called Chinook Jargon, is still spoken by a few small communities in Oregon. Father Le Jeune became frustrated with the inefficiency of the Roman alphabet for writing the Chinook languages, in particular Chinook Jargon, and adapted and augmented Duployan Shorthand for this purpose. In the 1890s, he published the Kamloops Wawa, a newspaper published in Chinook Jargon. Some sources treat Duployés shorthand and Father Le Jeunes augmented version for Chinook Jargon as two different scripts.</p> <p>Around the same time, Denis-Romulus Perrault, a stenographer from Montreal, was applying Duployan Shorthand to a number of First Nations languages spoken in modern-day Quebec and Newfoundland. The exact languages which Perrault expressed in Shorthand is unknown, however the script is currently used for writing the Okanagan, Lillooet, Shushwap and North Thompson languages of interior British Columbia, as well as Chinook Jargon.</p> <p>Duployan Shorthand is an alphabet, representing both consonants and vowels. In general, consonants are written using straight lines or curved arcs, and have invariable orientation (that is, they do not rotate to match the surrounding letters). Vowel letter-shapes tend to be based on circles and semi-circles, and can be rotated or reshaped to fit the context.</p> <p>The script is used slightly differently for different languages, having been augmented with extra symbols to represent the phonologies of each language. For example, Chinook writing also contains compound consonant letters and a logograph. Some languages also use variant forms of the core letter set, including letters with ancillary diacritics and size variants.</p> <p>Duployan Shorthand is written in a cursive style from left to right. It has been praised for its simplicity and the speed with which it can be written. </p>"
},
{
"script": "Egyd",
"name": "Egyptian demotic",
"unicode_pdf": null,
"family": "African",
"type": "logo-syllabary",
"white_space": "none",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "rtl",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Egyptian Demotic script was used from about 600 BC to about 400 AD for writing the Egyptian language. Demotic writing descended from Egyptian Hieratic writing, the manuscript writing system which was used alongside Hieroglyphics, and it was later used in place of both of these scripts. The development of the Demotic script is generally divided into three periods, Early, Middle (Ptolemaic) and Late (Roman). Demotic was mostly written on papyrus, embalming fabric and other soft surfaces, but was sometimes also inscribed in stone, the most famous example being the middle section of the Rosetta Stone. Initially the script was used primarily in an administrative context; later it came to be used for literary and religious texts also. It was gradually supplanted by the Coptic alphabet from 300 AD onwards.</p> <p>Egyptian Demotic was a cursive script, written from right to left. It was similar in form to hieroglyphic and hieratic writing, in that it used a combination of logographic and phonetic signs. It reproduced many of the same shapes of the pictographs of the earlier scripts, in a more cursive form which no longer necessarily resembled the concepts/objects they represented. New words for which there was no pre-existing sign were spelled using phonetic symbols. </p>"
},
{
"script": "Egyh",
"name": "Egyptian hieratic",
"unicode_pdf": null,
"family": "African",
"type": "logo-syllabary",
"white_space": "none",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Egyptian Hieratic was a writing system used for writing the Egyptian language. It developed alongside Hieroglyphic writing and was used for administrative and religious texts from approximately 3100 BC until 650 BC.</p> <p>Hieratic writing was normally written in ink with a reed brush on papyrus, whereas Hieroglyphs were inscribed in stone. Although it used a combination of pictographic and phonetic symbols, it was more cursive in form, and many of the symbols were unrelated to those used in Hieroglyphic writing. It is not the same script as cursive hieroglyphics.</p> <p>There were a number of concurrent forms of Hieratic writing, displaying varying degrees of cursiveness and defined by the era and the purpose for which they were written. The varieties can broadly be divided into two categories, a highly cursive \\\\\\businesshand\\\\\\\" which employed a large number of ligatures and was used for administrative documents"
},
{
"script": "Egyp",
"name": "Egyptian hieroglyphs",
"unicode_pdf": null,
"family": "African",
"type": "logo-syllabary",
"white_space": "none",
"open_type_tag": "none",
"complex_positioning": "unknown",
"requires_font": false,
"unicode": true,
"diacritics": null,
"contextual_forms": null,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "centered",
"ligatures": "unspecified",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>Egyptian hieroglyphics were used for writing the Egyptian language from about 3000 BC until 400 AD. Symbols resembling hieroglyphs had been used by artisans in the region since 4000 BC, but with no ascertainable linguistic content. The first hieroglyphs were used for making inscriptions on buildings and tombs. Later they came to be used to decorate jewelry, record events on papyrus and to put a royal or divine signature, called a <span class=Em>cartouche</span> on an item.</p> <p>The Egyptian hieroglyphic system of writing consisted of both phonetic symbols and pictographs. There were about 30 symbols representing single consonants, plus about 130 bilateral and trilateral symbols which represented two or three consonants. For example, the heart-with-trachea symbol represented the consonants n + f + r, pronounced <span class=Em>nefer</span>. Some symbols were used as phonetic complements, which reinforced the final consonant in a bi- or trilateral symbol. Vowels were not written. Pictograms were symbols which visually resembled the item or concept they represented. In total there were an estimated 7000 pictograms, but these were not all in use at the same time.</p> <p>Some symbols could be used either as a pictogram or as a phonetic symbol. For example, a rectangular spiral represented both the sound pr, and the concept of a house. Combined with the fact that vowels were not written, this resulted in some symbols or combinations of symbols having multiple possible meanings. To indicate which meaning was intended, a third class of symbol, called <span class=Em>determinatives</span>, was used. Often, a small vertical stroke underneath the symbol was used to indicate that the object/concept - rather than the phonetic - interpretation was intended. Another example of a determinative was a pictogram of a pair of legs to indicate that the preceding symbol was a verb of movement.</p> <p>Egyptian hieroglyphic writing did not use any punctuation or word spacing. Symbols were arranged either in vertical columns or horizontal rows, and could be read from right to left or from left to right. In order to demonstrate in which direction the text had been written, the scribe would orientate any animate pictograms (that is, symbols in the shape of a human or animal) to face in the intended direction of reading.</p>"
},
{
"script": "Elba",
"name": "Elbasan",
"unicode_pdf": null,
"family": "European",
"type": "alphabet",
"white_space": "between words",
"open_type_tag": "none",
"complex_positioning": "no",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": false,
"reordering": false,
"case": null,
"split_graphs": null,
"status": "Historical",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Elbasan script was created by Albanian Christians in the mid-18th century. It is known primarily from its use in the Elbasan Gospel Manuscript, the second-oldest manuscript in Albania.</p> <p>Elbasan was an alphabet written from left to right. Forty letters were used, influenced in part by the Greek and Slavic scripts, but largely novel creations. It is thought that the Greek system of marking certain letters with an overline to indicate a numeric value was also used by Elbasan writers. </p> <p>Elbasan is no longer used, the Latin alphabet having been standardized for Albanian in 1909. </p>"
},
{
"script": "Ethi",
"name": "Ethiopic (Geʻez)",
"unicode_pdf": null,
"family": "African",
"type": "abugida",
"white_space": "between words",
"open_type_tag": "ethi",
"complex_positioning": "no",
"requires_font": false,
"unicode": true,
"diacritics": true,
"contextual_forms": true,
"reordering": null,
"case": null,
"split_graphs": null,
"status": "Current",
"baseline": "bottom",
"ligatures": "none",
"direction": "ltr",
"direction_notes": null,
"sample": null,
"sample_img": null,
"description": "<p>The Ethiopic (Geez) script was developed as the writing system of the Geez language, a Semitic language spoken in Ethiopia and Eritrea until the 10th to the 12th centuries. Although the language ceased to be used in vernacular speech (it now serves a liturgical function only), the script is still widely used for writing the Ethiopian and Eritrean Semitic languages such as Tigré, Amharic and Tigrinya. In some languages, the script is called <span class=Em>fidäl</span> (ፊደል), which means alphabet, and individual letters are referred to as <span class=Em>fidel</span>. The script is believed by many to have derived from the epigraphic South Arabian script, of Proto-Sinaitic heritage, although there is some dispute surrounding this assertion; some also believe it to have descended from Egyptian hieroglyphics. According to the tradition of the Ethiopian Orthodox Tewahedo Church, the script was divinely revealed to Enos, grandson of the first man, Adam. Unlike other Semitic scripts, Ethiopic is written from left to right.</p> <p>The original Geez script was an abjad - vowels were not written - but the current script is classified as an abugida. Each symbol represents a CV syllable, but vowels are not inherent in the consonant. The original Ethiopic script contained 182 characters, although the basic (unmarked) consonants number only 26. The script has since been extended for other languages and now contains over 500 symbols. Some of the new symbols represent phonological processes such as palatalization, pharyngealization and labialization.</p> <p>The unmarked set is known as the first order (also called the first form). Each of the first order consonants can be combined with one of six vowels, to produce a syllograph. The resulting sets of syllographs are known as the second, third, fourth, fifth, sixth and seventh orders. In some languages there is an additional vowel symbol which can be combined with the first order consonants to produce an eighth order.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/zf/fl/zffllj5kg6_Ethioforms.png height=238 width=397 /></p> <p>As shown in the chart, the way in which consonant characters are combined with vowel signs often follows a general pattern, particularly in the second to sixth orders, but it is by no means fully systematic. This has sometimes resulted in the creation of a new character, unrelated in form to the unmarked consonant. For this reason, in tables of Ethiopic letters, all 182 characters tend to be displayed, unlike tables of Brahmic abugidas which combine consonants and vowels in a predictable manner. </p> <p>The script does not completely represent some languages. For example, <a href=http://scripts.sil.org/cms/scripts/page.php?&cat_id=Glossary#gemination target=_blank><img src=../sites/s/themes/default/_media/system_icons/offsite_link.png /> gemination</a> (consonant lengthening) is generally not represented in the orthography, despite being distinctive in speech. Conversely, some distinctions in the Amharic orthography are no longer significant in speech, for example, the difference between h, ħ and x is only etymologically significant in spoken Amharic but continues to be represented in the orthography.</p> <p><img src=http://scriptsource.org/cms/scripts/../sites/s/media/database/ssproto/entries/zf/fl/zffllj5kg6_EthioH.png height=66 width=145 /></p> <p>The conventional ordering of the consonants differs from that of other Semitic scripts; the reasons for this are unclear although it is speculated that the ordering is reminiscent of that used for the ancient Sabean script, a South Semitic predecessor of Geez.</p>"