Medieval Unicode Font Initiative


MUFI proposal for coordinated usage of the Private Use Area

Ed. by Odd Einar Haugen, University of Bergen, Norway

Version 1.0 beta (30 September 2003)

 

Contents

A: Base characters
1 Ligatures
2 Small capitals
3 Enlarged minuscules
4 Base line abbreviation characters
5 Modified base line abbreviation characters
6 Combining marks
7 Combining superscript characters
8 Punctuation marks
9 Critical and epigraphical signs
10 Metrical symbols

B: Precomposed characters
15 Characters with overline or macron
16 Characters with acute accent
17 Characters with double acute accent
18 Characters with dot above
19 Characters with dot below
20 Characters with diaeresis
21 Characters with hook above
22 Characters with ogonek
23 Characters with breve
24 Characters with macron and breve
25 Characters with macron and acute

C: Variant letter forms
30 Nordic
31 English
32 Portuguese

 

Background & contributors

The present proposal is a major revision and extension of two proposals published on the MUFI web site:

A proposal for subranges within the Private Use Area of Unicode (15 June 2002)
A proposal for supplementary characters in Unicode (5 February 2003)

These proposals received a number of helpful comments from (in alphabetical order) Jim Allan (U.S.), Deborah W. Anderson (Berkeley, CA), Peter S. Baker (Charlottesville, VA), Michael Beddow (Leeds), António H.A. Emiliano (Lisboa), Michael Everson (Dublin), Jost Gippert (Frankfurt), Juan-José G. Marcos (Plasencia, Spain), Susana T. Pedro (Lisboa), David J. Perry (Rye, NY), Gerhard Schumacher (Köln), Ken Whistler (Unicode consortium), and several other Unicode officers.

The present proposal is the result of a meeting held in Bergen, Norway, 30-31 August 2003. Participants at this meeting were Odd Einar Haugen (Bergen), Michael Everson (Dublin), Michael Irlenbusch (Bergen), Alec McAllister (Leeds), Gerhard Schumacher (Köln), and Tarrin Wills (Sydney).

This proposal was published 8 September 2003 on the MUFI site for review and has been discussed electronically within the MUFI group in the ensuing weeks.

Many aspects of this proposal will be controversial, and more than one of the contributors and advisors listed above may disagree with the solutions chosen in the proposal. It is, however, clear that this proposal would have progressed much more slowly and been much inferior had it not been for all the help and comments received.

 

 

Introduction

This proposal contains a set of characters (or variant character forms) for the use of medievalists and to some extent classicists. The aim of the proposal is to establish a coordinated usage of code points in the Private Use Area, and it particularly aims at coordinating the usage of code points in existing Unicode fonts such as Titus and Junicode with fonts under development.

The proposal contains a representative glyph for each character, a categorisation, a recommended entity name, a Unicode code point, and a descriptive name.

Glyphs
The glyphs used in this proposal are for guidance only.

Categories
Characters are divided into six categories:

0. Characters which already are in the Unicode standard. They have been included in the proposal for the sake of information.

1. Characters which should be proposed for inclusion in the Unicode standard (or currently are pending approval). Characters approved by Unicode should subsequently be removed and substituted with a warning sign, preferably the character in question placed within a triangular sign.

2. Precomposed characters which can be encoded with a sequence of Unicode characters, but which cannot be displayed easily or properly with existing font technology. Characters in this category should be removed as soon as there is mature smart font technology available.

3. Variant character forms (e.g. regional or chronological variants). Characters in this category might be proposed as variants in the Unicode standard. They should be removed as soon as there is mature smart font technology available for handling variant letter forms.

4. Characters that are under review for inclusion in the MUFI proposal. For this reason, space has been reserved for them.

5. Characters already in the Unicode standard which ought to have a separate entity name for semantic reasons, e.g. to distinguish the semicolon used as a punctuation mark from the semicolon used as an abbreviation mark. This category is relevant only for encoding purposes.

Entity names
Entities are used in numerous encoding schemes such as SGML/XML. For the sake of transparency and interchangeability, it is recommended that entities as far as possible conform to the standard ISO entity sets. An updated list of ISO-conformant entities can be found at the OASIS web site.

In addition to the ISO entities, a number of entities for characters not designated in this standard are needed. This proposal uses the syntax and inventory recommended in the Menota handbook, ch. 2 and 5, summarised in the table below. Note that not all slots need to be filled in; in most cases only one or two slots are used in addition to the base line character.

base line character: a, A, etc.
main type: comb, scap, enl, ins, unc, run
variant: rot, squ, tall, dotless, brk, close, open, fixhi, fixmed
ligature: lig
fixed modification: slash, bar, baracr, barbl, flour, ovl, ogon, hook, loopupright, looplowleft, des
loose modification: acute, curl, dblac, dot, uml, grave, brev, macr, dotbl, sup

Explanation of the entity names. Note that if there is a conflict between the standard ISO entities as of 01.09.2003 and the syntax suggested here, the ISO entities should be preferred.
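As a rough illustration of the slot syntax, the following Python sketch builds entity names by concatenating the filled-in slots. The function name, the keyword names and the assumption that slots are concatenated in the order of the table are illustrative only; the Menota handbook remains the authority on the syntax.

```python
# Inventory copied from the table of entity name slots.
# The slot order (main type, variant, ligature, fixed, loose) is an
# assumption taken from the column order of that table.
INVENTORY = {
    "main_type": {"comb", "scap", "enl", "ins", "unc", "run"},
    "variant": {"rot", "squ", "tall", "dotless", "brk", "close", "open",
                "fixhi", "fixmed"},
    "ligature": {"lig"},
    "fixed": {"slash", "bar", "baracr", "barbl", "flour", "ovl", "ogon",
              "hook", "loopupright", "looplowleft", "des"},
    "loose": {"acute", "curl", "dblac", "dot", "uml", "grave", "brev",
              "macr", "dotbl", "sup"},
}


def entity_name(base, main_type=(), variant=(), ligature=(), fixed=(), loose=()):
    """Return an entity reference such as '&avligslash;' (hypothetical helper)."""
    parts = [base]
    filled = (("main_type", main_type), ("variant", variant),
              ("ligature", ligature), ("fixed", fixed), ("loose", loose))
    for slot, values in filled:
        for value in values:
            # Reject values that are not in the inventory for this slot.
            if value not in INVENTORY[slot]:
                raise ValueError(f"{value!r} is not a known {slot} value")
            parts.append(value)
    return "&" + "".join(parts) + ";"


print(entity_name("av", ligature=["lig"], fixed=["slash"]))  # &avligslash;
print(entity_name("b", main_type=["scap"], loose=["sup"]))   # &bscapsup;
```

Each slot accepts a list because some entity names in this proposal fill a slot twice, e.g. "bar" and "des" in &Pbardes;.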

MUFI code point
Recommended code point in the Private Use Area in the Basic Multilingual Plane of Unicode, E000 - F8FF.
If a character already is in the Unicode standard (or there is a near-identical character in the standard), the official, non-PUA code point will be given here.

Junicode code point
Junicode code points are located in the range F100 - F19F. As far as possible, Junicode code points have been re-used in this proposal.

Titus code point
Titus code points are located in the range E000 - F0FF. Precomposed MUFI characters have been allocated to this area by the Titus project.

Descriptive name
Each character has been given a descriptive name, as far as possible according to the rules in the Unicode standard. The Menota handbook ch. 2 has further details on the interpretation of the Unicode naming rules.
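The code point ranges named above can be checked mechanically. The sketch below is only an illustration: the function name and the returned labels are made up for this example, and the range boundaries are the ones stated in this section (Private Use Area E000 - F8FF, Titus E000 - F0FF, Junicode F100 - F19F).

```python
# Ranges as stated in this proposal; Python ranges exclude the upper
# bound, hence the "+ 1".
BMP_PUA = range(0xE000, 0xF8FF + 1)
TITUS = range(0xE000, 0xF0FF + 1)
JUNICODE = range(0xF100, 0xF19F + 1)


def classify(code_point):
    """Return a label for the area a code point falls in (hypothetical helper)."""
    if code_point in TITUS:
        return "Titus range"
    if code_point in JUNICODE:
        return "Junicode range"
    if code_point in BMP_PUA:
        return "other BMP Private Use Area"
    return "outside the BMP Private Use Area"


for cp in (0xE7A0, 0xF156, 0xF200, 0x204A):
    print(f"{cp:04X}: {classify(cp)}")
```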

 


 

A: Base characters (F100 - F1FF and F200 - F28F)

Base characters are base line characters or combining characters making up a part of a composite character. The allocation of code points has been coordinated with characters in the Junicode and Alphabetum fonts. Subranges 1, 2, 3 and 9 are placed in the F200 - F24F section; subranges 4, 5, 6, 7 and 8 in the F100 - F1FF section (using already existing Junicode code points wherever possible), and subrange 10 is placed in the F700 - F71F section (using already existing Alphabetum code points).

Four subranges, 11 - 14, have been reserved for future additions, and the code points F250 - F28F (64 positions) have been set aside for this purpose.
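The size of a reserved block follows from its inclusive end points. A minimal sketch (the function name is chosen for this example only):

```python
def block_size(first_hex, last_hex):
    """Number of code points from first_hex to last_hex, both inclusive."""
    return int(last_hex, 16) - int(first_hex, 16) + 1


# The block reserved for subranges 11 - 14.
print(block_size("F250", "F28F"))  # 64
```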


MUFI subrange 1: Ligatures

Ligatures are two base line characters which are joined so that they form a new, composite base line character. Some consist of two identical characters, e.g. "a+a", others of different characters, e.g. "a+v". In Medieval Nordic manuscripts, ligatures may be used to denote length, "a+a", diphthong, "a+v", or a distinct vowel quality, often mutation (Umlaut), "a+v". Only ligatures which reflect a distinct phonological value should be recognised as characters of their own. Finally, the broken character "l" representing "ll" should be seen as a ligature of two stems, broken in the middle.

Glyph

Cat.

Entity

MUFI

Descriptive name

1

&AAlig;

F200

LATIN CAPITAL LIGATURE AA

1

&aalig;

F201

LATIN SMALL LIGATURE AA

1

&AOlig;

F202

LATIN CAPITAL LIGATURE AO

1

&aolig;

F203

LATIN SMALL LIGATURE AO

1

&AUlig;

F204

LATIN CAPITAL LIGATURE AU

1

&aulig;

F205

LATIN SMALL LIGATURE AU

1

&AVlig;

F206

LATIN CAPITAL LIGATURE AV

1

&avlig;

F207

LATIN SMALL LIGATURE AV

4

&AVligslash;

F208

LATIN CAPITAL LIGATURE AV WITH STROKE

2

&avligslash;

F209

LATIN SMALL LIGATURE AV WITH STROKE

1

&AYlig;

F20A

LATIN CAPITAL LIGATURE AY

1

&aylig;

F20B

LATIN SMALL LIGATURE AY

1

&lbrk;

F20C

LATIN SMALL LETTER BROKEN L

Reserved space after this range for additions: F20D to F214 (8 code points).
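For encoders working with SGML/XML, the entity names and code points in the table above can be turned into entity declarations. The sketch below assumes that each entity should resolve to a numeric character reference for its MUFI code point; the dictionary and function names are illustrative and not part of the proposal.

```python
# Entity names and MUFI code points copied from the ligature table.
LIGATURES = {
    "AAlig": 0xF200, "aalig": 0xF201,
    "AOlig": 0xF202, "aolig": 0xF203,
    "AUlig": 0xF204, "aulig": 0xF205,
    "AVlig": 0xF206, "avlig": 0xF207,
    "AVligslash": 0xF208, "avligslash": 0xF209,
    "AYlig": 0xF20A, "aylig": 0xF20B,
    "lbrk": 0xF20C,
}


def entity_declaration(name, code_point):
    """Format one XML internal entity declaration (hypothetical helper)."""
    return f'<!ENTITY {name} "&#x{code_point:04X};">'


declarations = [entity_declaration(name, cp) for name, cp in LIGATURES.items()]
print("\n".join(declarations))
```

Keeping the mapping in one place means that a code point can be changed later, for instance if a character is accepted into Unicode, without touching the encoded texts.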

 



MUFI subrange 2: Small capitals

Small capitals have the same form as majuscules (capital letters), but are usually drawn with the same height as a minuscule (small) letter such as "x". In Medieval Nordic manuscripts, small capitals were used to denote geminates, i.e. long consonants, or they were used ornamentally. The letters "B", "D", "G", "M", "N", "R", "S" and "T" were most frequently used as geminates, while these and other letters might also be used as ornaments in the whole or in parts of highlighted words. Some of the small capitals, e.g. "O" and "C", are difficult to distinguish from minuscule letters.

Unicode 4.0 has defined nine small capitals in the IPA Extensions range, sc. "B", "G", "H", "I", "L", "N", "Œ", "R" and "Y", and another 15 small capitals for the Uralic Phonetic Alphabet in the Phonetic Extensions range, "A", "C", "D", "ETH", "E", "J", "K", "M", "O", "P", "T", "U", "V", "W" and "Z". Thus, only a handful of small capitals remain. Of these, only small capital "S" and "F" can appear as geminates. The rest, i.e. "Q", "THORN" and "X", can only appear as small capitals in ornamental usage.

Glyph

Cat.

Entity

MUFI

Descriptive name

1

&fscap;

F215

LATIN LETTER SMALL CAPITAL F

4

&qscap;

F216

LATIN LETTER SMALL CAPITAL Q

1

&sscap;

F217

LATIN LETTER SMALL CAPITAL S

4

&thscap;

F218

LATIN LETTER SMALL CAPITAL THORN

4

&xscap;

F219

LATIN LETTER SMALL CAPITAL X

Reserved space after this range for additions: F21A to F21F (6 code points).

 



MUFI subrange 3: Enlarged minuscules

Enlarged minuscules are recognized as separate characters by some scholars, cf. e.g. Andrea de Leeuw van Weenen (A Grammar of Möðruvallabók, CNWS 85, Leiden 2000). The traditional view has been to interpret these characters as variants of majuscules and encode them as such. It can be argued that this is a functional rather than a graphemic point of view and that it obscures the obvious distinction between e.g. "A" (the majuscule) and "a" (the enlarged minuscule). 

Glyph

Cat.

Entity

MUFI

Descriptive name

3

&aenl;

F220

LATIN ENLARGED LETTER SMALL A

3

&benl;

F221

LATIN ENLARGED LETTER SMALL B

4

&cenl;

F222

LATIN ENLARGED LETTER SMALL C

3

&denl;

F223

LATIN ENLARGED LETTER SMALL D

3

&duncenl;

F224

LATIN ENLARGED LETTER UNCIAL D

4

&ethenl;

F225

LATIN ENLARGED LETTER SMALL ETH

3

&eenl;

F226

LATIN ENLARGED LETTER SMALL E

3

&fenl;

F227

LATIN ENLARGED LETTER SMALL F

3

&genl;

F228

LATIN ENLARGED LETTER SMALL G

3

&henl;

F229

LATIN ENLARGED LETTER SMALL H

3

&ienl;

F22A

LATIN ENLARGED LETTER SMALL I

3

&jenl;

F22B

LATIN ENLARGED LETTER SMALL J

3

&kenl;

F22C

LATIN ENLARGED LETTER SMALL K

4

&lenl;

F22D

LATIN ENLARGED LETTER SMALL L

3

&menl;

F22E

LATIN ENLARGED LETTER SMALL M

3

&nenl;

F22F

LATIN ENLARGED LETTER SMALL N

4

&oenl;

F230

LATIN ENLARGED LETTER SMALL O

3

&penl;

F231

LATIN ENLARGED LETTER SMALL P

3

&qenl;

F232

LATIN ENLARGED LETTER SMALL Q

3

&renl;

F233

LATIN ENLARGED LETTER SMALL R

4

&senl;

F234

LATIN ENLARGED LETTER SMALL S

3

&tenl;

F235

LATIN ENLARGED LETTER SMALL T

3

&thenl;

F236

LATIN ENLARGED LETTER SMALL THORN

4

&uenl;

F237

LATIN ENLARGED LETTER SMALL U

4

&venl;

F238

LATIN ENLARGED LETTER SMALL V

4

&wenl;

F239

LATIN ENLARGED LETTER SMALL W

4

&xenl;

F23A

LATIN ENLARGED LETTER SMALL X

3

&yenl;

F23B

LATIN ENLARGED LETTER SMALL Y

4

&zenl;

F23C

LATIN ENLARGED LETTER SMALL Z

Reserved space after this range for additions: F23D to F23F (3 code points).

The usage of the F200 - F28F section continues with subrange 9 below.

 



 

MUFI subrange 4: Base line abbreviation characters

This range includes those abbreviation signs which typically occupy a position on the base line. Very few abbreviation signs are included in Unicode 4.0. An exception is the sign for "et", which is found in the range General punctuation as TIRONEAN SIGN ET (204A).

The Runic characters "f" and "m" are sometimes used as abbreviation marks; they are now included in the Unicode range Runic, as 16A0 and 16D8 respectively. Since they are used with their alphabetical names as abbreviations ("fé" and "maðr" respectively), it is not necessary to define them as separate characters.

To minimise conflicts with Junicode, code points in the section F142 - F163 have been used for subranges 4-8. Space for additional characters in these subranges will also be allocated to the F100 - F1FF section (see below this table).

Red colour: code points in Junicode to be considered for deprecation.  

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

5

&apo;

0027

APOSTROPHE

A sign similar to the apostrophe was used as an abbreviation for "i" or "e". Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 125.

1

&con9;

F156

F156

LATIN ABBREVIATION SIGN CON

5

&cono;

0254

LATIN SMALL LETTER OPEN O

5

&cross;

271D

LATIN CROSS

0

&obiit;

03B8

GREEK SMALL LETTER THETA

Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 30.

5

&est;

223B

F150

LATIN ABBREVIATION SIGN EST

Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 82. Note that Junicode has allocated a separate code point to this character, but it is recommended to use the existing character 223B.

4

&ET;

F142

F142

TIRONEAN LARGE SIGN ET

Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 30 and 81.

0

&et;

204A

F143

TIRONEAN SIGN ET

Note that Junicode has allocated a separate code point to this character, but it is recommended to use the existing character 204A.

2

&etbar;

F158

TIRONEAN SIGN ET WITH CROSSBAR

Identical to 03/09 LATIN CONTRACTION ET in ISO/IEC JTC 1/SC 2 N 3126.

0

&amp;

0026

AMPERSAND

Used in some Medieval Nordic manuscripts for "ok".

1

&rum;

F154

F154

LATIN ABBREVIATION SIGN RUM

1

&de;

F159

LATIN ABBREVIATION SIGN DE

This sign looks similar to "eth", but is used as an abbreviation for "de" and should therefore have its own code point. Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 25, 44 and 60.

1

&is;

F15A

LATIN ABBREVIATION SIGN IS

Cf. Johs. Brøndum-Nielsen, ed., Palæografi: Danmark og Sverige (Nordisk kultur 23:A), Stockholm etc. [1943], p. 32, and D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 125. Possibly identical to 02/14 LATIN CONTRACTION IS in ISO/IEC JTC 1/SC 2 N 3126.

0

&ed;

003B

SEMICOLON

1

&etfin;

F155

F155

LATIN ABBREVIATION SIGN FINAL ET

Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 125. Possibly identical to 03/12 LATIN CONTRACTION UM in ISO/IEC JTC 1/SC 2 N 3126.

Reserved space for additions to this subrange: F15C - F15F (4 code points) and F1A0 - F1AF (16 code points).

 



MUFI subrange 5: Modified base line abbreviation characters

This range includes a number of abbreviation characters, typically a base line character with a bar across.

To minimise conflicts with Junicode, code points in the section F142 - F163 have been used for subranges 4-8. Space for additional characters in these subranges will also be allocated to the F100 - F1FF section (see below this table).

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

5

&hhook;

0266

LATIN SMALL LETTER H WITH HOOK (LIGATURE OF H AND TALL S)

5

&khook;

0199

LATIN SMALL LETTER K WITH HOOK (LIGATURE OF K AND TALL S)

1

&thornhook;

F1B0

LATIN SMALL LETTER THORN WITH HOOK (LIGATURE OF THORN AND TALL S)

Unicode 4.0 has a "p with hook", 01A5, but that should probably not be used for "thorn". In many fonts it does not look like a thorn at all.

1

&hhookbar;

F1B1

LATIN SMALL LETTER H WITH HOOK (LIGATURE OF H AND TALL S) AND STROKE

1

&khookbar;

F1B2

LATIN SMALL LETTER K WITH HOOK (LIGATURE OF K AND TALL S) AND STROKE

1

&thornhookbar;

F1B3

LATIN SMALL LETTER THORN WITH HOOK (LIGATURE OF THORN AND TALL S) AND STROKE

1

&kbar;

F14B

F14B

LATIN SMALL LETTER K WITH STROKE

1

&thornbar;

F149

F149

LATIN SMALL LETTER THORN WITH STROKE

1

&stallbar;

F14F

F14F

LATIN SMALL LETTER TALL S WITH STROKE

1

&Pbardes;

F144

F144

LATIN CAPITAL LETTER P WITH STROKE THROUGH DESCENDER

1

&pbardes;

F145

F145

LATIN SMALL LETTER P WITH STROKE THROUGH DESCENDER

1

&qbardes;

F14D

F14D

LATIN SMALL LETTER Q WITH STROKE THROUGH DESCENDER

1

&thbardes;

E7A0

LATIN SMALL LETTER THORN WITH STROKE THROUGH DESCENDER

This character happens to be in the Titus range.

1

&Pflour;

F146

F146

LATIN CAPITAL LETTER P WITH FLOURISH

1

&pflour;

F147

F147

LATIN SMALL LETTER P WITH FLOURISH

0

&ubar;

0289

LATIN SMALL LETTER U BAR

The "u" with a bar across was used in some Late Medieval Danish manuscripts for /y/. Cf. Johs. Brøndum-Nielsen, ed., Palæografi: Danmark og Sverige (Nordisk kultur 23:A), Stockholm etc. [1943], p. 53 and 55.

Reserved space for additions to this subrange: F148, F14A, F14C, F14E (4 code points) and F1B4 - F1BF (12 code points).

 



MUFI subrange 6: Combining abbreviation and diacritical marks

This range includes those abbreviation signs which typically occupy a position above, through or below another base line character. Combining diacritical marks are also included in this range.

To minimise conflicts with Junicode, code points in the section F142 - F163 have been used for subranges 4-8. Space for additional characters in these subranges will also be allocated to the F100 - F1FF section (see below this table).

 Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

5

&bar;

0305

COMBINING ABBREVIATION MARK BAR ABOVE

2

&arbar;

F1C0

COMBINING ABBREVIATION MARK BAR ABOVE WITH DOT

5

&baracr;

0336

COMBINING ABBREVIATION MARK BAR ACROSS

5

&barbl;

0332

COMBINING ABBREVIATION MARK BAR BELOW

1

&er;

F152

F152

COMBINING ABBREVIATION MARK SUPRALINEAR "ER" (ZIG-ZAG SIGN)

1

&ra;

F157

F157

COMBINING ABBREVIATION MARK SUPRALINEAR "RA" (OMEGA SIGN)

2

&rabar;

F1C1

COMBINING ABBREVIATION MARK SUPRALINEAR "RA" (OMEGA SIGN) WITH BAR ABOVE

1

&ur2;

F153

F153

COMBINING ABBREVIATION MARK SUPRALINEAR "UR" (2-SIGN)

1

&ur8;

F1C2

COMBINING ABBREVIATION MARK SUPRALINEAR "UR" (8-SIGN)

3

&ur8open;

F1C3

COMBINING ABBREVIATION MARK SUPRALINEAR "UR" (OPEN 8-SIGN)

1

&us;

F151

F151

COMBINING ABBREVIATION MARK SUPRALINEAR "US" (9-SIGN)

4

F15B

COMBINING ABBREVIATION MARK SUPRALINEAR "US" (9-SIGN)

1

&combcurl;

F1C4

COMBINING CURL

4

F163

F163

COMBINING CIRCUMFLEX OVER TWO CHARACTERS

1

F1C5

COMBINING FLOURISH

The combining flourish looks like a large ogonek and is attached to the end of a character, typically "d", to indicate a suspended ending. Cf. P.L. Hjorth, ed., Karl Magnus' Krønike, Copenhagen 1960, p. XXXI.

Reserved space for additions to this subrange: F1C6 - F1CF (10 code points).

 



MUFI subrange 7: Combining superscript characters

This range includes superscript characters, typically placed above another base line character. They are found in many early German printed texts, and in a large number of Medieval manuscripts. The position immediately above a base line character distinguishes them from raised interlinear characters typically occupying a position immediately after another base line character. This latter type includes a handful of phonetic modifiers such as a raised "w" indicating rounding, a raised "h" indicating aspiration etc.

Unicode 4.0 has a selection of 13 superscript characters, namely "a", "e", "i", "o", "u", "c", "d", "h", "m", "r", "t", "v", "x". They are located at the end of the range Combining diacritical marks, 0363-036F.
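The 13 superscript letters already in the standard can be listed from the Unicode character database. The sketch below uses Python's standard unicodedata module; the variable names are illustrative.

```python
import unicodedata

# Combining Latin small letters at the end of the Combining Diacritical
# Marks range, 0363 to 036F inclusive.
names = [unicodedata.name(chr(cp)) for cp in range(0x0363, 0x036F + 1)]

# The last word of each name is the letter itself, e.g.
# "COMBINING LATIN SMALL LETTER A" gives "a".
letters = [name.rsplit(" ", 1)[-1].lower() for name in names]

print(len(letters), " ".join(letters))
```

A character proposed in the table below does not need a Private Use Area code point if its letter already appears in this list.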

The characters in the list below are documented in Andrea de Leeuw van Weenen, A Grammar of Möðruvallabók (CNWS 85), Leiden 2000.

Since these characters are closely related to subrange 6 (combining abbreviation and diacritical marks) they are placed next to the latter subrange, commencing with F1D0. 

 Glyph

 Cat.

Entity

MUFI

Descriptive name

1

&aeligsup;

F1D0

COMBINING LATIN SMALL LETTER AE

1

&bsup;

F1D1

COMBINING LATIN SMALL LETTER B

1

&bscapsup;

F1D2

COMBINING LATIN LETTER SMALL CAPITAL B

1

&dhsup;

F1D3

COMBINING LATIN SMALL LETTER ETH

1

&dscapsup;

F1D4

COMBINING LATIN LETTER SMALL CAPITAL D

1

&fsup;

F1D5

COMBINING LATIN SMALL LETTER F

1

&gsup;

F1D6

COMBINING LATIN SMALL LETTER G

1

&gscapsup;

F1D7

COMBINING LATIN LETTER SMALL CAPITAL G

1

&ksup;

F1D8

COMBINING LATIN SMALL LETTER K

1

&kscapsup;

F1D9

COMBINING LATIN LETTER SMALL CAPITAL K

1

&lsup;

F1DA

COMBINING LATIN SMALL LETTER L

1

&lscapsup;

F1DB

COMBINING LATIN LETTER SMALL CAPITAL L

1

&mscapsup;

F1DC

COMBINING LATIN LETTER SMALL CAPITAL M

1

&nsup;

F1DD

COMBINING LATIN SMALL LETTER N

1

&nscapsup;

F1DE

COMBINING LATIN LETTER SMALL CAPITAL N

1

&psup;

F1DF

COMBINING LATIN SMALL LETTER P

1

&rscapsup;

F1E0

COMBINING LATIN LETTER SMALL CAPITAL R

1

&ssup;

F1E1

COMBINING LATIN SMALL LETTER S

1

&stallsup;

F1E2

COMBINING LATIN LETTER TALL S

1

&tscapsup;

F1E3

COMBINING LATIN LETTER SMALL CAPITAL T

1

&ysup;

F1E4

COMBINING LATIN SMALL LETTER Y

1

&zsup;

F1E5

COMBINING LATIN SMALL LETTER Z

Reserved space for additions to this subrange: F1E6 to F1EF (10 code points).

 



 

MUFI subrange 8: Punctuation marks

This range includes those punctuation marks that are not included in the official Unicode ranges. The common marks, such as full stop, comma, colon, semicolon, question mark, hyphen and solidus, are all found in Unicode 4.0 Basic Latin. The middle dot is included in Latin-1 Supplement (00B7), and although it is not defined specifically as a punctuation mark in this range, it is probably not necessary to duplicate it in the present range.

To minimise conflicts with Junicode, code points in the section F142 - F163 have been used for subranges 4-8. Space for additional characters in these subranges will also be allocated to the F100 - F1FF section (see below this table).

Red colour: code point in Junicode to be considered for deprecation.

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

1

&seminv;

F160

F160

PUNCTUATION MARK PUNCTUS ELEVATUS (INVERTED SEMICOLON)

3

&seminvdiag;

F1F0

PUNCTUATION MARK PUNCTUS ELEVATUS DIAGONAL STROKE

3

&questcurl;

F161

F161

PUNCTUATION MARK PUNCTUS INTERROGATIVUS

3

&quest8;

F1F1

QUESTION MARK HORIZONTAL 8 FORM

Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 63.

1

&diacom;

F1F2

PUNCTUATION MARK DIAERESIS ABOVE COMMA

Cf. Hreinn Benediktsson, Early Icelandic Script, Reykjavík 1965, p. 95, and D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 63. Note that the bottom part should look like a comma, not an ogonek.

4

&brevdot;

F1F3

PUNCTUATION MARK BREVE ABOVE DOT

Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 34.

0

&tridotsupw;

2234

THEREFORE

0

&tridotsright;

10FB

F162

GEORGIAN PARAGRAPH SEPARATOR

Note that Junicode has allocated a separate code point to this character, but it is recommended to use the existing character 10FB.

Reserved space for additions to this subrange: F1F4 to F1FF (12 code points).

 



MUFI subrange 9: Critical and epigraphical signs

Critical signs are used in printed editions, indicating corruptions, deletions, additions etc. The majority of these signs are already in Unicode 4.0, such as the asterisk (002A), the obelus (= dagger, 2020), curly brackets (007B, 007D), square brackets (005B, 005D), single vertical line (007C), double vertical line (2016), and open brackets (= angle brackets, 3008, 3009).

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

1

&whsqbl;

F240

LEFT WHITE SQUARE BRACKET

Or possibly 301A LEFT WHITE SQUARE BRACKET in CJK Symbols and Punctuation (problematic since this sign typically is very wide).

1

&whsqbr;

F241

RIGHT WHITE SQUARE BRACKET

Or possibly 301B RIGHT WHITE SQUARE BRACKET in CJK Symbols and Punctuation (problematic since this sign typically is very wide).

1

&hsqblu;

F242

LEFT UPPER HALF SQUARE BRACKET

Or possibly 2308 LEFT CEILING in Miscellaneous Technical.

1

&hsqbru;

F243

RIGHT UPPER HALF SQUARE BRACKET

Or possibly 2309 RIGHT CEILING in Miscellaneous Technical.

4

&hsqbll;

F244

LEFT LOWER HALF SQUARE BRACKET

Or possibly 230A LEFT FLOOR in Miscellaneous Technical. Used in the Scriptorium system to indicate beginning of reading supplemented from another source, cf. Michelle P. Brown, A Guide to Western Historical Scripts from Antiquity to 1600, London: The British Library, 1993, p. 6.

4

&hsqbrl;

F245

RIGHT LOWER HALF SQUARE BRACKET

Or possibly 230B RIGHT FLOOR in Miscellaneous Technical. Used in the Scriptorium system to indicate end of reading supplemented from another source, cf. Michelle P. Brown, A Guide to Western Historical Scripts from Antiquity to 1600, London: The British Library, 1993, p. 6.

1

&slstlu;

F16E

F16E

LEFT UPPER SLANTED STROKE

1

&slstru;

F16F

F16F

RIGHT UPPER SLANTED STROKE

1

&slstll;

F246

LEFT LOWER SLANTED STROKE

1

&slstrl;

F247

RIGHT LOWER SLANTED STROKE

Reserved space after this range: F248 to F24F (8 code points).

 



MUFI subrange 10: Metrical symbols

This is the type of symbols used by Eduard Sievers in his Altgermanische Metrik (Halle: Max Niemeyer, 1893). They are still frequently used in textbooks, monographs and articles on Medieval Nordic metrics.

Thesaurus Linguae Graecae has recently proposed a set of metrical symbols for Greek. This proposal has been approved by the Unicode Technical Committee, but not yet by ISO-10646.

Thesaurus Linguae Graecae: proposal for metrical symbols

This proposal includes symbols for the short syllable (breve) and for the combination of a short and long syllable (metrical short over long). The default character (anceps) is identified with MULTIPLICATION SIGN (00D7) in the range Latin-1 Supplement, and the symbol for long syllable (longum) is identified with FIGURE DASH (2012) or EN DASH (2013) in the range General Punctuation. Combinations of anceps, breve, longum and grave or acute accent can presumably be achieved by using COMBINING GRAVE ACCENT (0300) and COMBINING ACUTE ACCENT (0301) in the range Combining Diacritical Marks. However, this does not work well in most applications, so for the time being precomposed characters will be needed.
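The combining-sequence approach can be sketched as follows. The example looks the accents up by their Unicode names rather than by number; the constant and function names are illustrative, and the choice of EN DASH for the longum is one of the two options mentioned above.

```python
import unicodedata

ANCEPS = "\u00D7"   # MULTIPLICATION SIGN
LONGUM = "\u2013"   # EN DASH (FIGURE DASH would be the alternative)
ACUTE = unicodedata.lookup("COMBINING ACUTE ACCENT")
GRAVE = unicodedata.lookup("COMBINING GRAVE ACCENT")


def with_stress(symbol, primary=True):
    """Append an acute (primary) or grave (secondary) stress mark."""
    return symbol + (ACUTE if primary else GRAVE)


sequence = with_stress(ANCEPS)
print([unicodedata.name(ch) for ch in sequence])

# Normalisation does not merge the pair into one code point, because
# the standard has no precomposed character for this combination.
print(len(unicodedata.normalize("NFC", sequence)))
```

Since the sequence stays two code points long, correct display depends entirely on the font and the application positioning the accent, which is the difficulty that the precomposed characters below work around.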

Since Junicode and Alphabetum both have metrical symbols in different locations of the Private Use Area it has not been possible to avoid code point conflict for this range. Since Alphabetum has the largest selection, code points in this font have been chosen, so that metrical symbols will be placed in the F700 - F71F section. Code points marked ALPHA should be assigned by Juan-José Marcos according to the Alphabetum font.

Note that metrical symbols may soon be decommissioned following the Unicode Technical Committee approval.

Yellow colour: code points in Junicode which might be moved to Alphabetum area (and to be deprecated at a later date).

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

3

&anc;

F70A

F164

METRICAL SYMBOL ANCEPS

This symbol is almost identical to the MULTIPLICATION SIGN (00D7) in the range Latin-1 Supplement, but the anceps should be placed slightly lower, touching the base line and wholly within the x height of the font.

2

&ancacute;

F70B

F165

METRICAL SYMBOL ANCEPS WITH PRIMARY STRESS

2

&ancgrave;

F70C

F166

METRICAL SYMBOL ANCEPS WITH SECONDARY STRESS

3

&breve;

F701

F16A

METRICAL SYMBOL SHORT SYLLABLE

This symbol is similar to SPACING BREVE (02D8) in the range Spacing Modifier Letters, but should be positioned much closer to the base line.

2

&breveacute;

F706

F16B

METRICAL SYMBOL SHORT SYLLABLE WITH PRIMARY STRESS

2

&brevedblac;

ALPHA

[F715 ?]

METRICAL SYMBOL SHORT SYLLABLE WITH PRIMARY STRESS AND ALLITERATION

[Similar to F706, but with two stress marks like a double acute]

2

&brevegrave;

F707

F16C

METRICAL SYMBOL SHORT SYLLABLE WITH SECONDARY STRESS

3

&macr;

F700

F167

METRICAL SYMBOL LONG SYLLABLE

This symbol is similar to FIGURE DASH (2012) or EN DASH (2013) in the range General Punctuation, but usually positioned closer to the base line. FIGURE DASH seems to have the same graphical properties as EN DASH.

2

&macracute;

F704

F168

METRICAL SYMBOL LONG SYLLABLE WITH PRIMARY STRESS

2

&macrdblac;

ALPHA

[F714 ?]

METRICAL SYMBOL LONG SYLLABLE WITH PRIMARY STRESS AND ALLITERATION

[Similar to F704, but with two stress marks like a double acute]

2

&macrgrave;

F705

F169

METRICAL SYMBOL LONG SYLLABLE WITH SECONDARY STRESS

2

&macrbreve;

F702

METRICAL SYMBOL SHORT OR LONG SYLLABLE

2

&macrbreveacute;

F708

METRICAL SYMBOL SHORT OR LONG SYLLABLE WITH PRIMARY STRESS

2

&macrbrevegrave;

F709

METRICAL SYMBOL SHORT OR LONG SYLLABLE WITH SECONDARY STRESS

2

&dblbrevemacracute;

ALPHA

[F716 ?]

METRICAL SYMBOL RESOLVED LIFT

[From bottom upwards: two breve symbols, like in F701, side by side, a horizontal line immediately over these, and finally a stress mark like in F704 symmetrically positioned over this line]

2

&dblbrevemacrdblac;

ALPHA

[F717?]

METRICAL SYMBOL RESOLVED LIFT WITH ALLITERATION

[From bottom upwards: two breve symbols, like in F701, side by side, a horizontal line immediately over these, and finally two parallel stress marks like in F704 symmetrically positioned over this line]

MUFI subranges 11-14

Reserved for future additions: F250 to F28F (64 code points).

 


 

B: Precomposed characters (E000 - F0FF)

Precomposed characters are base line characters with one or more diacritical marks. They have been coordinated with characters in the Titus project and are allocated code points in the E000 - F0FF section of the Private Use Area. Precomposed characters will be decommissioned as soon as mature smart font technology is widely available.

Characters marked TITUS will receive code points by the TITUS project.

 


MUFI subrange 15: Base line characters with macron or overline

The overline (bar above) is probably the most used and also the most ambiguous of all abbreviation marks. There are two typical positions of the overline: above the full height of the majuscules and above the x-height of the minuscules. In the latter position it typically crosses the ascender of characters like "b", "d", "h", "k", "l", "þ" and tall "s". If the word has a mixture of characters with and without ascenders, the bar should sometimes be kept in the upper position over all characters.

There are two typical lengths of the overline: less than the width of a character, like the macron, or the full width of the character, so that it can extend as a continuous line over several characters.

With present font technology, the overline is particularly difficult. With some fonts and operating systems it will change its vertical position depending on the height of each character. Thus, in an abbreviation such as "ihc" for "Iesus", the overline may have one position over "c", a slightly higher position over "i" and an even higher position over "h".
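For reference, the "ihc" abbreviation can be encoded with the standard character U+0305 COMBINING OVERLINE after each letter, as in the sketch below (the function name is illustrative). The encoding itself is unproblematic; the uneven vertical position described above arises when the sequence is rendered.

```python
import unicodedata

OVERLINE = unicodedata.lookup("COMBINING OVERLINE")  # U+0305


def overline(text):
    """Place a combining overline after every character of text."""
    return "".join(ch + OVERLINE for ch in text)


abbreviation = overline("ihc")
print(abbreviation)
print([f"{ord(ch):04X}" for ch in abbreviation])
```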

This range is intended as a work-around until there is mature smart font technology. It has separate code points for all characters where the overline crosses the ascender, either as a single stroke (macron-length) or as a continuous stroke (overline). Some of these characters are already in the standard, such as "b" with bar across. The font designer should take care to align the overlines on all characters.

This range also has two combining overlines, one for minuscules (at the same height as the dot over "i") and one for majuscules (at the same height as the accents). Note that these combining overlines should have "hard" positions, so that they will have the same height regardless of the characters below. In this respect, they will differ from 0304 COMBINING MACRON and 0305 COMBINING OVERLINE, which may be displayed with variable height due to limitations in present font technology (cf. subrange 12 below). The combining overlines ought to have medium width, suitable for characters like "O" and "o".

Finally, the range includes a few precomposed characters for extra narrow and wide characters, such as "i", "j", "l" and "m". Subrange 5 has three characters that would otherwise go into the present subrange, i.e. LATIN SMALL LETTER K WITH STROKE (F14B), LATIN SMALL LETTER TALL S WITH STROKE (F14F) and LATIN SMALL LETTER THORN WITH STROKE (F149).

 

Glyph

Cat.

Entity

MUFI

Descriptive name

3

&macrfixhi;

TITUS

COMBINING FIXED-HEIGHT HIGH MACRON

3

&macrfixmed;

TITUS

COMBINING FIXED-HEIGHT MEDIUM-HIGH MACRON

3

&ovlfixhi;

TITUS

COMBINING FIXED-HEIGHT HIGH OVERLINE

3

&ovlfixmed;

TITUS

COMBINING FIXED-HEIGHT MEDIUM-HIGH OVERLINE

0

&bbar;

0180

LATIN SMALL LETTER B WITH STROKE

2

&bovl;

TITUS

LATIN SMALL LETTER B WITH FULL-WIDTH STROKE

0

&dbar;

0111

LATIN SMALL LETTER D WITH STROKE

2

&dovl;

TITUS

LATIN SMALL LETTER D WITH FULL-WIDTH STROKE

0

ℏ

0127

LATIN SMALL LETTER H WITH STROKE

2

&hovl;

TITUS

LATIN SMALL LETTER H WITH FULL-WIDTH STROKE

2

&kovl;

TITUS

LATIN SMALL LETTER K WITH FULL-WIDTH STROKE

2

&lbar;

TITUS

LATIN SMALL LETTER L WITH STROKE

2

&lovl;

TITUS

LATIN SMALL LETTER L WITH FULL-WIDTH STROKE

2

&stallovl;

TITUS

LATIN SMALL LETTER TALL S WITH FULL-WIDTH STROKE

2

&thornovl;

TITUS

LATIN SMALL LETTER THORN WITH FULL-WIDTH STROKE

0

Ī

012A

LATIN CAPITAL LETTER I WITH MACRON

2

&Iovl;

TITUS

LATIN CAPITAL LETTER I WITH OVERLINE

0

ī

012B

LATIN SMALL LETTER I WITH MACRON

2

&iovl;

TITUS

LATIN SMALL LETTER I WITH OVERLINE

2

&Jmacr;

E154

LATIN CAPITAL LETTER J WITH MACRON

2

&Jovl;

TITUS

LATIN CAPITAL LETTER J WITH OVERLINE

2

&jmacr;

TITUS

LATIN SMALL LETTER J WITH MACRON

2

&jovl;

TITUS

LATIN SMALL LETTER J WITH OVERLINE

2

&lmacr;

TITUS

LATIN SMALL LETTER L WITH MACRON

2

&lovl;

TITUS

LATIN SMALL LETTER L WITH OVERLINE

2

&Mmacr;

E1B8

LATIN CAPITAL LETTER M WITH MACRON

2

&Movl;

TITUS

LATIN CAPITAL LETTER M WITH OVERLINE

2

&mmacr;

E5CC

LATIN SMALL LETTER M WITH MACRON

2

&movl;

TITUS

LATIN SMALL LETTER M WITH OVERLINE

 


 


MUFI subrange 16: Precomposed characters with acute accent

Unicode 4.0 includes acute over the vowels small and capital "a", "e", "i", "o", "u", "y", "æ", and "ø", and also over the consonants (small and capital form) "c", "g", "k", "l", "m", "n", "p", "r", "s", "w", and "z". The list below contains additional character combinations.
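Whether a given base character already has a precomposed form with acute in Unicode can be checked mechanically. The following is a minimal sketch using Python's standard unicodedata module; the function name has_precomposed is an assumption, and because the module follows a later Unicode version than 4.0, the results are only indicative for the list above.

```python
import unicodedata

COMBINING_ACUTE = "\u0301"


def has_precomposed(base: str, mark: str = COMBINING_ACUTE) -> bool:
    """Return True if base + mark composes to a single code point under NFC."""
    return len(unicodedata.normalize("NFC", base + mark)) == 1


# Vowels such as "a" compose; "j" and "V" do not, which is why the
# table below lists them as additional combinations.
for base in ("a", "j", "V"):
    print(base, has_precomposed(base))
```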

Yellow colour: code points in Junicode which might be moved to the TITUS area (and be deprecated at a later date).

  

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

2

&AAligacute;
= &AAlig; + &combacute;

TITUS
=
F200 + 0301

LATIN CAPITAL LIGATURE AA WITH ACUTE
=
LATIN CAPITAL LIGATURE AA + COMBINING ACUTE ACCENT

2

&aaligacute;
= &aalig; + &combacute;

TITUS
=
F201 + 0301

LATIN SMALL LIGATURE AA WITH ACUTE
=
LATIN SMALL LIGATURE AA + COMBINING ACUTE ACCENT

2

&AOligacute;
= &AOlig; + &combacute;

TITUS
=
F202 + 0301

LATIN CAPITAL LIGATURE AO WITH ACUTE
=
LATIN CAPITAL LIGATURE AO + COMBINING ACUTE ACCENT

2

&aoligacute;
= &aolig; + &combacute;

TITUS
=
F203 + 0301

LATIN SMALL LIGATURE AO WITH ACUTE
=
LATIN SMALL LIGATURE AO + COMBINING ACUTE ACCENT

2

&AUligacute;
= &AUlig; + &combacute;

TITUS
=
F204 + 0301

LATIN CAPITAL LIGATURE AU WITH ACUTE
=
LATIN CAPITAL LIGATURE AU + COMBINING ACUTE ACCENT

2

&auligacute;
= &aulig; + &combacute;

TITUS
=
F205 + 0301

LATIN SMALL LIGATURE AU WITH ACUTE
=
LATIN SMALL LIGATURE AU + COMBINING ACUTE ACCENT

2

&AVligacute;
= &AVlig; + &combacute;

TITUS
=
F206 + 0301

LATIN CAPITAL LIGATURE AV WITH ACUTE
=
LATIN CAPITAL LIGATURE AV + COMBINING ACUTE ACCENT

2

&avligacute;
= &avlig; + &combacute;

TITUS
=
F207 + 0301

LATIN SMALL LIGATURE AV WITH ACUTE
=
LATIN SMALL LIGATURE AV + COMBINING ACUTE ACCENT

2

&Eogonacute;
= Ę + &combacute;
= E + &combogon; + &combacute;

E099
= 0118 + 0301
= 0045 + 0328 + 0301

LATIN CAPITAL LETTER E WITH OGONEK AND ACUTE
= LATIN CAPITAL LETTER E WITH OGONEK + COMBINING ACUTE ACCENT
= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING ACUTE ACCENT

2

&eogonacute;
= ę + &combacute;
= e + &combogon; + &combacute;

E499
= 0119 + 0301
= 0065 + 0328 + 0301

LATIN SMALL LETTER E WITH OGONEK AND ACUTE
= LATIN SMALL LETTER E WITH OGONEK + COMBINING ACUTE ACCENT
= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING ACUTE ACCENT

2

&Jacute;
= J + &combacute;

E153
= 004A + 0301

LATIN CAPITAL LETTER J WITH ACUTE
= LATIN CAPITAL LETTER J + COMBINING ACUTE ACCENT

2

&jacute;
= j + &combacute;

E553
= 006A + 0301

LATIN SMALL LETTER J WITH ACUTE
= LATIN SMALL LETTER J + COMBINING ACUTE ACCENT

2

&Oogonacute;
= &Oogon; + &combacute;
= O + &combogon; + &combacute;

E20C
= 01EA + 0301
= 004F + 0328 + 0301

F190

LATIN CAPITAL LETTER O WITH OGONEK AND ACUTE
= LATIN CAPITAL LETTER O WITH OGONEK + COMBINING ACUTE ACCENT
= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING ACUTE ACCENT

2

&oogonacute;
= &oogon; + &combacute;
= o + &combogon; + &combacute;

E60C
= 01EB + 0301
= 006F + 0328 + 0301

F191

LATIN SMALL LETTER O WITH OGONEK AND ACUTE
= LATIN SMALL LETTER O WITH OGONEK + COMBINING ACUTE ACCENT
= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING ACUTE ACCENT

2

&Oslashogonacute;
= &Oslashacute; + &combogon;
= Ø + &combogon; + &combacute;

TITUS
= 01FE + 0328
= 00D8 + 0328 + 0301

LATIN CAPITAL LETTER O WITH STROKE AND OGONEK AND ACUTE
=
LATIN CAPITAL LETTER O WITH STROKE AND ACUTE + COMBINING OGONEK
= LATIN CAPITAL LETTER O WITH STROKE + COMBINING OGONEK + COMBINING ACUTE ACCENT

2

&oslashogonacute;
= &oslashacute; + &combogon;
= ø + &combogon; + &combacute;

TITUS
= 01FF + 0328
= 00F8 + 0328 + 0301

LATIN SMALL LETTER O WITH STROKE AND OGONEK AND ACUTE
=
LATIN SMALL LETTER O WITH STROKE AND ACUTE + COMBINING OGONEK
= LATIN SMALL LETTER O WITH STROKE + COMBINING OGONEK + COMBINING ACUTE ACCENT

2

&OOligacute;
= &OOlig; + &combacute;

TITUS
=
F29C + 0301

LATIN CAPITAL LIGATURE OO WITH ACUTE
=
LATIN CAPITAL LIGATURE OO + COMBINING ACUTE ACCENT

2

&ooligacute;
= &oolig; + &combacute;

TITUS
=
F29D + 0301

LATIN SMALL LIGATURE OO WITH ACUTE
=
LATIN SMALL LIGATURE OO + COMBINING ACUTE ACCENT

2

&Vacute;
= V + &combacute;

E33A
= 0056 + 0301

LATIN CAPITAL LETTER V WITH ACUTE
= LATIN CAPITAL LETTER V + COMBINING ACUTE ACCENT

2

&vacute;
= v + &combacute;

E73A
= 0076 + 0301

LATIN SMALL LETTER V WITH ACUTE
= LATIN SMALL LETTER V + COMBINING ACUTE ACCENT

 



MUFI subrange 17: Precomposed characters with double acute accent

Unicode 4.0 includes double acute over the small and capital "o" and "u". The list below contains additional character combinations.

 

Glyph

Cat.

Entity

MUFI

Descriptive name

2

&Adblac;
= A + &combdblac;

E025
= 0041 + 030B

LATIN CAPITAL LETTER A WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER A + COMBINING DOUBLE ACUTE ACCENT

2

&adblac;
= a + &combdblac;

E425
= 0061 + 030B

LATIN SMALL LETTER A WITH DOUBLE ACUTE
= LATIN SMALL LETTER A + COMBINING DOUBLE ACUTE ACCENT

2

&AAligdblac;
= &AAlig; + &combdblac;

TITUS
=
F200 + 030B

LATIN CAPITAL LIGATURE AA WITH DOUBLE ACUTE
=
LATIN CAPITAL LIGATURE AA + COMBINING DOUBLE ACUTE ACCENT

2

&aaligdblac;
= &aalig; + &combdblac;

TITUS
=
F201 + 030B

LATIN SMALL LIGATURE AA WITH DOUBLE ACUTE
=
LATIN SMALL LIGATURE AA + COMBINING DOUBLE ACUTE ACCENT

2

&AEligdblac;
= Æ + &combdblac;

TITUS
= 00C6 + 030B

LATIN CAPITAL LETTER AE WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER AE + COMBINING DOUBLE ACUTE ACCENT

2

&aeligdblac;
= æ + &combdblac;

TITUS
= 00E6 + 030B

LATIN SMALL LETTER AE WITH DOUBLE ACUTE
= LATIN SMALL LETTER AE + COMBINING DOUBLE ACUTE ACCENT

2

&Edblac;
= E + &combdblac;

E0D1
= 0045 + 030B

LATIN CAPITAL LETTER E WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER E + COMBINING DOUBLE ACUTE ACCENT

2

&edblac;
= e + &combdblac;

E4D1
= 0065 + 030B

LATIN SMALL LETTER E WITH DOUBLE ACUTE
= LATIN SMALL LETTER E + COMBINING DOUBLE ACUTE ACCENT

2

&Idblac;
= I + &combdblac;

E143
= 0049 + 030B

LATIN CAPITAL LETTER I WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER I + COMBINING DOUBLE ACUTE ACCENT

2

&idblac;
= i + &combdblac;

E543
= 0069 + 030B

LATIN SMALL LETTER I WITH DOUBLE ACUTE
= LATIN SMALL LETTER I + COMBINING DOUBLE ACUTE ACCENT

2

&Jdblac;
= J + &combdblac;

TITUS
= 004A + 030B

LATIN CAPITAL LETTER J WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER J + COMBINING DOUBLE ACUTE ACCENT

2

&jdblac;
= j + &combdblac;

TITUS
= 006A + 030B

LATIN SMALL LETTER J WITH DOUBLE ACUTE
= LATIN SMALL LETTER J + COMBINING DOUBLE ACUTE ACCENT

2

&OOligdblac;
= &OOlig; + &combdblac;

TITUS
=
F29C + 030B

LATIN CAPITAL LIGATURE OO WITH DOUBLE ACUTE
=
LATIN CAPITAL LIGATURE OO + COMBINING DOUBLE ACUTE ACCENT

2

&ooligdblac;
= &oolig; + &combdblac;

TITUS
=
F29D + 030B

LATIN SMALL LIGATURE OO WITH DOUBLE ACUTE
=
LATIN SMALL LIGATURE OO + COMBINING DOUBLE ACUTE ACCENT

2

&Vdblac;
= V + &combdblac;

TITUS
= 0056 + 030B

LATIN CAPITAL LETTER V WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER V + COMBINING DOUBLE ACUTE ACCENT

2

&vdblac;
= v + &combdblac;

TITUS
= 0076 + 030B

LATIN SMALL LETTER V WITH DOUBLE ACUTE
= LATIN SMALL LETTER V + COMBINING DOUBLE ACUTE ACCENT

2

&Ydblac;
= Y + &combdblac;

E37C
= 0059 + 030B

LATIN CAPITAL LETTER Y WITH DOUBLE ACUTE
= LATIN CAPITAL LETTER Y + COMBINING DOUBLE ACUTE ACCENT

2

&ydblac;
= y + &combdblac;

E77C
= 0079 + 030B

LATIN SMALL LETTER Y WITH DOUBLE ACUTE
= LATIN SMALL LETTER Y + COMBINING DOUBLE ACUTE ACCENT

2

&Ydotacute;
= Y + &combdot; + &combacute;

TITUS
= 0059 + 0307 + 0301

LATIN CAPITAL LETTER Y WITH DOT ABOVE AND ACUTE
= LATIN CAPITAL LETTER Y + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT

2

&ydotacute;
= y + &combdot; + &combacute;

TITUS
= 0079 + 0307 + 0301

LATIN SMALL LETTER Y WITH DOT ABOVE AND ACUTE
= LATIN SMALL LETTER Y + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT

 



 

MUFI subrange 18: Precomposed characters with dot above

Unicode 4.0 includes approx. 40 characters with a dot above in three ranges, Latin Extended-A, Latin Extended-B and Latin Extended Additional, intended for use in several languages, mostly Irish Gaelic (old orthography), and - added in 3.2 - also for Livonian. The precomposed characters are small and capital forms of "a", "b", "c", "d", "e", "f", "g", "h", "m", "n", "o", "p", "r", "s", "t", "w", "x", "y", "z", and tall "s" (no capital version). The list below contains additional character combinations.

Glyph

Cat.

Entity

MUFI

Descriptive name

2

&AAligdot;
= &AAlig; + &combdot;

TITUS
=
F200 + 0307

LATIN CAPITAL LIGATURE AA WITH DOT ABOVE
=
LATIN CAPITAL LIGATURE AA + COMBINING DOT ABOVE

2

&aaligdot;
= &aalig; + &combdot;

TITUS
=
F201 + 0307

LATIN SMALL LIGATURE AA WITH DOT ABOVE
=
LATIN SMALL LIGATURE AA + COMBINING DOT ABOVE

2

&AYligdot;
= &AYlig; + &combdot;

TITUS
=
F20A + 0307

LATIN CAPITAL LIGATURE AY WITH DOT ABOVE
=
LATIN CAPITAL LIGATURE AY + COMBINING DOT ABOVE

2

&ayligdot;
= &aylig; + &combdot;

TITUS
=
F20B + 0307

LATIN SMALL LIGATURE AY WITH DOT ABOVE
=
LATIN SMALL LIGATURE AY + COMBINING DOT ABOVE

2

&gscapdot;
= &gscap; + &combdot;

TITUS
= 0262 + 0307

LATIN LETTER SMALL CAPITAL G WITH DOT ABOVE
= LATIN LETTER SMALL CAPITAL G + COMBINING DOT ABOVE

2

&kdot;
= k + &combdot;

TITUS
= 006B + 0307

LATIN SMALL LETTER K WITH DOT ABOVE
= LATIN SMALL LETTER K + COMBINING DOT ABOVE

2

&nscapdot;
= &nscap; + &combdot;

TITUS
= 0274 + 0307

LATIN LETTER SMALL CAPITAL N WITH DOT ABOVE
= LATIN LETTER SMALL CAPITAL N + COMBINING DOT ABOVE

2

&rscapdot;
= &rscap; + &combdot;

TITUS
= 0280 + 0307

LATIN LETTER SMALL CAPITAL R WITH DOT ABOVE
= LATIN LETTER SMALL CAPITAL R + COMBINING DOT ABOVE

2

&sscapdot;
= &sscap; + &combdot;

TITUS
=
F217 + 0307

LATIN LETTER SMALL CAPITAL S WITH DOT ABOVE
=
LATIN LETTER SMALL CAPITAL S + COMBINING DOT ABOVE

2

&tscapdot;
= &tscap; + &combdot;

TITUS
= 1D1B + 0307

LATIN LETTER SMALL CAPITAL T WITH DOT ABOVE
= LATIN LETTER SMALL CAPITAL T + COMBINING DOT ABOVE

2

&Vdot;
= V + &combdot;

TITUS
= 0056 + 0307

LATIN CAPITAL LETTER V WITH DOT ABOVE
= LATIN CAPITAL LETTER V + COMBINING DOT ABOVE

2

&vdot;
= v + &combdot;

TITUS
= 0076 + 0307

LATIN SMALL LETTER V WITH DOT ABOVE
= LATIN SMALL LETTER V + COMBINING DOT ABOVE

2

&Vinsdot;
= &Vins; + &combdot;

TITUS
=
F2A1 + 0307

LATIN CAPITAL LETTER INSULAR V (VENTH) WITH DOT ABOVE
=
LATIN CAPITAL LETTER INSULAR V (VENTH) + COMBINING DOT ABOVE

2

&vinsdot;
= &vins; + &combdot;

TITUS
=
F2A2 + 0307

LATIN SMALL LETTER INSULAR V (VENTH) WITH DOT ABOVE
=
LATIN SMALL LETTER INSULAR V (VENTH) + COMBINING DOT ABOVE

 



MUFI subrange 19: Precomposed characters with dot below

Unicode 4.0 includes no less than 38 characters with a dot below, basically the whole alphabet, "a-z" in the Latin Extended Additional range. The list below contains additional character combinations.

Glyph

Cat.

Entity

MUFI

Descriptive name

2

&AAligdotbl;
= &AAlig; + &combdotbl;

TITUS
= F200 + 0323

LATIN CAPITAL LIGATURE AA WITH DOT BELOW
= LATIN CAPITAL LIGATURE AA + COMBINING DOT BELOW

2

&aaligdotbl;
= &aalig; + &combdotbl;

TITUS
= F201 + 0323

LATIN SMALL LIGATURE AA WITH DOT BELOW
= LATIN SMALL LIGATURE AA + COMBINING DOT BELOW

2

&AOligdotbl;
= &AOlig; + &combdotbl;

TITUS
= F202 + 0323

LATIN CAPITAL LIGATURE AO WITH DOT BELOW
= LATIN CAPITAL LIGATURE AO + COMBINING DOT BELOW

2

&aoligdotbl;
= &aolig; + &combdotbl;

TITUS
= F203 + 0323

LATIN SMALL LIGATURE AO WITH DOT BELOW
= LATIN SMALL LIGATURE AO + COMBINING DOT BELOW

2

&AUligdotbl;
= &AUlig; + &combdotbl;

TITUS
= F204 + 0323

LATIN CAPITAL LIGATURE AU WITH DOT BELOW
= LATIN CAPITAL LIGATURE AU + COMBINING DOT BELOW

2

&auligdotbl;
= &aulig; + &combdotbl;

TITUS
= F205 + 0323

LATIN SMALL LIGATURE AU WITH DOT BELOW
= LATIN SMALL LIGATURE AU + COMBINING DOT BELOW

2

&AVligdotbl;
= &AVlig; + &combdotbl;

TITUS
= F206 + 0323

LATIN CAPITAL LIGATURE AV WITH DOT BELOW
= LATIN CAPITAL LIGATURE AV + COMBINING DOT BELOW

2

&avligdotbl;
= &avlig; + &combdotbl;

TITUS
= F207 + 0323

LATIN SMALL LIGATURE AV WITH DOT BELOW
= LATIN SMALL LIGATURE AV + COMBINING DOT BELOW

2

&AYligdotbl;
= &AYlig; + &combdotbl;

TITUS
= F20A + 0323

LATIN CAPITAL LIGATURE AY WITH DOT BELOW
= LATIN CAPITAL LIGATURE AY + COMBINING DOT BELOW

2

&ayligdotbl;
= &aylig; + &combdotbl;

TITUS
= F20B + 0323

LATIN SMALL LIGATURE AY WITH DOT BELOW
= LATIN SMALL LIGATURE AY + COMBINING DOT BELOW

2

&bscapdotbl;
= &bscap; + &combdotbl;

TITUS
= 0299 + 0323

LATIN LETTER SMALL CAPITAL B WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL B + COMBINING DOT BELOW

2

&Cdotbl;
= C + &combdotbl;

E066
= 0043 + 0323

LATIN CAPITAL LETTER C WITH DOT BELOW
= LATIN CAPITAL LETTER C + COMBINING DOT BELOW

2

&cdotbl;
= c + &combdotbl;

E466
= 0063 + 0323

LATIN SMALL LETTER C WITH DOT BELOW
= LATIN SMALL LETTER C + COMBINING DOT BELOW

2

&dscapdotbl;
= &dscap; + &combdotbl;

TITUS
= 1D05 + 0323

LATIN LETTER SMALL CAPITAL D WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL D + COMBINING DOT BELOW

2

&ETHdotbl;
= Ð + &combdotbl;

TITUS
= 00D0 + 0323

LATIN CAPITAL LETTER ETH WITH DOT BELOW
= LATIN CAPITAL LETTER ETH + COMBINING DOT BELOW

2

&ethdotbl;
= ð + &combdotbl;

TITUS
= 00F0 + 0323

LATIN SMALL LETTER ETH WITH DOT BELOW
= LATIN SMALL LETTER ETH + COMBINING DOT BELOW

2

&Eogondotbl;
= Ę + &combdotbl;

TITUS
= 0118 + 0323

LATIN CAPITAL LETTER E WITH OGONEK AND DOT BELOW
= LATIN CAPITAL LETTER E WITH OGONEK + COMBINING DOT BELOW

2

&eogondotbl;
= ę + &combdotbl;

TITUS
= 0119 + 0323

LATIN SMALL LETTER E WITH OGONEK AND DOT BELOW
= LATIN SMALL LETTER E WITH OGONEK + COMBINING DOT BELOW

2

&Fdotbl;
= F + &combdotbl;

E0EE
= 0046 + 0323

LATIN CAPITAL LETTER F WITH DOT BELOW
= LATIN CAPITAL LETTER F + COMBINING DOT BELOW

2

&fdotbl;
= f + &combdotbl;

E4EE
= 0066 + 0323

LATIN SMALL LETTER F WITH DOT BELOW
= LATIN SMALL LETTER F + COMBINING DOT BELOW

2

&Finsdotbl;
= &Fins; + &combdotbl;

TITUS
= F297 + 0323

LATIN CAPITAL LETTER INSULAR F WITH DOT BELOW
= LATIN CAPITAL LETTER INSULAR F + COMBINING DOT BELOW

2

&finsdotbl;
= &fins; + &combdotbl;

TITUS
= F103 + 0323

LATIN SMALL LETTER INSULAR F WITH DOT BELOW
= LATIN SMALL LETTER INSULAR F + COMBINING DOT BELOW

2

&Gdotbl;
= G + &combdotbl;

E101
= 0047 + 0323

LATIN CAPITAL LETTER G WITH DOT BELOW
= LATIN CAPITAL LETTER G + COMBINING DOT BELOW

2

&gdotbl;
= g + &combdotbl;

E501
= 0067 + 0323

LATIN SMALL LETTER G WITH DOT BELOW
= LATIN SMALL LETTER G + COMBINING DOT BELOW

2

&gscapdotbl;
= &gscap; + &combdotbl;

TITUS
= 0262 + 0323

LATIN LETTER SMALL CAPITAL G WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL G + COMBINING DOT BELOW

2

&Jdotbl;
= J + &combdotbl;

TITUS
= 004A + 0323

LATIN CAPITAL LETTER J WITH DOT BELOW
= LATIN CAPITAL LETTER J + COMBINING DOT BELOW

2

&jdotbl;
= j + &combdotbl;

TITUS
= 006A + 0323

LATIN SMALL LETTER J WITH DOT BELOW
= LATIN SMALL LETTER J + COMBINING DOT BELOW

2

&lscapdotbl;
= &lscap; + &combdotbl;

TITUS
= 029F + 0323

LATIN LETTER SMALL CAPITAL L WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL L + COMBINING DOT BELOW

2

&mscapdotbl;
= &mscap; + &combdotbl;

TITUS
= 1D0D + 0323

LATIN LETTER SMALL CAPITAL M WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL M + COMBINING DOT BELOW

2

&nscapdotbl;
= &nscap; + &combdotbl;

TITUS
= 0274 + 0323

LATIN LETTER SMALL CAPITAL N WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL N + COMBINING DOT BELOW

2

&Oogondotbl;
= &Oogon; + &combdotbl;

TITUS
= 01EA + 0323

LATIN CAPITAL LETTER O WITH OGONEK AND DOT BELOW
= LATIN CAPITAL LETTER O WITH OGONEK + COMBINING DOT BELOW

2

&oogondotbl;
= &oogon; + &combdotbl;

TITUS
= 01EB + 0323

LATIN SMALL LETTER O WITH OGONEK AND DOT BELOW
= LATIN SMALL LETTER O WITH OGONEK + COMBINING DOT BELOW

2

&OOligdotbl;
= &OOlig; + &combdotbl;

TITUS
= F29C + 0323

LATIN CAPITAL LIGATURE OO WITH DOT BELOW
= LATIN CAPITAL LIGATURE OO + COMBINING DOT BELOW

2

&ooligdotbl;
= &oolig; + &combdotbl;

TITUS
= F29D + 0323

LATIN SMALL LIGATURE OO WITH DOT BELOW
= LATIN SMALL LIGATURE OO + COMBINING DOT BELOW

2

&Pdotbl;
= P + &combdotbl;

E26D
= 0050 + 0323

LATIN CAPITAL LETTER P WITH DOT BELOW
= LATIN CAPITAL LETTER P + COMBINING DOT BELOW

2

&pdotbl;
= p + &combdotbl;

E66D
= 0070 + 0323

LATIN SMALL LETTER P WITH DOT BELOW
= LATIN SMALL LETTER P + COMBINING DOT BELOW

2

&Qdotbl;
= Q + &combdotbl;

E288
= 0051 + 0323

LATIN CAPITAL LETTER Q WITH DOT BELOW
= LATIN CAPITAL LETTER Q + COMBINING DOT BELOW

2

&qdotbl;
= q + &combdotbl;

E688
= 0071 + 0323

LATIN SMALL LETTER Q WITH DOT BELOW
= LATIN SMALL LETTER Q + COMBINING DOT BELOW

2

&rscapdotbl;
= &rscap; + &combdotbl;

TITUS
= 0280 + 0323

LATIN LETTER SMALL CAPITAL R WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL R + COMBINING DOT BELOW

2

&rrotdotbl;
= &rrot; + &combdotbl;

TITUS
= F40A + 0323

LATIN SMALL LETTER R ROTUNDA WITH DOT BELOW
= LATIN SMALL LETTER R ROTUNDA + COMBINING DOT BELOW

2

&sscapdotbl;
= &sscap; + &combdotbl;

TITUS
= F4A5 + 0323

LATIN LETTER SMALL CAPITAL S WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL S + COMBINING DOT BELOW

2

&stalldotbl;
= &stall; + &combdotbl;

TITUS
= 017F + 0323

LATIN SMALL LETTER TALL S WITH DOT BELOW
= LATIN SMALL LETTER TALL S + COMBINING DOT BELOW

2

&tscapdotbl;
= &tscap; + &combdotbl;

TITUS
= 1D1B + 0323

LATIN LETTER SMALL CAPITAL T WITH DOT BELOW
= LATIN LETTER SMALL CAPITAL T + COMBINING DOT BELOW

2

&THORNdotbl;
= Þ + &combdotbl;

TITUS
= 00DE + 0323

LATIN CAPITAL LETTER THORN WITH DOT BELOW
= LATIN CAPITAL LETTER THORN + COMBINING DOT BELOW

2

&thorndotbl;
= þ + &combdotbl;

TITUS
= 00FE + 0323

LATIN SMALL LETTER THORN WITH DOT BELOW
= LATIN SMALL LETTER THORN + COMBINING DOT BELOW

2

&Vinsdotbl;
= &Vins; + &combdotbl;

TITUS
= F2A1 + 0323

LATIN CAPITAL LETTER INSULAR V (VENTH) WITH DOT BELOW
= LATIN CAPITAL LETTER INSULAR V (VENTH) + COMBINING DOT BELOW

2

&vinsdotbl;
= &vins; + &combdotbl;

TITUS
= F2A2 + 0323

LATIN SMALL LETTER INSULAR V (VENTH) WITH DOT BELOW
= LATIN SMALL LETTER INSULAR V (VENTH) + COMBINING DOT BELOW

 



MUFI subrange 20: Precomposed characters with diaeresis

Unicode 4.0 includes double dot (diaeresis) over the small and capital characters "a", "e", "i", "o", "u", "y" in the ranges Latin-1 Supplement (all of these except capital "y") and Latin Extended-A (capital "y"), as well as small and capital "h", "w", "x" and "t" (the latter only small) in the range Latin Extended Additional. The list below contains additional character combinations.

Glyph

Cat.

Entity

MUFI

Descriptive name

2

&AAliguml;
= &AAlig; + &combuml;

TITUS
=
F200 + 0308

LATIN CAPITAL LIGATURE AA WITH DIAERESIS
=
LATIN CAPITAL LIGATURE AA + COMBINING DIAERESIS

2

&aaliguml;
= &aalig; + &combuml;

TITUS
=
F201 + 0308

LATIN SMALL LIGATURE AA WITH DIAERESIS
=
LATIN SMALL LIGATURE AA + COMBINING DIAERESIS

2

&AEliguml;
= Æ + &combuml;

TITUS
= 00C6 + 0308

LATIN CAPITAL LIGATURE AE WITH DIAERESIS
= LATIN CAPITAL LIGATURE AE + COMBINING DIAERESIS

2

&aeliguml;
= æ + &combuml;

TITUS
= 00E6 + 0308

LATIN SMALL LIGATURE AE WITH DIAERESIS
=
LATIN SMALL LIGATURE AE + COMBINING DIAERESIS

 



MUFI subrange 21: Precomposed characters with a hook above

Unicode 4.0 has a hook above (curl) in combination with small and capital "i" in the Latin Extended Additional range. The list below contains additional character combinations.

Glyph

Cat.

Entity

MUFI

Descriptive name

2

&Acurl;
= A + &combcurl;

TITUS
= 0041 +
F1C4

LATIN CAPITAL LETTER A WITH CURL
= LATIN CAPITAL LETTER A +
COMBINING CURL

2

&acurl;
= a + &combcurl;

TITUS
= 0061 +
F1C4

LATIN SMALL LETTER A WITH CURL
= LATIN SMALL LETTER A +
COMBINING CURL

2

&Ecurl;
= E + &combcurl;

TITUS
= 0045 +
F1C4

LATIN CAPITAL LETTER E WITH CURL
= LATIN CAPITAL LETTER E +
COMBINING CURL

2

&ecurl;
= e + &combcurl;

TITUS
= 0065 +
F1C4

LATIN SMALL LETTER E WITH CURL
= LATIN SMALL LETTER E +
COMBINING CURL

0

&Icurl;
= I + &combcurl;

1EC8
= 0049 +
F1C4

LATIN CAPITAL LETTER I WITH HOOK ABOVE (CURL)
= LATIN CAPITAL LETTER I +
COMBINING CURL

0

&icurl;
= i + &combcurl;

1EC9
= 0131 +
F1C4

LATIN SMALL LETTER I WITH HOOK ABOVE (CURL)
= LATIN SMALL LETTER DOTLESS I +
COMBINING CURL

2

&Jcurl;
= J + &combcurl;

TITUS
= 004A +
F1C4

LATIN CAPITAL LETTER J WITH CURL
= LATIN CAPITAL LETTER J +
COMBINING CURL

2

&jcurl;
= j + &combcurl;

TITUS
=
F40B + F1C4

LATIN SMALL LETTER J WITH CURL
= LATIN SMALL LETTER DOTLESS J +
COMBINING CURL

2

&Ocurl;
= O + &combcurl;

TITUS
= 004F +
F1C4

LATIN CAPITAL LETTER O WITH CURL
= LATIN CAPITAL LETTER O +
COMBINING CURL

2

&ocurl;
= o + &combcurl;

TITUS
= 006F +
F1C4

LATIN SMALL LETTER O WITH CURL
= LATIN SMALL LETTER O +
COMBINING CURL

2

&Oslashcurl;
= Ø + &combcurl;

TITUS
= 00D8 +
F1C4

LATIN CAPITAL LETTER O WITH STROKE AND CURL
= LATIN CAPITAL LETTER O WITH STROKE +
COMBINING CURL

2

&oslashcurl;
= ø + &combcurl;

TITUS
= 00F8 +
F1C4

LATIN SMALL LETTER O WITH STROKE AND CURL
= LATIN SMALL LETTER O WITH STROKE +
COMBINING CURL

2

&Ucurl;
= U + &combcurl;

TITUS
= 0055 +
F1C4

LATIN CAPITAL LETTER U WITH CURL
= LATIN CAPITAL LETTER U +
COMBINING CURL

2

&ucurl;
= u + &combcurl;

TITUS
= 0075 +
F1C4

LATIN SMALL LETTER U WITH CURL
= LATIN SMALL LETTER U +
COMBINING CURL

2

&Ycurl;
= Y + &combcurl;

TITUS
= 0059 +
F1C4

LATIN CAPITAL LETTER Y WITH CURL
= LATIN CAPITAL LETTER Y +
COMBINING CURL

2

&ycurl;
= y + &combcurl;

TITUS
= 0079 +
F1C4

LATIN SMALL LETTER Y WITH CURL
= LATIN SMALL LETTER Y +
COMBINING CURL

 



MUFI subrange 22: Precomposed characters with ogonek

Unicode 4.0 includes small and capital forms of "a ogonek" and "e ogonek" in the range Latin Extended-A (0104, 0105, 0118, 0119), and "o ogonek" in the range Latin Extended-B (01EA, 01EB). The list below contains additional character combinations. The combining ogonek (0328) is part of the range Combining Diacritical Marks.
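The interaction between the combining ogonek and marks placed above the letter can be illustrated with Unicode normalisation. The sketch below is an illustration only, using Python's standard unicodedata module (which follows a later Unicode version than 4.0); the helper name compose is an assumption. It shows that the ogonek has a lower canonical combining class than the acute, so that either typing order normalises to the same sequence.

```python
import unicodedata

OGONEK, ACUTE = "\u0328", "\u0301"

# Canonical combining classes: the ogonek attaches below (202),
# the acute is placed above (230).
print(unicodedata.combining(OGONEK), unicodedata.combining(ACUTE))


def compose(sequence: str) -> str:
    """Normalise a character sequence to NFC."""
    return unicodedata.normalize("NFC", sequence)


# Both input orders yield 0119 (e with ogonek) followed by 0301.
same = compose("e" + ACUTE + OGONEK) == compose("e" + OGONEK + ACUTE)
print(same)
```

By contrast, "ø" followed by 0328 remains two code points after normalisation, since no precomposed form is available.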

 

Glyph

Cat.

Entity

MUFI

Descriptive name

2

&AEligogon;
= Æ + &combogon;

TITUS
= 00C6 + 0328

LATIN CAPITAL LIGATURE AE WITH OGONEK
= LATIN CAPITAL LIGATURE AE + COMBINING OGONEK

2

&aeligogon;
= æ + &combogon;

TITUS
= 00E6 + 0328

LATIN SMALL LIGATURE AE WITH OGONEK
= LATIN SMALL LIGATURE AE + COMBINING OGONEK

2

&Oogoncurl;
= O + &combogon; + &combcurl;

TITUS
= 004F + 0328 +
F1C4

LATIN CAPITAL LETTER O WITH OGONEK AND CURL
= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING CURL

2

&oogoncurl;
= o + &combogon; + &combcurl;

TITUS
= 006F + 0328 +
F1C4

LATIN SMALL LETTER O WITH OGONEK AND CURL
= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING CURL

2

&Oslashogon;
= Ø + &combogon;

TITUS
= 00D8 + 0328

LATIN CAPITAL LETTER O WITH STROKE AND OGONEK
= LATIN CAPITAL LETTER O WITH STROKE + COMBINING OGONEK

2

&oslashogon;
= ø + &combogon;

TITUS
= 00F8 + 0328

LATIN SMALL LETTER O WITH STROKE AND OGONEK
= LATIN SMALL LETTER O WITH STROKE + COMBINING OGONEK

2

&togon;
= t + &combogon;

TITUS
= 0074 + 0328

LATIN SMALL LETTER T WITH OGONEK
= LATIN SMALL LETTER T + COMBINING OGONEK

 



 

MUFI subrange 23: Precomposed characters with breve

Unicode 4.0 includes the breve over small and capital "a", "e", "i", "o", and "u", and also over the consonant "g", all in the range Latin Extended-A. The list below contains additional character combinations.

Green colour: code points in Junicode which may be allocated by TITUS, even if they are outside the TITUS area (E000 - F0FF).

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

2

&AEligbreve;
= Æ + &combbreve;

TITUS
= 00C6 + 0306

F13C

LATIN CAPITAL LIGATURE AE WITH BREVE
= LATIN CAPITAL LIGATURE AE + COMBINING BREVE

2

&aeligbreve;
= æ + &combbreve;

TITUS
= 00E6 + 0306

F13D

LATIN SMALL LIGATURE AE WITH BREVE
= LATIN SMALL LIGATURE AE + COMBINING BREVE

2

&Ybreve;
= Y + &combbreve;

TITUS
= 0059 + 0306

F140

LATIN CAPITAL LETTER Y WITH BREVE
= LATIN CAPITAL LETTER Y + COMBINING BREVE

2

&ybreve;
= y + &combbreve;

TITUS
= 0079 + 0306

F141

LATIN SMALL LETTER Y WITH BREVE
= LATIN SMALL LETTER Y + COMBINING BREVE

 



MUFI subrange 24: Precomposed characters with macron and breve (prev. part of subrange 11)

Unicode 4.0 has no precomposed characters with both macron and breve. The list below contains expected character combinations.

Green colour: code points in Junicode which may be allocated by TITUS, even if they are outside the TITUS area (E000 - F0FF).

 

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

2

&Amacrbreve;
= A + &combmacr; + &combbreve;

TITUS
= 0041 + 0304 + 0306

F170 (?)

LATIN CAPITAL LETTER A WITH MACRON AND BREVE
= LATIN CAPITAL LETTER A + COMBINING MACRON + COMBINING BREVE

2

&amacrbreve;
= a + &combmacr; + &combbreve;

TITUS
= 0061 + 0304 + 0306

F171

LATIN SMALL LETTER A WITH MACRON AND BREVE
= LATIN SMALL LETTER A + COMBINING MACRON + COMBINING BREVE

2

&AEligmacrbreve;
= Æ + &combmacr; + &combbreve;

TITUS
= 00C6 + 0304 + 0306

F172 (?)

LATIN CAPITAL LIGATURE AE WITH MACRON AND BREVE
= LATIN CAPITAL LIGATURE AE + COMBINING MACRON + COMBINING BREVE

2

&aeligmacrbreve;
= æ + &combmacr; + &combbreve;

TITUS
= 00E6 + 0304 + 0306

F173

LATIN SMALL LIGATURE AE WITH MACRON AND BREVE
= LATIN SMALL LIGATURE AE + COMBINING MACRON + COMBINING BREVE

2

&Emacrbreve;
= E + &combmacr; + &combbreve;

TITUS
= 0045 + 0304 + 0306

F174 (?)

LATIN CAPITAL LETTER E WITH MACRON AND BREVE
= LATIN CAPITAL LETTER E + COMBINING MACRON + COMBINING BREVE

2

&emacrbreve;
= e + &combmacr; + &combbreve;

TITUS
= 0065 + 0304 + 0306

F175

LATIN SMALL LETTER E WITH MACRON AND BREVE
= LATIN SMALL LETTER E + COMBINING MACRON + COMBINING BREVE

2

&Imacrbreve;
= I + &combmacr; + &combbreve;

TITUS
= 0049 + 0304 + 0306

F176 (?)

LATIN CAPITAL LETTER I WITH MACRON AND BREVE
= LATIN CAPITAL LETTER I + COMBINING MACRON + COMBINING BREVE

2

&imacrbreve;
= i + &combmacr; + &combbreve;

TITUS
= 0069 + 0304 + 0306

F177

LATIN SMALL LETTER I WITH MACRON AND BREVE
= LATIN SMALL LETTER I + COMBINING MACRON + COMBINING BREVE

2

&Omacrbreve;
= O + &combmacr; + &combbreve;

TITUS
= 004F + 0304 + 0306

F178 (?)

LATIN CAPITAL LETTER O WITH MACRON AND BREVE
= LATIN CAPITAL LETTER O + COMBINING MACRON + COMBINING BREVE

2

&omacrbreve;
= o + &combmacr; + &combbreve;

TITUS
= 006F + 0304 + 0306

F179

LATIN SMALL LETTER O WITH MACRON AND BREVE
= LATIN SMALL LETTER O + COMBINING MACRON + COMBINING BREVE

2

&Umacrbreve;
= U + &combmacr; + &combbreve;

TITUS
= 0055 + 0304 + 0306

F17A (?)

LATIN CAPITAL LETTER U WITH MACRON AND BREVE
= LATIN CAPITAL LETTER U + COMBINING MACRON + COMBINING BREVE

2

&umacrbreve;
= u + &combmacr; + &combbreve;

TITUS
= 0075 + 0304 + 0306

F17B

LATIN SMALL LETTER U WITH MACRON AND BREVE
= LATIN SMALL LETTER U + COMBINING MACRON + COMBINING BREVE

2

&Ymacrbreve;
= Y + &combmacr; + &combbreve;

TITUS
= 0059 + 0304 + 0306

F17C (?)

LATIN CAPITAL LETTER Y WITH MACRON AND BREVE
= LATIN CAPITAL LETTER Y + COMBINING MACRON + COMBINING BREVE

2

&ymacrbreve;
= y + &combmacr; + &combbreve;

TITUS
= 0079 + 0304 + 0306

F17D

LATIN SMALL LETTER Y WITH MACRON AND BREVE
= LATIN SMALL LETTER Y + COMBINING MACRON + COMBINING BREVE

 



MUFI subrange 25: Precomposed characters with macron and acute

Unicode 4.0 includes macron and acute over small and capital "e" and "o" in the range Latin Extended Additional (1E16, 1E17, 1E52, 1E53). Note that F185 (= 1E16) and F189 (= 1E52) should be considered for deprecation in Junicode. The list below contains additional character combinations.
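The four precomposed characters named above decompose canonically into a base letter followed by the combining macron (0304) and the combining acute (0301), which is the same order used in the decompositions listed below. A small sketch with Python's standard unicodedata module makes this visible; the helper name code_points is an assumption introduced for illustration.

```python
import unicodedata


def code_points(text: str) -> list[str]:
    """Return the code points of a string as four-digit hexadecimal values."""
    return [f"{ord(ch):04X}" for ch in text]


# Print each precomposed character next to its canonical decomposition (NFD).
for precomposed in ("\u1E16", "\u1E17", "\u1E52", "\u1E53"):
    decomposed = unicodedata.normalize("NFD", precomposed)
    print(code_points(precomposed), "->", code_points(decomposed))
```

For a base letter such as "A", the sequence with 0304 and 0301 does not compose to a single code point, which is why it appears in the list of additional combinations.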

Green colour: code points in Junicode which may be allocated by TITUS, even if they are outside the TITUS area (E000 - F0FF).  

Glyph

Cat.

Entity

MUFI

Junicode

Descriptive name

2

&Amacracute;
= A + &combmacr; + &combacute;

TITUS
= 0041 + 0304 + 0301

F180 (?)

LATIN CAPITAL LETTER A WITH MACRON AND ACUTE
= LATIN CAPITAL LETTER A + COMBINING MACRON + COMBINING ACUTE

2

&amacracute;
= a + &combmacr; + &combacute;

TITUS
= 0061 + 0304 + 0301

F181

LATIN SMALL LETTER A WITH MACRON AND ACUTE
= LATIN SMALL LETTER A + COMBINING MACRON + COMBINING ACUTE

2

&AEligmacracute;
= Æ + &combmacr; + &combacute;

TITUS
= 00C6 + 0304 + 0301

F182 (?)

LATIN CAPITAL LIGATURE AE WITH MACRON AND ACUTE
= LATIN CAPITAL LIGATURE AE + COMBINING MACRON + COMBINING ACUTE

2

&aeligmacracute;
= æ + &combmacr; + &combacute;

TITUS
= 00E6 + 0304 + 0301

F183

LATIN SMALL LIGATURE AE WITH MACRON AND ACUTE
= LATIN SMALL LIGATURE AE + COMBINING MACRON + COMBINING ACUTE

2

&Imacracute;
= I + &combmacr; + &combacute;

TITUS
= 0049 + 0304 + 0301

F186 (?)

LATIN CAPITAL LETTER I WITH MACRON AND ACUTE
= LATIN CAPITAL LETTER I + COMBINING MACRON + COMBINING ACUTE

2

&imacracute;
= i + &combmacr; + &combacute;

TITUS
= 0069 + 0304 + 0301

F187

LATIN SMALL LETTER I WITH MACRON AND ACUTE
= LATIN SMALL LETTER I + COMBINING MACRON + COMBINING ACUTE

2

&Umacracute;
= U + &combmacr; + &combacute;

TITUS
= 0055 + 0304 + 0301

F18A (?)

LATIN CAPITAL LETTER U WITH MACRON AND ACUTE
= LATIN CAPITAL LETTER U + COMBINING MACRON + COMBINING ACUTE

2

&umacracute;
= u + &combmacr; + &combacute;

TITUS
= 0075 + 0304 + 0301

F18B

LATIN SMALL LETTER U WITH MACRON AND ACUTE
= LATIN SMALL LETTER U + COMBINING MACRON + COMBINING ACUTE

2

&Ymacracute;
= Y + &combmacr; + &combacute;

TITUS
= 0059 + 0304 + 0301

F18C (?)

LATIN CAPITAL LETTER Y WITH MACRON AND ACUTE
= LATIN CAPITAL LETTER Y + COMBINING MACRON + COMBINING ACUTE

2

&ymacracute;
= y + &combmacr; + &combacute;

TITUS
= 0079 + 0304 + 0301

F18D

LATIN SMALL LETTER Y WITH MACRON AND ACUTE
= LATIN SMALL LETTER Y + COMBINING MACRON + COMBINING ACUTE
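
The decompositions listed above (base letter + 0304 + 0301) can be tried out directly. The following is a minimal sketch in Python, using only the standard unicodedata module; the function names are illustrative and not part of this proposal. It builds the decomposed sequence and shows what Unicode normalisation (NFC) makes of it: for "e" the result is the single code point 1E17, whereas for "a" only the macron is composed (0101) and the combining acute remains, since no precomposed character exists.

```python
import unicodedata

COMBINING_MACRON = "\u0304"
COMBINING_ACUTE = "\u0301"


def macron_acute(base: str) -> str:
    """Return base + COMBINING MACRON + COMBINING ACUTE (decomposed form)."""
    return base + COMBINING_MACRON + COMBINING_ACUTE


def nfc_code_points(base: str) -> list[str]:
    """Normalise the decomposed sequence to NFC and list the resulting code points."""
    composed = unicodedata.normalize("NFC", macron_acute(base))
    return [f"{ord(ch):04X}" for ch in composed]


if __name__ == "__main__":
    for letter in "eoaiuy":
        print(letter, nfc_code_points(letter))
```

The macron is placed before the acute because both marks have the same combining class, so their order is preserved by normalisation and determines which mark sits closer to the base letter.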

Four subranges, 26 - 29, have been reserved for future additions of precomposed characters.

 


 

C: Variant letter forms (F290 - F2FF and F300 - F3FF)


MUFI subrange 30: Nordic

This subrange contains commonly recognised variant letter forms in Medieval Nordic manuscripts. Note that the majority of these characters are inherited from Insular and Continental styles, and will thus be relevant for many other European literatures.

Variant letter forms may be defined as graphemes, but are normally not characters in the Unicode sense of the word. However, the distinction between characters and character variants is not always clear, and some of the letter forms in the list may prove to be characters according to the strict Unicode criteria.

To minimise conflicts with Junicode, the code points already assigned in that font have been retained wherever possible. Additional letter forms have been placed in a new section commencing with F290.
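
As an illustration of how these areas relate, the sketch below classifies a code point by the ranges named in this proposal: the TITUS area (E000 - F0FF), the Junicode section for English letter forms (F100 - F13B) and section C for variant letter forms (F290 - F3FF). The function name and the labels are assumptions made for this example only; the bounds of the Private Use Area itself (E000 - F8FF) are those of the Unicode Standard.

```python
# Ranges as stated in this proposal; the labels are illustrative only.
RANGES = [
    ("TITUS area", 0xE000, 0xF0FF),
    ("Junicode English letter forms", 0xF100, 0xF13B),
    ("MUFI section C (variant letter forms)", 0xF290, 0xF3FF),
]

PUA_FIRST, PUA_LAST = 0xE000, 0xF8FF  # Private Use Area of the BMP


def classify(code_point: int) -> str:
    """Return the name of the range that contains code_point."""
    for name, first, last in RANGES:
        if first <= code_point <= last:
            return name
    if PUA_FIRST <= code_point <= PUA_LAST:
        return "other Private Use Area"
    return "outside the Private Use Area"


if __name__ == "__main__":
    for cp in (0xF290, 0xF109, 0xF181, 0x027C):
        print(f"{cp:04X}: {classify(cp)}")
```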

Cat. | Entity | MUFI | Junicode | Descriptive name
3 | &ains; | F290 | | LATIN SMALL LETTER INSULAR A
    This is the single-storey "a" of the Insular script, similar in shape to modern italic "a".
3 | &ainsenl; | F291 | | LATIN ENLARGED SMALL LETTER INSULAR A
    This is the enlarged variant of the single-storey "a", triangular in shape. Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 27.
3 | &aopen; | F292 | | LATIN SMALL LETTER A OPEN FORM
    This is the traditional "open a" from early Carolingian script, similar in shape to "cc".
3 | &aclose; | F293 | | LATIN SMALL LETTER A CLOSED FORM
    Cf. D.A. Seip, Palæografi: Norge og Island (Nordisk kultur 23:B), Oslo etc. 1954, p. 70.
3 | &aelighook; | F294 | | LATIN SMALL LIGATURE AE WITH HOOK
    Cf. Johs. Brøndum-Nielsen, ed., Palæografi: Danmark og Sverige (Nordisk kultur 23:A), Stockholm etc. [1943], p. 102.
3 | &Olll; | F295 | | LATIN CAPITAL LIGATURE AO NECKLESS
3 | &olll; | F296 | | LATIN SMALL LIGATURE AO NECKLESS
    These two letter forms should be analysed and designed as the ligature of "a+o" without the neck of the "a".
1 | &dunc; | F109 | F109 | LATIN LETTER UNCIAL D
    This is a separate character in Medieval Welsh.
3 | &eunc; | F10A | F10A | LATIN LETTER UNCIAL E
4 | &Fins; | F297 | | LATIN CAPITAL LETTER INSULAR F
3 | &fins; | F103 | F103 | LATIN SMALL LETTER INSULAR F
3 | &finsclose; | F298 | | LATIN SMALL LETTER INSULAR F CLOSED FORM
3 | &hins; | F110 | F110 | LATIN CAPITAL LETTER INSULAR H
1 | &jdl; | F299 | | LATIN SMALL LETTER J DOTLESS
1 | &kunc; | F29A | | LATIN LETTER UNCIAL K
3 | &kclose; | F29B | | LATIN SMALL LETTER K CLOSED FORM
3 | &munc; | F11A | F11A | LATIN LETTER UNCIAL M
3 | &OOlig; | F29C | | LATIN CAPITAL LIGATURE OO
3 | &oolig; | F29D | | LATIN SMALL LIGATURE OO
    The "o+o" ligature was introduced by the Icelander Jón Ólafsson from Grunnavík in the 18th century. Cf. Jón Helgason, Jon Olafsson fra Grunnavik, Copenhagen 1926.
3 | &Ourl; | F29E | | LATIN CAPITAL LIGATURE OE WITH MISSING BOTTOM STROKE
3 | &ourl; | F29F | | LATIN SMALL LIGATURE OE WITH MISSING BOTTOM STROKE
    These two letter forms should be analysed and designed as the ligature of "o+e" without the bottom (finishing) stroke of the "e".
1 | &rrot; | F2A0 | | LATIN SMALL LETTER R ROTUNDA
3 | &rins; | F125 | F125 | LATIN SMALL LETTER INSULAR R
0 | &rdes; | 027C | | LATIN SMALL LETTER R WITH LONG LEG
3 | &stalldes; | F127 | F127 | LATIN SMALL LETTER TALL S DESCENDING
    This character extends below the base line, while the ordinary tall "s" is located on the base line.
3 | &tunc; | F129 | F129 | LATIN LETTER UNCIAL T
1 | &Vins; | F2A1 | | LATIN CAPITAL LETTER INSULAR V (VENTH)
    Partially similar to 01F7 LATIN CAPITAL LETTER WYNN in Latin Extended-B. Note that the Old Norse variant is open, resembling the character "Y", and is transliterated with "v", not with "w" as in Old English.
1 | &vins; | F2A2 | | LATIN SMALL LETTER INSULAR V (VENTH)
    Partially similar to 01BF LATIN LETTER WYNN in Latin Extended-B. Note that the Old Norse variant is open, resembling the character "y", and is transliterated with "v", not with "w" as in Old English.
3 | &YYlig; | F2A3 | | LATIN CAPITAL LIGATURE YY
3 | &yylig; | F2A4 | | LATIN SMALL LIGATURE YY
    The "y+y" ligature was introduced by the Icelander Jón Ólafsson from Grunnavík in the 18th century. Cf. Jón Helgason, Jon Olafsson fra Grunnavik, Copenhagen 1926.
3 | &zbar; | F2A5 | | LATIN SMALL LETTER Z WITH CROSSBAR
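
The entity names in the table above lend themselves to simple text conversion. The following Python sketch is one possible way of replacing entities in a transcription with the proposed code points. It covers only a small excerpt of the table, and the names MUFI_ENTITIES, expand_entities and uses_private_use are hypothetical, introduced for this example. Entities not found in the mapping are left untouched.

```python
import re

# Excerpt of the table above: entity name -> proposed code point.
MUFI_ENTITIES = {
    "ains": 0xF290,   # LATIN SMALL LETTER INSULAR A
    "aopen": 0xF292,  # LATIN SMALL LETTER A OPEN FORM
    "rrot": 0xF2A0,   # LATIN SMALL LETTER R ROTUNDA
    "vins": 0xF2A2,   # LATIN SMALL LETTER INSULAR V (VENTH)
    "rdes": 0x027C,   # LATIN SMALL LETTER R WITH LONG LEG (already in Unicode)
}

ENTITY_RE = re.compile(r"&([A-Za-z]+);")


def expand_entities(text: str) -> str:
    """Replace known entities with their characters; leave unknown ones as they are."""
    def replace(match: re.Match) -> str:
        code_point = MUFI_ENTITIES.get(match.group(1))
        return chr(code_point) if code_point is not None else match.group(0)
    return ENTITY_RE.sub(replace, text)


def uses_private_use(text: str) -> bool:
    """Report whether the text contains characters from the Private Use Area (E000 - F8FF)."""
    return any(0xE000 <= ord(ch) <= 0xF8FF for ch in text)


if __name__ == "__main__":
    sample = expand_entities("fo&rrot; &vins;ar")
    print([f"{ord(ch):04X}" for ch in sample], uses_private_use(sample))
```

A check such as uses_private_use may be useful before exchanging a text, since Private Use Area characters display correctly only with a font that follows the same allocation.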

Reserved space after this range: F2A6 to F2C0 (26 code points).

 



MUFI subrange 31: English

This subrange contains commonly recognised variant letter forms in Old and Middle English manuscripts. Note that several of the Insular letter forms have been included in the Nordic section, subrange 30.

To minimise conflicts with Junicode, the code points already assigned in that font have been used wherever possible. Additional letter forms should preferably be placed in vacant slots in the section F100 - F13B, where Junicode has English letter forms. 47 slots are available here.
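
The number of vacant slots can be checked with a little arithmetic. The sketch below rests on an assumption that this proposal does not state explicitly: that the occupied slots in F100 - F13B are exactly the Junicode code points listed in the tables for subranges 30 and 31. Under that assumption the section holds 60 code points, 13 of which are listed without reservation, leaving 47; if the tentative F10C (?) is counted as occupied, 46 remain.

```python
# Junicode section for English letter forms, as stated in this proposal.
SECTION_FIRST, SECTION_LAST = 0xF100, 0xF13B

# Assumption: code points in this section listed in subranges 30 and 31.
CONFIRMED = {
    0xF103, 0xF106, 0xF109, 0xF10A, 0xF10E, 0xF10F, 0xF110,
    0xF11A, 0xF125, 0xF126, 0xF127, 0xF129, 0xF13A,
}
TENTATIVE = {0xF10C}  # marked "(?)" in the table


def range_size(first: int, last: int) -> int:
    """Number of code points in an inclusive range."""
    return last - first + 1


def free_slots(include_tentative: bool) -> int:
    """Count the slots in the section that are not occupied."""
    used = (CONFIRMED | TENTATIVE) if include_tentative else CONFIRMED
    return sum(
        1 for cp in range(SECTION_FIRST, SECTION_LAST + 1) if cp not in used
    )


if __name__ == "__main__":
    print(range_size(SECTION_FIRST, SECTION_LAST))
    print(free_slots(include_tentative=False), free_slots(include_tentative=True))
```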

Golden colour: code points to be allocated by Peter S. Baker (Junicode).

Cat. | Entity | MUFI | Junicode | Descriptive name
3 | &Ainsqu; | F13A | F13A | LATIN CAPITAL LETTER INSULAR A SQUARE FORM
3 | &Cinssqu; | F106 | F106 | LATIN CAPITAL LETTER INSULAR C SQUARE FORM
3 | &Ginssqu; | F10E | F10E | LATIN CAPITAL LETTER INSULAR G SQUARE FORM
3 | &Gins; | F10C (?) | F10C (?) | LATIN CAPITAL LETTER INSULAR G
3 | &gins; | F10F | F10F | LATIN SMALL LETTER INSULAR G
3 | &sins; | F126 | F126 | LATIN SMALL LETTER INSULAR S
  |  | F192 (?) | F192 | SEENET CHARACTER 1
  |  | F193 (?) | F193 | SEENET CHARACTER 2
  |  | F194 (?) | F194 | SEENET CHARACTER 3
  |  | F195 (?) | F195 | SEENET CHARACTER 4
  |  | F196 (?) | F196 | SEENET CHARACTER 5
  |  | F197 (?) | F197 | SEENET CHARACTER 6
  |  | F19E (?) | F19E | ALTERNATIVE CEDILLA

 


 


MUFI subrange 32: Portuguese

To be added in the next version of this proposal. For a preview, please consult The Portuguese Medieval Font Project (PDF file, 45 pages).