ImageGear for C and C++ on Windows v19.1 - Updated
Characters, Languages, and Modules
User Guide > How to Work with... > OCR > Technical Specifications > Characters, Languages, and Modules

This is an alphabetical listing of all the accented letters of the Latin alphabet that the Engine can handle. The recognition modules cannot be reliably trained to read accented letters beyond this listing. The Engine can also read digits, punctuation, and miscellaneous characters and the Greek and Cyrillic alphabets, as listed elsewhere.

Letter Images

These are displayed as bitmapped images and represent current Windows representation with the font Arial.

Letter Names

These follow the official ISO and UNICODE character names.

UNICODE Character Codes

Where two codes appear, the first is the uppercase letter, the second lowercase.

Languages

This defines the languages for which the character is automatically validated. Characters needed but not displayed for a given language can be validated for recognition individually.

Dictionary Support

This relates in the first instance to the MOR recognition module. Other modules offer varying levels of dictionary support.

Module Support

MOR supports all letters; therefore it is not listed. DOT* = lowercase support only.

Code Page Support

This is shown for two basic West European Code Pages: ASCII 437 for DOS and ANSI 1252 for Windows. Support is indicated by the code value; the first code is for uppercase, the second for lowercase. MIS = Missing.

Other Code Pages

1250 = Windows East European; 1254 = Windows Turkish; 1257 = Windows Baltic

Letter Image

Letter Name

Unicode

Languages Using the Character

Dictionary support; without support

Module support

ASCII
437

ANSI 1252

Other
Code Pages

a_diaresis.jpg

A Diaeresis

00c4
00e4

Finnish, German, Swedish, Chamorro, Eskimo, Estonian, Guarani, Mayan, Slovak.

MTX, FRX, DOT

142
132

196
228

1250 1254 1257

a_acute.jpg

A Acute

00c1
00e1

Czech, Dutch, French, Hungarian, Icelandic, Portuguese, Spanish, Eskimo, Faroese, Gaelic (I), Guarani, Malagasy, Papiamento, Welsh.

MTX, FRX, DOT

MIS
160

193
225

1250 1254

a_ring.jpg

A Ring

00c5
00e5

Danish, Finnish, Norwegian, Swedish.

MTX, FRX, DOT

143
134

197
229

1254 1257

a_circumflex.jpg

A Circumflex

00c2
00e2

French, Portuguese, Eskimo, Frisian, Friulian, Luxembourgian, Rhaetic, Romanian, Tahitian, Turkish, Welsh, Wolof.

MTX, FRX, DOT

MIS
131

194
226

1250 1254

a_grave.jpg

A Grave

00c0
00e0

Catalan, Dutch, French, Italian, Portuguese, Breton, Friulian, Gaelic (S), Malagasy, Provencal, Rhaetic, Sardinian.

MTX, FRX, DOT*

MIS
133

192
224

1254

a_tilde.jpg

A Tilde

00c3
00e3

Portuguese.

MTX, FRX, DOT

MIS

195
227

 

sml_a.jpg

Sml. A Feminine Ordinal indicator

00aa

Spanish, Portuguese.

MTX, FRX

166

170

1254

a_breve.jpg

A Breve

0102
0103

Romanian.

FRX

MIS

MIS

1250

a_ogonok.jpg

A Ogonok

0104
0105

Polish, Kasub, Lithuanian.

FRX

MIS

MIS

1250  1257

a_macron.jpg

A Macron

0100
0101

Eskimo, Fijian, Latvian,Samoan.

FRX, DOT* 

MIS

MIS

1257

ae.jpg

AE

00c6
00e6

Danish, Norwegian, Faroese, Icelandic.

MTX FRX, DOT

146
145

198
230

1254  1257

c_acute.jpg

C Acute

0106
0107

Polish, Croatian, Kasub, Sorbian.

FRX

MIS

MIS

1250  1257

c_circumflex.jpg

C Circumflex

0108
0109

Esperanto.

 

MIS

MIS

 

c_hacek.jpg

C Hacek

010c
010d

Czech, Croatian, Lappish, Latvian, Lithuanian, Romany, Slovak, Slovenian, Sorbian.

FRX

MIS

MIS

1250  1257

c_dot.jpg

C Dot

010a
010b

Maltese.

 

MIS

MIS

 

c_cedilla.jpg

C Cedilla

00c7
00e7

Catalan, French, Portuguese, Albanian, Kurdish, Papiamento, Provencal, Turkish.

FRX, MTX, DOT

128
135

199
231

1250  1254

d_hacek.jpg

D Hacek (lower as apostrophe)

010e
010f

Czech, Slovak.

FRX

MIS

MIS

1250 

d_stroke.jpg

D Stroke

0110
0011

Croatian

FRX

MIS

208
MIS

1250 

ETH.jpg

ETH

00d0
00f0

Icelandic, Faroese.

FRX

MIS

208
240

Uppercase in 1250

e_acute.jpg

E Acute

00c9
00e9

Catalan, Czech, Dutch, French, Hungarian, Italian, Portuguese, Spanish, Breton, Friulian, Gaelic (I), Guarani, Icelandic, Kasub, Luxembourgian, Malagasy, Malinke, Papiamento, Provencal, Sardinian, Slovak, Welsh, Wolof.

FRX, MTX, DOT

144
130

201
233

1250 1254 1257 

e_diaeresis.jpg

E Diaeresis

00cb
00eb

Dutch, French, Portuguese, Afrikaans,  Albanian, Breton, Kasub, Welsh.

FRX, MTX, DOT*

MIS
137

203
235

1250 1254

e_circumflex.jpg

E Circumflex

00ca
00ea

Dutch, French, Portuguese, Afrikaans, Breton, Chuana, Eskimo, Frisian, Friulian, Kurdish, Rhaetic, Tahitian, Wolof.

FRX, MTX, DOT*

MIS
136

202
234

1254

e_grave.jpg

E Grave

00c8
00e8

Catalan, Dutch, French, Italian, Portuguese, Breton, Chuana, Gaelic (S), Malagasy, Malinke, Provencal, Rhaetic, Sardinian, Suto, Wolof.

FRX, MTX, DOT*

MIS
138

200
232

1254

e_hacek.jpg

E Hacek

011a
011b

Czech, Malay (really E-breve), Slovak,Sorbian, Sundanese.

FRX

MIS

MIS

1250

e_dot.jpg

E Dot

0116
0117

Lithuanian.

FRX

MIS

MIS

1257

e_ogonok.jpg

E Ogonok

0118
0119

Polish, Kasub, Lithuanian.

FRX

MIS

MIS

1250  1257

e_macron.jpg

E Macron

0112
0113

Basque, Eskimo, Fijian, Latvian, Minankabaw.

FRX

MIS

MIS

1257

g_circumflex.jpg

G Circumflex

011c
011d

Esperanto.

 

MIS

MIS

 

g_breve.jpg

G Breve

011e
011f

Turkish.

FRX

MIS

MIS

1254

g_dot.jpg

G Dot

0120
0121

Maltese.

 

MIS

MIS

 

g_cedilla.jpg

Cap.G Cedilla Sml. G apost.

0122
0123

Lappish, Latvian.

FRX

MIS

MIS

1257

h_circumflex.jpg

H Circumflex

0124
0125

Esperanto.

 

MIS

MIS

 

h_bar.jpg

H Bar

0126
0127

Maltese.

 

MIS

MIS

 

i_diaeresis.jpg

I Diaeresis

00cf
00ef

Catalan, Dutch, French, Portuguese, Breton, Minankabaw, Provencal, Tahitian.

FRX, MTX, DOT*

MIS
139

207
239

1254

i_grave.jpg

I Grave

00cc
00ec

Italian, Portuguese, Corsican, Friulian, Gaelic (S), Malagasy, Sardinian.

FRX, MTX, DOT*

MIS
141

204
236

1254

i_circumflex.jpg

I Circumflex

00ce
00ee

French, Italian, Breton, Eskimo, Romanian, Turkish.

FRX, MTX, DOT*

MIS
140

206
238

1254

i_acute.jpg

I Acute

00cd
00ed

Catalan, Czech, Dutch, Hungarian, Italian, Portuguese, Spanish, Eskimo, Faroese, Gaelic (I), Guarani, Icelandic, Malagasy, Slovak.

FRX, MTX, DOT

MIS
161

205
237

1250  1254

i_tilde.jpg

I Tilde

0128
0129

Eskimo, Kikuyu.

 

MIS

MIS

 

cap_i_dot.jpg

Cap. I Dot
Small Dotless I

0130
0131

Turkish.

FRX

MIS

MIS

1254

i_ogonok.jpg

I Ogonok

012e
012f

Lithuanian.

FRX

MIS

MIS

1257

i_macron.jpg

I Macron

012a
012b

Eskimo, Fijian, Latvian.

FRX

MIS

MIS

1257

i_hacek.jpg

I Hacek

012c
012d

(Latin – for I-breve)
Currently not enabled.

 

MIS

MIS

 

j_circumflex.jpg

J Circumflex

0134
0135

Esperanto.

 

MIS

MIS

 

k_cedilla.jpg

K Cedilla

0136
0137

Latvian.

FRX

MIS

MIS

1257

l_acute.jpg

L Acute

0139
013a

Slovak.

FRX

MIS

MIS

1250

l_hacek.jpg

L Hacek (shown as apostrophe)

013d
013e

Slovak.

FRX

MIS

MIS

1250

l_slash.jpg

L Slash

0141
0142

Polish, Kasub, Sorbian.

FRX

MIS

MIS

1250  1257

l_cedilla.jpg

L Cedilla

013b
013c

Latvian.

FRX

MIS

MIS

1257

n_tilde.jpg

N Tilde

00d1
00f1

Spanish, Aymara, Basque, Chuana, Guarani, Luba, Papiamento, Quechua.

FRX, MTX, DOT

165
164

209
241

1254

n_acute.jpg

N Acute

0143
0144

Polish, Sorbian.

FRX

MIS

MIS

1250 1257

n_hacek.jpg

N Hacek

0147
0148

Czech, Slovak.

FRX

MIS

MIS

1250

n_cedilla.jpg

N Cedilla

0145
0146

Ganda, Latvian.

FRX

MIS

MIS

1257

ENG.jpg

ENG

014a
014b

Lappish, IPA (currently not supported)

 

MIS

MIS

 

o_diaeresis.jpg

O Diaeresis

00d6
00f6

Finnish, German, Hungarian, Swedish, Estonian, Guarani, Icelandic, Kurdish, Turkish.

FRX, MTX, DOT

153
148

214
246

1250  1254  1257

o_acute.jpg

O Acute

00d3
00f3

Catalan, Czech, Dutch, Hungarian, Italian, Polish, Portuguese, Spanish, Faroese, Gaelic (I), Icelandic, Malagasy, Papiamento, Provencal, Slovak, Welsh.

FRX, MTX, DOT

MIS
162

211
243

1250  1254  1257

o_circumflex.jpg

O Circumflex

00d4
00f4

French, Portuguese, Breton, Tahitian, Slovak, Welsh, Wolof.

FRX, MTX, DOT*

MIS
147

212
244

1250  1254

o_grave.jpg

O Grave

00d2
00f2

Catalan, Italian, Portuguese, Tswana, Friulian, Gaelic (S), Malagasy, Sardinian.

FRX, MTX, DOT*

MIS
149

210
242

1254

o_tilde.jpg

O Tilde

00d5
00f5

Portuguese, Estonian.

FRX, MTX, DOT

MIS

213
245

1254  1257

sml_o.jpg

Sml. O Masculine Ordinal indicator

00ba

Spanish, Portuguese.

FRX, MTX, DOT

167

186

1254

o_doubleacute.jpg

O Double Acute

0150
0151

Hungarian.

FRX, MTX, DOT

MIS

MIS

1250

o_macron.jpg

O Macron

014c
014d

Eskimo, Fijian, Latvian, Samoan, Sotho, Tswana.

FRX, DOT*

MIS

MIS

1257

o_breve.jpg

O Breve

014e
014f

(Latin) 

Currently not implemented

 

MIS

MIS

 

o_slash.jpg

O Slash

00d8
00f8

Danish, Norwegian, Faroese.

FRX, MTX, DOT

MIS

216
248

1254  1257

oe.jpg

O E

0152
0153

French.

FRX, MTX, DOT

MIS

140
156

 

r_acute.jpg

R Acute

0154
0155

Basque, Eskimo, Slovak.

FRX

MIS

MIS

1250

r_hacek.jpg

R Hacek

0158
0159

Czech, Sorbian.

FRX

MIS

MIS

1250

r_cedilla.jpg

R Cedilla

0156
0157

Latvian.

FRX

MIS

MIS

1257

s_hacek.jpg

S Hacek

0160
0161

Czech, Croatian,  Lappish, Latvian, Lithuanian, Luba, Romany, Slovak, Slovenian, Sorbian, Sotho, Tswana.

FRX

MIS

138
154

1250  1257

s_acute.jpg

S Acute

015a
015b

Polish, Sorbian.

FRX

MIS

MIS

1250  1257

s_circumflex.jpg

S Circumflex

015c
015d

Esperanto.

 

MIS

MIS

 

s_cedilla.jpg

S Cedilla

015e
015f

Kurdish, Romanian, Turkish.

FRX

MIS

MIS

1250  1254

t_hacek.jpg

T Hacek (lower as apostrophe)

0164
0165

Czech, Slovak. (Lower case sometimes appears as hacek)

FRX

MIS

MIS

1250

t_cedilla.jpg

T Cedilla

0162
0163

Romanian.

FRX

MIS

MIS

1250

t_thorn.jpg

T Thorn

00de
00fe

Icelandic.

 

MIS

222
254

 

t_bar.jpg

T Bar

0166
0167

Lappish

(currently not supported).

 

MIS

MIS

 

u_diaeresis.jpg

U Diaeresis

00dc
00fc

Catalan, French, German, Portuguese, Spanish, Breton, Estonian, Guarani, Turkish.

FRX, MTX, DOT

154
129

220
252

1250  1254  1257

u_acute.jpg

U Acute

00da
00fa

Catalan, Czech, Hungarian, Italian, Portuguese, Spanish, Faroese, Frisian, Gaelic (I), Icelandic, Slovak.

FRX, MTX, DOT

MIS
163

218
250

1250  1254

u_circumfelx.jpg

U Circumflex

00db
00fb

French, Frisian, Kurdish, Luxembourgian, Turkish, Wolof.

FRX, MTX, DOT*

MIS
150

219
251

1254

u_grave.jpg

U Grave

00d9
00f9

French, Italian, Frisian, Friulian, Gaelic (S), Sardinian.

FRX, MTX, DOT*

MIS
151

217
249

1254

u_tilde.jpg

U Tilde

0168
0169

Kikuyu.

 

MIS

MIS

 

u_doubleacute.jpg

U Double Acute

0170
0171

Hungarian.

FRX, DOT

MIS

MIS

1250

u_breve.jpg

U Breve

016c
016d

Esperanto,( Latin).

 

MIS

MIS

 

u_ring.jpg

U Ring

016e
016f

Czech.

FRX

MIS

MIS

1250

u_macron.jpg

U Macron

016a
016b

Fijian, Latvian, Lithuanian.

FRX

MIS

MIS

1257

u_ogonok.jpg

U Ogonok

0172
0173

Lithuanian.

FRX

MIS

MIS

1257

w_circumflex.jpg

W Circumflex

0174
0175

Welsh. Not frequent

 

MIS

MIS

 

y_acute.jpg

Y Acute

00dd
00fd

Czech, Faroese, Icelandic, Malagasy, Slovak, Welsh.

FRX

MIS

221
253

1250

y_diaeresis.jpg

Y Diaeresis

0178
00ff

French (very rare).

At present disabled

FRX, MTX, DOT

MIS
152

159
255

1254

y_circumflex.jpg

Y Circumflex

0176
0177

Welsh.

Not frequent.

 

MIS

MIS

 

z_acute.jpg

Z Acute

0179
017a

Sorbian.

FRX

MIS

MIS

1250

z_hacek.jpg

Z Hacek

017d
017e

Czech, Polish, Croatian, Latvian, Lithuanian, Luba, Romany, Slovak, Slovenian, Sorbian.

FRX

MIS

MIS

1250   1257

z_dot.jpg

Z Dot

017b
017c

Polish, Maltese.

FRX

MIS

MIS

1250   1257