LanguageTool Community

<< return to list of rules

Rule "Non-standard character in a word"

This is one of many errors that LanguageTool can detect. Visit the LanguageTool homepage to use it online or download it for free.

Description: Non-standard character in a word
Message: The character '\1' in '\1' is not an English character, although it can look like an English character. The character can cause a spelling error and disambiguation errors. Type the word again.
Category: Typography (ID: TYPOGRAPHY)
Link: https://www.unicode.org/charts/
Incorrect sentences
that this rule can detect:
  • Please оpen the door. (Contains Cyrillic small letter 'o', Unicode U+043E.)
  • Can you ѕee it? (Contains Cyrillic small letter 'dze', Unicode U+0455.)
  • Did you еat the last biscuit? (Contains Cyrillic small letter 'ie', Unicode, U+0435.)
  • This is a рea. (Contains Cyrillic small letter 'er', Unicode U+0440.)
  • , thank you. (Contains Cyrillic small letter 'o'.)
  • Thiѕ is not correct. (Contains Cyrillic small letter 'dze')
  • Tell more! (Contains Cyrillic small letter 'ie'.)
  • Do you like the souр? (Contains Cyrillic small letter 'er')
  • The sun is hоt. (Contains Cyrillic small letter 'o'.)
  • Do nοt open the window. (Contains Greek small letter omicron, Unicode U+03BF.)
  • There is nο problem. (Unicode U+03BF.)
  • The cat is οn the mat. (Unicode U+03BF.)
  • Nοrmal operation. (Unicode U+03BF.)
Correct sentences
for comparison:
  • Please open the door.
  • Cyrillic small letter о.
  • ...of China use astronaut while texts in Russian use космонавт (kosmonavt).
  • Russian: Чужой simple:Alien
  • The word 'Hоuѕе' contains 3 non-standard characters.
  • ... ru: ....
  • Contains Cyrillic small letter 'dze'.
  • ... baby's first word "googoogoo".
  • This is ο-u-t of scope. (Unicode U+03BF.)
  • Cognate words are the Greek (ankylοs), meaning... (Unicode U+03BF.)
  • ... the Greek (ankylοs, which means... (Unicode U+03BF.)
Pattern: Show XML · Show in Rule Editor
Check the following text against just this rule:

ID: NON_ENGLISH_CHARACTER_IN_A_WORD [1]
Version: 5.9-SNAPSHOT (2022-07-06 20:33:02 +0000)