«
earthlingsoft Homepage
UnicodeChecker
Explore and convert Unicode
Download
(Version
1.24, 4.6MB, Mac OS X 10.9 or later)
Freeware Donate if you like it.
Exploration Unicode number and UTF encodings for each codepoint • Related codepoints and decomposition • Browsing by plane, codepoint number or character block • Unihan data such as definitions, variants, pronunciations and older encodings for many Asian codepoints • Finding code points by name or their Unihan definition • Finding codepoints via Spotlight.
Conversion To and from: HTML, CSS, URLs, IDNA and various programming languages • Unicode Normalisation with an indication whether the string is already normalised (NFC, NFKC, NFD and NFKD) • Split Up and Diff tools to analyse a string codepoint by codepoint and highlight the differences • Most of the features are available to all applications you use via the ‘Services’ menu and AppleScript.
Please read the included help book for more detailed information.
UnicodeChecker uses data from The Unicode Standard, Version 15.0.0.
Screenshots
•
E-Mail
•
Donate
Older versions
Version History:
- 1.24 (2022-09-19)
- Uses Unicode 15.0.0 data files.
- The new “Unshredder” utility assists with reconstructing text that was originally saved as UTF-8 but interpreted in a different character encoding.
- In the “Split Up” utility, the (selected) table contents can be copied to the clipboard. Either as text string via the “Copy” menu command or in tabular form using the separate button.
- UnicodeChecker generally tries to keep the currently used font when switching between codepoints and a glyph for the new codepoint is available in the current font. However, once a codepoint was only found in the “LastResort” font, this font would be then be used for any newly selected codepoint, even if a glyph was available in one of the installed fonts. UnicodeChecker now tries to avoid using the “LastResort” font.
- Bugfix: The text field in the main window was sometimes (especially in Dark Mode) using inapproprate colors, making the glyphs hard to read.
- Bugfix: Updating UnicodeChecker via the “Check for Update…” menu command did not work.
- 1.23 (2021-10-08)
- Uses Unicode 14.0.0 data files.
- Universal Binary with support for ARM 64 bit (Apple Silicon M1) and Intel 64 bit architectures.
- Requires macOS X 10.9 or later.
- 1.22 (2020-04-21)
- Uses Unicode 13.0.0 data files.
- Adds compatibility with Dark Mode.
- 1.21.1 (2019-05-13)
- Uses Unicode 12.1.0 data files.
- Minor layout updates and bugfix.
- 1.21 (2019-04-24)
- Uses Unicode 12.0.0 data files.
- Requires macOS X 10.8 or later. (This was probably already the case for earlier versions of UnicodeChecker.)
- 1.20 (2018-06-27)
- Uses Unicode 11.0.0 data files.
- Network transfers are now made using HTTPS instead of HTTP.
- UnicodeChecker is now code signed for increased security and can therefore be run with strict Gatekeeper settings.
- Bugfix: The Unihan download window would not be closed.
- Bugfix: The IDNA conversion plugin was missing.
- 1.19 (2017-09-10)
- Uses Unicode 10.0.0 data files.
- 1.18 (2016-08-05)
- Uses Unicode 9.0.0 data files.
- Improved handling of the OS X System Font (introduced in Mac OS X 10.11).
- Minor improvements and bugfixes.
- 1.17 (2015-06-21)
- Uses Unicode 8.0.0 data files.
- Changed the “Age” property in the “Advanced Find…” window to “Version of Assignment” which now provides a better search for Unicode versions: The possible values are shown in a list. Filtering of this field can now be done using relational operators.
- Added human readable property names to Advanced Find popup menus for “General Category” and “Bidirectional Category”.
- Minor user interface fixes and enhancements.
- 1.16.1 (2015-02-25)
- Notifications are now displayed for all conversion methods which are initiated from the system-wide Services menu.
- Minor refinement of the display of results in the “Advanced Find…” window: Slightly dimmed the information about matching properties, so the Unicode number and name are more readable.
- New high-resolution icons for the toolbar of the Utilities window.
- Redesigned and updated the UnicodeChecker Help book.
- Bugfix: Some alert panels did not respond to button clicks.
- Bugfix: Smart substitutions (e.g. quotes) were enabled for the main character text field.
- Bugfix: The red coloring of differing text parts in the Diff utility was incorrect for some cases.
- Bugfix: The text in the “Generate Spotlight Index…” was not completely visible in some cases.
- 1.16 (2015-02-17)
- Uses Unicode 7.0.0 data files.
- New “Advanced Find…” window allows filtering of codepoints using detailed criteria for Unicode properties such as Name, Age, Block, General Category, Unihan properties, etc.
- New integrated QuickLook plug-in for previewing characters found via Spotlight.
- Information from the LineBreak data file is now shown in the “Misc” tab.
- The font size of the displayed codepoint can now be changed.
- When selecting a codepoint in the “Split Up” utility, the corresponding codepoint information is shown in the main window.
- Notifications about character replacement done by one of the conversion functions in the “Services” menu are now shown in Notification Center on Mac OS X 10.8 and later. (Support for notifications using “Growl” has been dropped.)
- Bugfix: Escape utility was not honoring “Escape all codepoints” setting for C and URL formats.
- Bugfix: UTF8-Hex-Input would incorrectly treat 0xEFBBBF as invalid.
- Minor improvements and bugfixes.
- New application icon.
- Requires OS X 10.7 or later.
- 1.15.1 (2010-10-27)
- The code point type can be queried in AppleScript.
- Support for Sudden Termination under Mac OS X 10.6.
- Bugfix: Bidi rule for IDNA 2008 was incorrect.
- Bugfix: When using the IDNA utility in multiple utility windows simultaneously, the instances would not allow to set the protocols individually.
- Bugfix: When using the Escape utility in multiple utility windows simultaneously, the instances would not allow to set the escaping methods individually.
- Bugfix: Utilities windows would not close in some situations.
- Bugfix: Utilities were leaking memory.
- 1.15 (2010-10-21)
- Uses Unicode 6.0.0 data files.
- Support for IDNA 2008 (RFCs 5890-5894).
- Universal Binary with support for Intel 64/32 bit and PPC 32 bit architectures.
- Requires Mac OS X 10.4 or later.
- Minor improvements and bugfixes.
- 1.14 (2009-10-11)
- Uses Unicode 5.2.0 data files.
- Uses version 1.7 of the “Adobe Glyph List For New Fonts”.
- New QuickLook plugin for previewing Codepoints found via the (optionally) generated Spotlight index.
- New “Length” utility for displaying number of codepoints along with number of code units and bytes for UTF-8, UTF-16 and UTF-32 encoding.
- The “Split Up” utility now supports drag and drop for rearranging and adding new codepoints.
- Added option in the “Escape” plugin to escape every codepoint. Previously, only codepoints from a predefined set were escaped.
- Minor improvements and bugfixes.
- 1.13 (2007-08-27)
- New Favorite characters library.
- Codepoint references in the Unihan table are clickable links now.
- Menu items representing the character block containing the current codepoint are highlighted.
- Fixed a bug where Utilities windows were not working on Mac OS X 10.3.
- For codepoints that have been deprecated in Unicode a “Deprecated” label is displayed next to the glyph.
- Minor cosmetic improvements.
- 1.12.1 (2007-02-26)
- UnicodeChecker can now directly download and install the Unihan data file.
- UnicodeChecker Help includes instructions for changing keyboard shortcuts for Service menu commands.
- Improved support for Aliases in file paths.
- Fixed a bug where the saved “Unihan Tag Filter” was initially not used for filtering.
- Fixed a bug where UnicodeChecker would hang if “to custom location” was selected in the Spotlight index generation dialog.
- 1.12 (2007-02-19)
- Includes a new “Diff” Utility for viewing differences between two strings.
- The “Find Character” panel now allows to explicitly exclude strings from matching by prefixing them with “-”.
- Information from the Unicode data file HangulSyllableType is included in the “Misc” tab.
- The “Unihan Tag Filter” is now saved between UnicodeChecker starts.
- Cosmetic changes to the Utilities window:
- Toolbar items stay highlighted.
- Cycling through controls using the Tab key has been improved.
- Checking for UnicodeChecker updates is now done using Sparkle by Andy Matuschak. You can choose to automatically check for updates at each startup of UnicodeChecker in the Preferences dialog.
- Under some circumstances Unicode data files could not be read correctly. This resulted in a warning that a file could not be found although the file existed. This has been fixed.
- Fixed a bug where opening the AppleScript dictionary from Script Editor would crash UnicodeChecker on systems prior to Mac OS X 10.4.
- Fixed a bug where UnicodeChecker would crash on systems prior to Mac OS X 10.4 when the starting codepoint was set to U+0000.
- 1.11 (2006-09-06)
-
- Uses Unicode 5.0 data files
- Information from the Unicode data file NamesList is included in the “Misc” tab.
- Font metrics of the current font can be displayed in the current character text field. (Using “Show Font Metrics” in the Window menu.)
- The Unihan table is now sortable.
- 1.10 (2006-05-22)
-
- UnicodeChecker is now available as Universal Binary.
- Properties (including Unihan) of individual characters can be queried from AppleScript.
- The find panel lets you select whether you want to include Unihan definitions while searching. (Only available when the Unihan data file is installed.)
- The size of the find panel is saved between UnicodeChecker starts.
- The Unicode versions window was missing the files DerivedCoreProperies.txt and PropertyValueAliases.txt. This has been fixed.
- When finding multiple files for the same Unicode data file prefix, UnicodeChecker now returns the last file as this is more probably the lastest version.
- 1.9.1 (2005-11-20)
-
- Added (un-)escaping to/from octal escape sequences in the C Programming Language (using UTF-8 encoding).
- UnicodeChecker Help is now an Apple Help help book instead of a single text file.
- CSS 2/2.1 unescaping would not unescape characters from higher planes (outside BMP). This has been fixed.
- The current character text field will not use ligatures when entering multiple characters that could be represented using a ligature in the current font in order to avoid confusion.
- 1.9 (2005-11-03)
-
- Extended string (un-)escaping to CSS 1, CSS 2, CSS 2.1, C99, Java and URL formats. Available as “String Utility” and from AppleScript.
- Added a Dock menu
- The “Misc” tab now displays information from the DerivedCoreProperties file.
- UTF-8 input method error messages mention allowed value ranges for individual bytes more specifically.
- Property value descriptions are now read from PropertyValueAliases.txt. (Files in “Unicode Descriptions” folders will override the information from PropertyValueAliases.txt)
- The Unicode data files PropList, Scripts and DerivedAge were not read correctly in UnicodeChecker 1.8. This bug has been fixed.
- The keyboard shortcut Command–2 now cycles through the open Utilities windows (or creates a new one if none exists yet). In order to create another Utilities window, use Command–N.
- UnicodeChecker’s core functionality is available to developers through a framework inside the application bundle.
- 1.8 (2005-07-09)
-
- Spotlight support: UnicodeChecker can generate an index for finding characters by name using Spotlight search.
- The UTF-8 input method could have displayed an incorrect error message in the ToolTip. This has been fixed. (Only the error message itself was incorrect, parsing of the string should have been correct.)
- UnicodeChecker starts up much faster.
- The “Find Character by Name” panel performs an AND search of the space delimited substrings. In order to include a space character in a search string, the (sub-)string has to be enclosed in double quotes.
- The Unihan kDefinition field is now included in “Find Character by Name”.
- The currently selected glyph is displayed in the Find panel.
- The “Split” utility supports editing of the individual strings in the glyph table.
- The font size of glyphs displayed in the main window’s glyph table can be changed (from the “Window” menu).
- Displays the version of the currently used Unicode data files. (See section “Unicode Version Information” in the UnicodeChecker Help for more information.)
- Unicode data files can be overridden by placing appropriate files in any “Library/Application Support/UnicodeChecker/Unicode Data” folder.
- Fixed a bug where setting UnicodeChecker preferences to display the most recently displayed codepoint after starting up would actually display a totally different codepoint.
- 1.7 (2005-05-08)
- Uses Unicode 4.1 data files
- Support for the Unihan data file. (The Unihan.txt file is not included with UnicodeChecker as it is several megabytes in size. See the UnicodeChecker Help for further instructions.)
- There was a bug in the AppleScript command “XHTML representation of” which sometimes prevented the option for decimal/hexadecimal entities to work as expected. This has been fixed.
- Added the optional parameter “named entities” to the AppleScript command “XHTML representation of” for selecting whether named entities should be used (the default) when available for a codepoint or not.
- The last codepoint inside a character block would not be displayed as belonging to its character block. This has been fixed.
- In IDNA conversion: Strings with leading separators (i.e. dots) would have the separators in the wrong places after conversion. This has been fixed.
- In IDNA-Utility: Display or non-display of warning message “Contains codepoint unassigned in nameprep” below the conversion output fields could have been wrong. This has been fixed.
- HTML to Unicode conversion can now optionally preserve ASCII entities (previously this was only possible for Unicode to HTML conversion) – this is available for the HTML conversion utility as well as from a new service menu entry and via AppleScript.
- The tab that was selected in the main window when quitting is now remembered.
- Codepoints can be entered numerically using UTF-8. (When entering invalid UTF-8 sequences, the ToolTip of the text field will show detailed diagnostics.)
- Codepoints from the main window table can be dragged and dropped.
- 1.6.2 (2005-01-16)
- Changed normalization implementation to adhere to https://www.unicode.org/review/pr-29.html
- Added AppleScript command 'find codepoint' for remotely opening the find panel
- Provided a search template for use with LaunchBar
- UTF-8 display can be toggled between hexadecimal or octal by clicking on the UTF-8 label in the “Codes” tab
- Pasted characters retain their original font
- New Utility windows are opened with the most recently used utility
- Notifications stating the number of replaced codepoints after using any of the HTML/Unicode conversion services are now displayed using Growl if installed. For more information see http://growl.info
- In the find panel up and down keys can now be used to navigate through the list without the need of setting the focus to the table first
- 1.6.1 (2004-11-01)
- Fixed a bug where converting XHTML to Unicode would remove whitespace and newline characters after the XHTML entity
- Preloading the font menu on UnicodeChecker startup can now be toggled in the preferences
- Uses Unicode 4.0.1 data files
- 1.6 (2004-03-04)
- Significant performance improvements
- Displays the currently used font
- Lets you select the font to use for display on a global basis in the prefs
- Shows all fonts encoding a given glyph and allows you to select one of these fonts for display
- Preference settings for the XHTML service (whether to use decimal, hex or named entities)
- History of visited codepoints
- Support for Adobe Glyph Lists
- Casings now display the mapped character(s) in addition to the mapped code(s)
- Minor cosmetic fixes to window behaviour, menu item names, menu key equivalents, &c.
- Needs Mac OS X 10.3 or greater (sorry!)
- 1.5.7 (2003-04-27)
- Uses Unicode 4.0 data files
- 1.5.6 (2003-04-09)
- Updated to comply with RFC 3490 for IDNA
- AppleScript support for converting to and from IDNA
- AppleScript support, Services menu entry and new utility for adding and replacing percent escape sequences for URLs
- 1.5.5 (2003-01-03)
- AppleScript support for converting to and from HTML entities and displaying character information
- 1.5.4 (2002-11-12)
- Runs in Mac OS X 10.1 again
- 1.5.3 (2002-11-07)
- Fixed a bug where Hangul decompositions would not be displayed correctly
- 1.5.2 (2002-10-26)
- The HTML entity conversion services now report the number of replaced characters/entities
- New Utility for converting to and from HTML entities (formerly only available from the Services menu)
- Unicode Normalization Forms are now also available from the Services menu
- The “Display Character Information” service now recognises U+nnnn, {, Ī and &abc; notations.
- Improved speed for string conversion
- The Unicode 1 names for codepoints are included for the “Find Codepoint” menu command
- 1.5.1 (2002-04-05)
- In the Planes PopUp the “Supplementary Special-purpose Plane” has been corrected to be plane 14 instead of plane 16
- 1.5 (2002-04-03)
- Services
- Lots of additional codepoint information
- Unicode Utilities
- 1.1 (2002-01-23)
- Unicode Character Blocks can be found by name or code
- Individual characters can be found by Unicode name
- The corresponding Unicode name is displayed for the selected character
- 1.0 (2001-11-12)
Complete list