This 2008 technical document from the Apache Subversion project addresses the challenge of handling Unicode filename normalization across different operating systems, particularly focusing on the differences between Normal Form Composed (NFC) and Normal Form Decomposed (NFD). It proposes solutions for consistent filename handling in version control systems, including both short-term workarounds and long-term normalization strategies to ensure cross-platform compatibility.
Background
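Unicode characters with diacritical marks can be represented in more than one form. For example, "é" can be the single code point U+00E9 (NFC) or the letter "e" followed by the combining acute accent U+0301 (NFD). The two forms look identical on screen but differ byte for byte, which causes compatibility issues in file systems and version control systems. Operating systems also handle these forms differently: macOS typically uses NFD, while others may use NFC or other forms.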
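To make the mismatch concrete, here is a minimal Python sketch using the standard library's `unicodedata` module. The helper name `same_filename` and the sample filenames are illustrative assumptions, not part of the Subversion document; the sketch only shows that two visually identical names compare unequal until both are normalized to a common form.

```python
import unicodedata

# Two spellings of the same visible name "café.txt".
composed = "caf\u00e9.txt"      # é as one code point (NFC)
decomposed = "cafe\u0301.txt"   # e + combining acute accent (NFD)


def same_filename(a: str, b: str, form: str = "NFC") -> bool:
    """Hypothetical helper: compare two names after normalizing both."""
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)


# A raw comparison treats the names as different, and so do their bytes.
raw_equal = composed == decomposed
bytes_equal = composed.encode("utf-8") == decomposed.encode("utf-8")

# Normalizing both sides to one form makes them match.
normalized_equal = same_filename(composed, decomposed)
```

Here `raw_equal` and `bytes_equal` are both false, while `normalized_equal` is true. Choosing NFC as the common form is an assumption made for the example; the comparison works the same way with `form="NFD"`.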
- Source: Lobsters
- Published: Jun 12, 2026 at 09:09 PM
- Score: 6.0 / 10