package uunf

  1. Overview
  2. Docs
Unicode text normalization for OCaml

Install

Dune Dependency

Authors

Maintainers

Sources

uunf-14.0.0.tbz
sha512=9aac01483abb8a8a5d68832d7f7692909d61559f2c9b0284c1da293f8115d1100df26e9e4cf7280bda6499ba866f5da2c72c4c21ca99a1d594b29d000e9bb051

Description

Uunf is an OCaml library for normalizing Unicode text. It supports all Unicode normalization forms. The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.

Uunf has no dependency. It may optionally depend on Uutf for support on OCaml UTF-X encoded strings. It is distributed under the ISC license.

Home page: http://erratique.ch/software/uunf

README

Uunf — Unicode text normalization for OCaml

v14.0.0

Uunf is an OCaml library for normalizing Unicode text. It supports all Unicode normalization forms. The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.

Uunf has no dependency. It may optionally depend on Uutf for support on OCaml UTF-X encoded strings. It is distributed under the ISC license.

Home page: http://erratique.ch/software/uunf

Installation

Uunf can be installed with opam:

opam install uunf
opam install uutf uunf # for support on OCaml UTF-X encoded strings

If you don't use opam consult the opam file for build instructions and a complete specification of the dependencies.

Documentation

The documentation and API reference can be consulted online or via odig doc uunf.

Sample programs

If you installed Uuseg with opam sample programs are located in the directory opam config var uuseg:doc.

A few test programs are in the test directory of the distribution.

  • test.native tests the library with the Unicode Normalization Test file available from:

    http://www.unicode.org/Public/%%UNICODEVERSION%%/ucd/NormalizationTest.txt

    Nothing should fail.

  • test_string.native tests the UTF-X OCaml string support.

  • unftrip.native inputs Unicode text on stdin and rewrites it on stdout in a given normalization form. Invoke with --help for more information. Depends on uutf and cmdliner.

Dependencies (4)

  1. topkg build & >= "1.0.3"
  2. ocamlbuild build
  3. ocamlfind build
  4. ocaml >= "4.03.0"

Dev Dependencies (1)

  1. uucd dev & >= "14.0.0" & < "15.0.0"

Used by (8)

  1. crowbar
  2. fuzzy_compare
  3. inquire = "0.2.1"
  4. iri
  5. oidc < "0.1.1"
  6. omd >= "2.0.0~alpha3"
  7. slug
  8. uucp >= "2.0.0"

Conflicts (1)

  1. uutf < "1.0.0"