package uuseg

  1. Overview
  2. Docs
Unicode text segmentation for OCaml

Install

Dune Dependency

Authors

Maintainers

Sources

uuseg-15.1.0.tbz
sha512=1e9460dc5a856c985d40c61fd1560bdfdb8bbaf8d7430405814589b47d4a7f7869658d1e3198c7a9132412e9b4b85402ceb4bda5040da426b69e9aef4222a23a

Description

Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.

The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.

Uuseg is distributed under the ISC license. It depends on Uucp.

Homepage: http://erratique.ch/software/uuseg

README

Uuseg — Unicode text segmentation for OCaml

v15.1.0

Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.

The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.

Uuseg is distributed under the ISC license. It depends on Uucp.

Homepage: http://erratique.ch/software/uuseg

Installation

Uuseg can be installed with opam:

opam install uuseg
opam install uuseg cmdliner uutf  # For the usegtrip tool. 

If you don't use opam consult the opam file for build instructions.

Documentation

The documentation can be consulted online or via odig doc uuseg.

Questions are welcome but better asked on the OCaml forum than on the issue tracker.

Sample programs

The usegtrip tool segments text provided on standard input.

See also the doc examples.

Dependencies (5)

  1. uucp >= "15.1.0" & < "16.0.0"
  2. topkg build & >= "1.0.3"
  3. ocamlbuild build
  4. ocamlfind build
  5. ocaml >= "4.14.0"

Dev Dependencies

None

Used by (9)

  1. fuzzy_compare
  2. inquire = "0.2.1"
  3. matita
  4. notty < "0.2.3"
  5. ocamlformat >= "0.10" & < "0.25.1"
  6. ocamlformat-lib
  7. ocamlformat-rpc < "0.21.0"
  8. slug
  9. zed >= "3.2.0"

Conflicts (2)

  1. cmdliner < "1.1.0"
  2. uutf < "1.0.0"