A utility library for analyzing editions of texts following the standards of the Homer Multitext project.
It includes classes for:
choice
element. Current development is focused on the HmtEditorialTokenization
class; in planning is a parallel HmtDiplomaticTokenization
class. The output of this tokenization is a pair of URNS: a CTS URN with subreference identifying the token, and a CITE URN classifying it.HmtValidator
class can validate the contents of lexical items in the XML source, and the referential integrity of attribute values for named entities.See the series of milestones in the project issue tracker. The API docs and live specs on line here may lag the version in the repository as work on hmt-utils is very active in the summer of 2015.
The repository includes a suite of unit and acceptance tests. Some unit tests of the LexicalValidation
class require an installation of morpheus
, the Greek morphological parser from the Perseus project.
The Homer Multitext project virtual machine for editors automatically builds morpheus
. If you clone this repository adjacent to the morpheus
directory in the VM's /vagrant
directory, the unit tests will find and run morpheus
by default in unit tests. The 2015 HMT VM for editors is available from https://github.com/homermultitext/vm2015.