Assembling parts of the archive

HmtArchive includes functions to instantiate parts of the archive as collections of citable content. In the following examples, the hmt variable is an instance of an Archive.

Text corpora

Create a normalized corpus of all texts in the repository:

normedcorpus = normalizedcorpus(hmt)

Create a diplomatic corpus of all texts in the repository:

diplcorpus = diplomaticcorpus(hmt)

DSE indexing

Collect all DSE records:

dsecollection = HmtArchive.dse(hmt)

Indexing content of XML editions

Index scholia commenting on Iliad:

commentaryindex = HmtArchive.commentpairs(hmt)

Summary

In the following summary lines, hmt is an instance of an Archive; archivecex is the output of cex(hmt) (a complete representation of the archive in CEX format).

Each line of the table represents a complete round trip beginning from the archive, to instantiate objects in Julia, serialize the object to CEX, and then instantiate equivalent objects from a CEX serialization of the entire archive.

ComponentInstantiate from archiveSerialize to CEXInstantiate from CEX
collection of all imagesTBDimagecex(hmt)TBA
collection of all codicesTBDcodexcex(hmt) (but currently failing to include CEX for data model?)fromcex(archivecex, Codex)
catalog of all textsfromcex(textcatalogcex(hmt), TextCatalogCollection)textcatalogcex(hmt)fromcex(archivecex, TextCatalogCollection)
diplomatic editions of all textsdiplomaticcorpus(hmt)cex(diplomaticcorpus(hmt))filter full corpus created with fromcex(archivecex, CitableTextCorpus)
normalized editions of all textsnormalizedcorpus(hmt)cex(normalizedcorpus(hmt))filter full corpus created with fromcex(archivecex, CitableTextCorpus)
all DSE recordsdse(hmt)cex(dse(hmt))TBA (fromcex(archivecex, DseCollection)) is broken in current version of CitablePhysicalText)
indexes of scholia to Iliad passagesTBDscholiaindexcex(hmt)TBA (fromcex(archivecex, CitableCommentary) is broken in current version of CitableAnnotations)
other indexes (including Iliad passages to pages)TBDrelationsetscex(hmt)TBA (should be instantiated using fromcex with an implementation of the index's data model)
collections of authority lists for personal names, place names, astronomical entities, and texts no longer extantfromcex(acex, CatalogedCollection)acex = authlistscex(hmt)TBA
collection of all data models in the libraryTBD: accounted for in cex methods for DSE records, codices and imagesdatamodelcex()TBA