Alignment Data
What is alignment data?
Alignment data describes the relationship between two bodies of text in terms of words, groups of words, or parts of words. ClearAligner defines a JSON format for alignment data. Alignment JSON files can be imported and exported.
The JSON alignment format allows for many tokens on either the source or target side. Tokens do not need to be contiguous and can cross BCV (book, chapter, verse) boundaries.
Example
Details
The id field on alignment records is a GUID used for internal change tracking.
Values in the source arrays match the IDs in the canonical source text TSVs used by ClearAligner.
The status field describes the status of an alignment record. Supported statuses are created, approved, rejected, and needsReview.
While many records can be stored in the note array, ClearAligner currently supports only a single note per alignment record.
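The constraints above can be sketched as a small check on a single record. This is an illustrative sketch only, not ClearAligner's own code or its exact schema: the field names id, source, target, status, and note come from the text above, while the flat record layout, the AlignmentRecord class, the validate_record helper, the string type of note entries, and the token IDs are all assumptions made for the example.

```python
import json
import uuid
from dataclasses import asdict, dataclass, field
from typing import List

# Statuses listed in the Details section above.
SUPPORTED_STATUSES = {"created", "approved", "rejected", "needsReview"}


@dataclass
class AlignmentRecord:
    """Hypothetical in-memory shape of one alignment record (assumed layout)."""
    id: str                      # GUID used for internal change tracking
    source: List[str]            # token IDs on the source side
    target: List[str]            # token IDs on the target side
    status: str = "created"
    note: List[str] = field(default_factory=list)  # entry type is an assumption


def validate_record(record: AlignmentRecord) -> List[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    try:
        uuid.UUID(record.id)  # raises ValueError if the string is not a GUID
    except ValueError:
        problems.append("id is not a GUID")
    if record.status not in SUPPORTED_STATUSES:
        problems.append(f"unsupported status: {record.status!r}")
    if len(record.note) > 1:
        problems.append("only a single note per record is currently supported")
    return problems


# Placeholder token IDs, not real IDs from the source text TSVs.
# Two non-contiguous source tokens are aligned to one target token.
example = AlignmentRecord(
    id="3f2b8c1e-9d4a-4c7b-8e21-5a6f0b9c7d12",
    source=["src-token-1", "src-token-3"],
    target=["tgt-token-2"],
    status="approved",
    note=["Checked against the source text."],
)

print(validate_record(example))
print(json.dumps(asdict(example), indent=2))
```

Running a check like this before import is one way to catch an unsupported status or an extra note early; the real import may enforce different or additional rules.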