Each diplomatic transcription is encoded in TEI XML. Each TEI file consists of a <teiHeader> tag and a <text> tag. In the <teiHeader>, the most important tag to fill out is the <idno type="siglum"> tag, which specifies the siglum of the witness; this siglum will be used in any generated apparatus. See the template for more details.
The <text> tag is where the transcription will go. Each <text> should consist of a <body> with a number of paragraphs (<p>) and/or verses (<lg> and <l>). Each paragraph or verse should have a unique xml:id. These xml:id identifiers will be used to collate like sections of text together.
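As a rough illustration of the structure described above, a minimal file might look like the sketch below. This is not the project template, which remains authoritative. The title, the siglum "A", the xml:id values, and the placement of <idno> inside <sourceDesc>/<msDesc>/<msIdentifier> are all assumptions made for the example; the sample text is borrowed from the examples in this document.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Illustrative skeleton only; consult the template for the real layout. -->
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title>Hypothetical title of the transcribed text</title>
      </titleStmt>
      <publicationStmt>
        <p>Unpublished transcription.</p>
      </publicationStmt>
      <sourceDesc>
        <msDesc>
          <msIdentifier>
            <!-- Assumed placement; "A" is a made-up siglum. -->
            <idno type="siglum">A</idno>
          </msIdentifier>
        </msDesc>
      </sourceDesc>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <!-- Each xml:id must be unique within the file and must be a valid
           XML name: it starts with a letter or underscore, with no spaces. -->
      <p xml:id="p1">jātir vā dravyaṃ vā padārthāv iti</p>
      <lg xml:id="v1">
        <l>suvarṇarucakādi ya<caesura/>thā yuktaṃ svakair ākāraiḥ</l>
      </lg>
    </body>
  </text>
</TEI>
```

Under this sketch, a second witness file that also uses the identifiers p1 and v1 would have those sections collated against the ones above.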
For transcribing, a subset of TEI is used to mark features such as additions, deletions, and marginal annotations.
The <subst> tag indicates text that has been substituted: a deletion (<del>) paired with an addition (<add>).
The rend attribute indicates how the addition or deletion is marked.
The place attribute indicates where the addition is noted.
sarva<subst><del rend="crossed out">vidyā</del><add place="right-margin">śabdā</add></subst>nām
<html>sarvavidyā<ins class="add">śabdā</ins>nām</html>
The <choice> tag indicates text that the transcriber has corrected: <orig> holds the reading found in the document and <corr> holds the correction.
va<choice><orig>ṃta</orig><corr>rta</corr></choice>tamānaḥ
<html>va<del class="orig">ṃta</del><ins class="corr">rta</ins>mānaḥ</html>
The <caesura> tag indicates a caesura in a line of verse.
suvarṇarucakādi ya<caesura/>thā yuktaṃ svakair ākāraiḥ
<html>suvarṇarucakādi ya-<br/>thā yuktaṃ svakair ākāraiḥ</html>
The <gap> tag indicates a section of the text that is unreadable by the transcriber.
The reason attribute gives a reason for the text being illegible.
The unit attribute is the unit of measure.
The quantity attribute is the length of the gap in units.
sā<gap reason="damaged" unit="akṣara" quantity="5"/>rvārthyam
<html>sā<span class="gap">…….</span>rvārthyam</html>
The <hi> tag indicates text that has been marked in some way for emphasis.
The rend attribute indicates how the text has been marked.
<hi rend="rubricated">ātma vastu sva</hi>bhāvaś ca
<html><span class="hi">ātma vastu sva</span>bhāvaś ca</html>
The <lb> tag indicates a new line beginning.
The n attribute indicates the line number.
In the example, a hyphen precedes the <lb> tag where a word is split across lines.
vya-<lb n="5"/>vahāre, ātmā <lb n="6"/>tattvaṃ
<html>vya-<span class="lb">⸤</span>vahāre, ātmā <span class="lb">⸤</span>tattvaṃ</html>
The <metamark> tag indicates a character that is not part of the content.
The function attribute indicates the function of the character.
tat siddhiḥ<metamark function="place marker">✗</metamark>
<html>tat siddhiḥ<span class="metamark">✗</span></html>
The <milestone> tag indicates a location in a document.
The n attribute indicates a page or folio number.
The unit attribute is the unit of measure.
<milestone n="34r4" unit="folio"/>jātir vā dravyaṃ vā padārthāv iti
<html><span class="milestone">(From folio 34r4)</span> jātir vā dravyaṃ vā padārthāv iti</html>
The <note> tag indicates a note.
The place attribute indicates where the note is placed, such as in the top-margin, footer, appendix, or inline.
The xml:lang attribute indicates the language of the note. Its main use is for writing notes in English; if xml:lang="en" is set, then the note text will not be transcribed into other scripts.
‘ktaktavatū niṣṭhā’<note place="inline">(pa॰ 1|1|16)</note>
<html>‘ktaktavatū niṣṭhā’<span class="note">(pa॰ 1|1|16)</span></html>
The <pb> tag indicates a page beginning.
The n attribute indicates the folio or page number of the new page.
anekaviṣayanihi<pb n="3r"/>tapadānām
<html>anekaviṣayanihi<span class="pb">L</span>tapadānām</html>
The <pc> tag is used to split compound words.
aneka<pc> </pc>viṣaya<pc> </pc>nihita<pc> </pc>padānām
<html>anekaviṣayanihitapadānām</html>
The spaces are not displayed, but variant readings will be split using those spaces as guides. This manual procedure is only needed occasionally, to clarify the critical apparatus in the case of very long or complex compounds.
The <retrace> tag indicates text that has been retraced.
gṛhītaṃ <retrace>gṛha</retrace>śabdena śuddham evābhidhīyate
<html>gṛhītaṃ <span class="retrace">gṛha</span>śabdena śuddham evābhidhīyate</html>
The <sic> tag indicates text that has been transcribed as found in the document, without correction.
viśeṣo<sic>pa</sic>dhiḥ
<html>viśeṣo<span class="sic">pa</span>dhiḥ</html>
The <space> tag indicates a blank space in the text.
The unit attribute is the unit of measure.
The quantity attribute is the length of the space in units.
ity arthaḥ<space unit="akṣara" quantity="2"/> atha ca
<html>ity arthaḥ<span class="space">__</span> atha ca</html>
The <supplied> tag indicates text that has been supplied by the transcriber.
ta<supplied>d</supplied> dravyam
<html>ta<span class="supplied">d</span> dravyam</html>
The <surplus> tag indicates text that the transcriber believes is superfluous.
The reason attribute indicates the transcriber's reason for marking the text as superfluous.
iti teṣān darśanaṃ<lb n="5"/><surplus reason="repeated after line break">naṃ</surplus>
<html>iti teṣān darśanaṃ<span class="lb">⸤</span><span class="surplus">naṃ</span></html>
The <unclear> tag indicates a passage that is unclear to the transcriber.
The reason attribute indicates why the text is unclear.
svarūpānyathā<unclear>t tānāpapaptiḥ</unclear>
<html>svarūpānyathā<span class="unclear">t tānāpapaptiḥ</span></html>