AAT Editorial Guidelines: 4.1 Diacritics (Getty Research Institute)

4	APPENDICES

4.1		Appendix A: Diacritics

4.1.1			Diacritics in AAT, TGN, ULAN, and CONA

4.1.1.1			Unicode
			For the Getty vocabularies, terms and scope notes may be contributed in any language or character set, provided the data is expressed in Unicode (Unicode Consortium, Unicode 7.0 (2014)). The data is published as Unicode.

4.1.1.2			Legacy data
			As of this writing, legacy vocabulary data may be expressed as ASCII-extended characters. The following chart lists the codes used to indicate diacritics in the legacy vocabulary data. Each code consists of the dollar sign ($) followed by two digits. The code is placed immediately before the letter to which the diacritical mark applies. The same code can be applied to different letters. For example, if an acute accent should be applied to an a (á), it is recorded as $00a; if an acute accent should be applied to an e (é), it is recorded as $00e. In some cases, a single code means that two diacritics are placed over the same character (e.g., $30). In other isolated cases, the code applies to two adjacent characters (e.g., $57, a digraph).
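
The legacy convention described above (a dollar sign, two digits, then the affected letter) can be illustrated with a short Python sketch. This is only an illustration under stated assumptions, not the Getty conversion routine: the names LEGACY_TO_COMBINING and decode_legacy are hypothetical, and the table contains only the $00 (acute accent) mapping that the text confirms. Other codes, including two-diacritic codes such as $30 and digraph codes such as $57, are left untouched here because their mappings would have to come from the full list of diacritical codes.

```python
import re
import unicodedata

# Hypothetical lookup table. Only "$00 = acute accent" is stated in the
# guidelines text; every other code must be taken from the full list.
LEGACY_TO_COMBINING = {
    "00": "\u0301",  # U+0301 COMBINING ACUTE ACCENT
}

# A legacy code is "$" plus two digits, followed by the letter it modifies.
CODE_PATTERN = re.compile(r"\$(\d{2})(.)", re.DOTALL)


def decode_legacy(text: str) -> str:
    """Replace known legacy diacritic codes with precomposed Unicode letters."""

    def replace(match: re.Match) -> str:
        code, letter = match.group(1), match.group(2)
        mark = LEGACY_TO_COMBINING.get(code)
        if mark is None:
            # Unknown code: leave the original text unchanged rather than guess.
            return match.group(0)
        # The combining mark follows the base letter in Unicode; NFC then
        # composes the pair into a single code point where one exists.
        return unicodedata.normalize("NFC", letter + mark)

    return CODE_PATTERN.sub(replace, text)


if __name__ == "__main__":
    print(decode_legacy("$00a"))
    print(decode_legacy("caf$00e"))
```

Note the design choice of leaving unrecognized codes in place: a partial table then never silently corrupts data, and any remaining "$NN" sequences are easy to find afterwards.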

4.1.2			Diacritical Codes: Quick Reference


4.1.3			Diacritical Codes: Full List
			Please consult the full list of diacritics and Unicode mapping as necessary.


Last updated 23 June 2015. Document is subject to frequent revisions.

		© J. Paul Getty Trust