The AAT, ULAN, and TGN are currently available for licensing in three formats: XML, relational tables, and MARC. Sample data may be downloaded from this page free of charge.
Licensing: The Getty vocabulary data is copyrighted. Your institution must sign a license and pay the required fees in order to obtain the full set of data. Contact us at vocab@getty.edu, subject line: Licensing, to learn more about the fees and terms of licenses. Please include an explanation of how you intend to use the data. Fresh data is available for licensing in June of each year. Thousands of records are updated or added each year.
Formats of future releases: For releases beginning in June 2008 or June 2009, we are considering discontinuing the MARC and relational tables formats. We will also adopt Unicode in our data, replacing our $xx diacritic codes. We would then release the data in XML UTF-8 format only. However, we would provide instructions on how to convert the XML into relational tables or MARC records.
Data dictionaries: Data dictionaries are available by clicking the links below. The documentation does not give step-by-step instructions on how to construct a database or interface based on the data files; analysis and a competent programmer will be required to implement the vocabulary data files. The Getty does not provide technical support.
Persistent IDs: The Getty Vocabulary Program is pleased to announce a significant improvement in our data releases. We have implemented new functionalities in our editorial system that will result in more persistent IDs for our vocabulary records. Previously, although each record had a unique numeric ID, the ID would change when new records were "merged" with existing records, and in other rare situations. While licensees of Getty vocabulary data received annual mappings of old IDs to new ones, our user community was anxious to have a more persistent ID for the Getty vocabulary records over time. The new merge process, implemented in January 2008, will result in the ID of the original vocabulary record being maintained when a new record is merged into it. Other editorial situations may occasionally require the generation of new IDs (e.g., when one existing record is divided into two records); for these rare cases, a mapping of the old IDs to the new ones will continue to be published with the annual releases. Details of this improved maintenance of a persistent ID will be published with the licensee documentation for the July 2008 data releases for all three vocabularies. Please consult this Web page after July 1, 2008 for a full explanation of the change.
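For licensees who store Getty record IDs in a local system, the annual mapping of old IDs to new ones could be applied along the following lines. This is only a minimal sketch: the layout of the published mapping is not described here, so the code assumes it has already been loaded into a dictionary of old ID to new ID. The names `resolve_id` and `update_local_ids`, and the ID values in the usage note, are hypothetical.

```python
def resolve_id(record_id, id_map):
    """Return the current ID for record_id.

    id_map is assumed to be a dict of {old_id: new_id} built from the
    published mapping (the real file layout is not specified here).
    Chains are followed in case an ID was remapped more than once.
    """
    seen = set()
    current = record_id
    while current in id_map:
        if current in seen:
            # A mapping that loops back on itself is malformed.
            raise ValueError(f"cycle in ID mapping at {current}")
        seen.add(current)
        current = id_map[current]
    return current


def update_local_ids(local_ids, id_map):
    """Return local_ids with every superseded ID replaced by its current ID."""
    return [resolve_id(record_id, id_map) for record_id in local_ids]
```

With a hypothetical mapping `{100: 200, 200: 300}`, `update_local_ids([1, 100], {100: 200, 200: 300})` leaves ID 1 untouched and rewrites ID 100 to 300. IDs that do not appear in the mapping are assumed to be unchanged, which matches the intent that merges now keep the ID of the original record.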
Download sample records from the Art & Architecture Thesaurus (AAT), a hierarchical vocabulary of 123,000 terms for 31,000 concepts describing art, architecture, and related fields.
Download sample records from the Union List of Artist Names (ULAN), a database of around 262,000 names and biographical information for nearly 118,000 artists and architects.
Download sample records from the Getty Thesaurus of Geographic Names (TGN), a hierarchical vocabulary of around 1.1 million names, with coordinates and other information, for around 892,000 geographic places.
TGN Sample Data
Format options | Data Dictionary for the TGN Data Release
XML | XML format PDF
Relational Table | Relational Table format PDF
The Art & Architecture Thesaurus® (AAT), the Union List of Artist Names® (ULAN), and the Getty Thesaurus of Geographic Names® (TGN) are copyrighted by the J. Paul Getty Trust. Companies and institutions interested in regular or extensive use of the vocabularies should explore licensing options by sending an email to vocab@getty.edu.
No warranties by Getty: The databases are provided "as is." Getty disclaims all other warranties, either express or implied, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose, with respect to the databases.