The following information and examples may guide those who wish to license the AAT, ULAN, or TGN. (CONA is in development and not yet available for licensing.) The Getty vocabularies change and grow over time; thousands of records are edited or added each year. Implementors are encouraged to refresh the data regularly.

Licensing: The Getty vocabulary data is copyrighted by the J. Paul Getty Trust. All rights reserved. Your institution must sign a license and pay the required fees in order to use the data. The data is compiled from various contributors using published sources, which must be cited along with the J. Paul Getty Trust when the data is displayed.

The data files are released annually in July (barring unforeseen technical difficulties). If you wish to license multiple vocabularies, you must obtain a separate license for each one. Licensing agreements are for a fixed five-year term; annual data updates are now being offered free of charge during that five-year term. The license fee must be paid and the license agreement signed prior to the transfer of data. At the end of five years, you may renew your license for an additional five-year term upon payment of a fee. Contact us at vocab@getty.edu, subject line: Licensing, to learn the amount of the fees and terms of licenses. Please include an explanation of how you intend to use the data and whether your institution is for-profit or not-for-profit.

Formats of the releases: The next full data release will take place in July 2014. The data is released in data files only; current releases include relational tables and XML. No software is provided. Only full sets of data are available for licensing; we do not provide subsets of the data.

Web services: Data files may also be accessed through Web services APIs (application programming interfaces), which provide access to data that is refreshed every two weeks. Note that these files contain "in process" data, meaning it could change prior to the annual July release. The APIs are available to any institution that holds a valid license. See details in the Web Services User's Instructions (PDF, 153pp., 2MB).

Linked Open Data: The project to publish all four of the Getty vocabularies as Linked Open Data (LOD) is underway. It is anticipated that the data will be published as SKOS-extended format under the ODC-BY 1.0 license. For target release dates and news on the LOD project, click Vocabularies as LOD in the left navigation. Due to feedback from the user community, we currently plan to continue providing the data as relational tables and XML releases; licenses will remain in place for these release formats. If any format were to be discontinued in the future, users would be given notice far in advance.

Data dictionaries: Note that data dictionaries for XML and relational tables formats do not give step-by-step instructions on how to construct a database or interface based on the data files; analysis and a competent programmer will be required to implement the vocabulary data files. The Getty does not provide technical support. For details regarding data content and editorial rules, see the Editorial Guidelines.

Persistent IDs: The Getty vocabularies have unique persistent IDs (called "subject_ids"). Each ID is unique among all Getty vocabularies. However, to maintain persistent IDs in constantly changing databases presents challenges. When a contributed record is merged with an existing record, the ID of the original vocabulary record is maintained. Other editorial situations may occasionally require the generation of new IDs, but new IDs are generally limited to non-preferred terms (e.g., when one existing record is divided into two records, the non-preferred terms may be placed in a new record with a new ID, while the preferred terms remain with the original ID).

Data currently available: The data currently available for licensing was cut in June 2013. The next full release of XML and relational tables formats will be in July 2014.


Download sample records from the Art & Architecture Thesaurus (AAT). The full AAT is a hierarchical vocabulary of around 36,100 records, including 250,000 terms, descriptions, bibliographic citations, and other information relating to fine art, architecture, decorative arts, archival materials, archaeology, and other material culture. The full XML data file for the AAT is around 25,000 kilobytes in size. The sample available here is a small subset of the AAT. The next data release will take place in July 2013. For further information, see About the AAT.

AAT Sample Data
Format options:

 

Data Dictionary for the
AAT Data Release

XML UTF-8 (2.4MB)

 

XML format (PDF, 22pp., 419MB)

Relational Table UTF-8 (40MB)

 

Relational Table format (PDF, 11pp., 264MB)



Download sample records from the Union List of Artist Names (ULAN). The full ULAN is a vocabulary of around 222,900 records, including 600,000 names and biographical and bibliographic information for artists, architects, firms, shops, and art repositories, including a wealth of variant names, pseudonyms, and language variants. The full XML data file for the ULAN is around 100,000 kilobytes in size. The sample available here is a small subset of the ULAN. The next data release will take place in July 2013. For further information, see About the ULAN.

ULAN Sample Data
Format options:

 

Data Dictionary for the
ULAN Data Release

XML UTF-8 (14MB)

 

XML format (PDF, 29pp., 545KB)

Relational Table UTF-8 (92MB)

 

Relational Table format (PDF, 14pp., 312KB)



Download sample records from the Getty Thesaurus of Geographic Names (TGN). The full TGN is a hierarchical vocabulary of around 1,192,000 records, including 1,900,000 names, place types, coordinates, and descriptive notes, focusing on places important for the study of art and architecture. TGN is not a GIS; coordinates are included for many records, but they are for finding purposes only (i.e., to find the place on a map). The full XML data file for the TGN is around 150,000 kilobytes in size. The sample available here is a small subset of the TGN. The next data release will take place in July 2013. For further information, see About the TGN.

TGN Sample Data
Format options:

 

Data Dictionary for the
TGN Data Release

XML UTF-8 (1.1MB)

 

XML format (PDF, 137pp., 28KB)

Relational Table UTF-8 (7.9MB)

 

Relational Table format (PDF, 343pp., 14KB)



Data is copyrighted: The Art & Architecture Thesaurus® (AAT), the Union List of Artists Names ® (ULAN), the Getty Thesaurus of Geographic Names® (TGN), and the Cultural Objects Name Authority® (CONA) are copyrighted by the J. Paul Getty Trust. Companies and institutions interested in regular or extensive use of the vocabularies should explore licensing options by sending an email to vocab@getty.edu.

No warranties by Getty: The databases are provided "as is." Getty disclaims all other warranties, either express or implied, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose, with respect to the databases.



Revised 22 October 2013