NYU Libraries Data Collections: NYU Libraries data collections restricted to members of the NYU community.
Published 2022 | Version v1.0
Dataset

ProQuest Vogue Text-as-Data Collection

Creators

Contributors

Distributor:

Description

The collection consists of machine-readable text extracted from the print magazine, covering issues from 1892 to 2020. In total there are 450,921 .xml files, with a combined size of 1.67 GB. There is one .xml file for each item, advertisement, article, or subsection of a magazine issue; each file includes metadata about that item and its full text as extracted from the digitized print using optical character recognition (OCR). The collection also includes 651,896 .jpeg files totaling 579 GB, one for each page of the original print. This collection is static: it is not updated with more recent issues. It is available to NYU faculty and students only. Instructions for accessing this collection are available at https://guides.nyu.edu/tdm/proquest-vogue-magazine
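
Because each item is stored as its own .xml file, a first pass over the collection can be as simple as walking a directory and parsing each file. The sketch below is a minimal illustration in Python, not the documented access method. This record does not describe the XML schema, so the code assumes no element names: it reports whatever root and child tags it finds and joins all text in the file, metadata included. The function names `summarize_item` and `iter_items` are hypothetical.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def summarize_item(xml_path):
    """Summarize one item file without assuming a particular schema."""
    root = ET.parse(xml_path).getroot()
    # Tags of the root's direct children, useful for discovering which
    # metadata fields the files actually contain.
    child_tags = [child.tag for child in root]
    # All text in the file (metadata and OCR text alike), with runs of
    # whitespace collapsed to single spaces.
    text = " ".join(" ".join(root.itertext()).split())
    return {
        "file": Path(xml_path).name,
        "root": root.tag,
        "fields": child_tags,
        "text": text,
    }


def iter_items(collection_dir):
    """Yield one summary per .xml file found under collection_dir."""
    for xml_path in sorted(Path(collection_dir).rglob("*.xml")):
        try:
            yield summarize_item(xml_path)
        except ET.ParseError:
            # Skip files that are not well-formed XML.
            continue
```

For example, `next(iter_items("vogue_xml"))` would return the summary for the first file, where `vogue_xml` is a placeholder for wherever the .xml files have been copied. Inspecting the `fields` list on a few files is a reasonable way to learn the real tag names before writing schema-specific code.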

Additional details

Dates

Issued
1892-01-01/2016-12-31
This collection holds Vogue issues spanning December 1892 to December 2020.