OCLC Work ID Retrieval Script
January 2, 2024
Libraries rely on accurate metadata to manage collections, align bookstore adoption lists with their catalogs, and reconcile electronic and physical holdings. However, matching titles across various editions and formats remains a persistent challenge for technical services staff. The OCLC Work ID serves as a crucial data point for disambiguating records, grouping related editions, and enhancing bibliographic consistency.
To address this challenge, I developed a script that automates the retrieval of Work IDs using OCLC’s WorldCat Search API v.2. This tool enables library staff to efficiently identify and compare records based on ISBNs, improving metadata management and acquisitions workflows.
Project Overview
This project consists of scripts designed to process user-selected input files (.xlsx and .mrc), query OCLC’s WorldCat Search API, and extract the corresponding Work ID for each title. By leveraging OCLC’s metadata services, the script provides a structured approach to normalizing bibliographic data across diverse sources.
Why Work IDs Matter
The OCLC Work ID functions as a unique identifier that links different manifestations of a work, regardless of format or edition. This allows libraries to:
- Reduce duplication when evaluating electronic versus print holdings.
- Identify related records that share common intellectual content.
- Improve reporting accuracy when aligning course-adopted textbooks with library resources.
Key Features
Batch Processing of Bibliographic Data
The script supports both Excel (.xlsx) and MARC (.mrc) file formats, enabling batch queries for multiple titles at once.
Automated Queries via WorldCat Search API v.2
The script uses the API to match ISBNs against WorldCat records and retrieve the associated Work ID, streamlining metadata reconciliation.
Enhanced Accuracy in Title Matching
By referencing Work IDs, the script minimizes errors caused by slight title variations, helping libraries maintain consistent cataloging practices.
Configurable Authentication and API Access
The script integrates OAuth 2.0 authentication, ensuring secure access to OCLC’s metadata services.
Getting Started
Dependencies
To use this script, your institution must have:
- An OCLC Cataloging and Metadata subscription (full cataloging).
- A FirstSearch/WorldCat Discovery subscription.
- An API client ID and secret obtained from OCLC.
Configuration
The script requires a config.yml file containing:
- key: Your API key.
- secret: Your API secret.
- auth_url: The authentication endpoint.
- token_url: The token request URL.
- metadata_service_url: The OCLC metadata service endpoint.
A sample configuration file is available in OCLC’s Developer Network GitHub repository.
License & Acknowledgments
This project is licensed under the Apache-2.0 License. The OAuth 2.0 authentication code was originally developed by Karen Coombs and has not been substantially altered from its original form. All other development, workflow design, and implementation are my own work.
Final Thoughts
By automating Work ID retrieval, this script simplifies complex bibliographic comparisons and enhances collection management efficiency. As libraries continue refining their metadata workflows, tools like this offer valuable support for maintaining accurate and accessible catalogs. I hope this project provides a practical solution for fellow technical services professionals navigating the intricacies of bibliographic data reconciliation.