Other VTLS Open Source Components

The Metadata Extraction Service via JHOVE

This service automates the capture of technical metadata for preservation purposes. It records still image metadata according to the MIX standard (the XML expression of the draft NISO Z39.87 standard for technical metadata for digital still images). The functionality is exposed as a web service that captures technical metadata from popular image file formats such as TIFF, JPEG, GIF and others. It allows datastream metadata to be extracted directly from the source image and populated into FEDORA™, both to improve search results and to preserve information about the creation of the file.

This service also automatically validates digital object content in supported formats, using JHOVE (the JSTOR/Harvard Object Validation Environment). Administrators can trust that content being added to the FEDORA™ repository can be validated as a known format type, that it is not masquerading as another format type, and that the metadata configured for that format type can be extracted for preservation purposes, future version migrations and advanced resource discovery.
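The idea of checking that a file is not masquerading as another format can be illustrated with a minimal sketch. This is not JHOVE and does not use its API; it only compares a file's leading bytes (well-known signatures for TIFF, JPEG and GIF) against the format implied by its extension. The function names and the extension table are assumptions made for illustration, and real validation goes much further than a signature check.

```python
from pathlib import Path
from typing import Optional

# Well-known leading byte signatures ("magic numbers") for three image formats.
SIGNATURES = {
    "TIFF": (b"II*\x00", b"MM\x00*"),  # little-endian and big-endian TIFF
    "JPEG": (b"\xff\xd8\xff",),
    "GIF": (b"GIF87a", b"GIF89a"),
}

# Hypothetical mapping from file extension to the format it claims to be.
EXTENSIONS = {
    ".tif": "TIFF",
    ".tiff": "TIFF",
    ".jpg": "JPEG",
    ".jpeg": "JPEG",
    ".gif": "GIF",
}


def sniff_format(data: bytes) -> Optional[str]:
    """Return the format whose signature matches the start of data, if any."""
    for name, signatures in SIGNATURES.items():
        if data.startswith(signatures):
            return name
    return None


def is_masquerading(filename: str, data: bytes) -> bool:
    """True when the extension claims a known format the bytes do not match."""
    claimed = EXTENSIONS.get(Path(filename).suffix.lower())
    return claimed is not None and sniff_format(data) != claimed
```

For example, `is_masquerading("scan.tif", data)` would flag a file whose content begins with a JPEG signature despite its `.tif` extension.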

The Handle System

The Handle System is a comprehensive system for assigning, managing, and resolving persistent identifiers, known as "Handles," for digital objects and other resources on the Internet. Handles can be used as Uniform Resource Names (URNs). This service creates Handles and assigns them to digital objects within the FEDORA™ repository.
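As a rough illustration of what such an identifier looks like, a Handle consists of a naming-authority prefix and a local suffix separated by a slash, and it can be resolved through the public proxy at hdl.handle.net. The sketch below only builds these strings; it does not register anything, and the prefix `12345` and the suffix are hypothetical values, not ones the service is known to use.

```python
from urllib.parse import quote

# Public HTTP proxy for resolving Handles.
HANDLE_PROXY = "https://hdl.handle.net/"


def make_handle(prefix: str, suffix: str) -> str:
    """Join a naming-authority prefix and a local suffix into a Handle."""
    if not prefix or not suffix or "/" in prefix:
        raise ValueError("prefix and suffix are required; prefix may not contain '/'")
    return f"{prefix}/{suffix}"


def resolver_url(handle: str) -> str:
    """Build the proxy URL that resolves the given Handle."""
    # Percent-encode characters that are unsafe in a URL path, keeping '/'.
    return HANDLE_PROXY + quote(handle, safe="/")
```

With the hypothetical values above, `resolver_url(make_handle("12345", "vital-42"))` yields a stable link that stays the same even if the object's location in the repository changes.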

The Web Crawler Indexing and Exposure Service

This service exposes repository content to web crawlers, such as Google's, by converting MARCXML metadata to XHTML. The service can distinguish human users from crawlers, so that users are redirected to the object itself within FEDORA™.
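The source does not say how the service tells users from crawlers. One common approach, assumed here purely for illustration, is to inspect the HTTP User-Agent header. In this sketch the token list, the function names and the response tuples are all hypothetical.

```python
# Hypothetical substrings that commonly appear in crawler User-Agent headers.
CRAWLER_TOKENS = ("googlebot", "bingbot", "slurp", "crawler", "spider")


def is_crawler(user_agent: str) -> bool:
    """Guess whether a request comes from a crawler, based on its User-Agent."""
    ua = (user_agent or "").lower()
    return any(token in ua for token in CRAWLER_TOKENS)


def respond(user_agent: str, xhtml_page: str, object_url: str):
    """Serve the XHTML metadata page to crawlers; redirect users to the object."""
    if is_crawler(user_agent):
        return (200, xhtml_page)  # crawler indexes the converted metadata
    return (302, object_url)      # human user is sent on to the repository object
```

User-Agent strings are easy to spoof, so a production service would likely combine this with other signals.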

The Content Model Configuration Service

This service allows customized content models to be created, each defining a class of objects with similar content. Content models define, and therefore control, how repository objects in FEDORA™ are submitted, managed, indexed and displayed to staff and end users. Content models can be created to suit the institution's specialized needs. Such content might include theses, dissertations, research publications, multimedia content or any other highly structured content type the institution seeks to manage.

The SRW/SRU Interface for exposure of repository content

The SRW/SRU services (Search/Retrieve Web Service and Search/Retrieve via URL, respectively) define a method for interacting with and retrieving information from remote databases. This technology is well suited to providing a 'gateway' in front of otherwise complicated protocols or proprietary access mechanisms. Because search results and other information are transferred as XML, SRW/SRU has become a tool for building bridges to more technically complicated protocols, such as Z39.50. This service uses the SRW/SRU protocol to expose the content of the FEDORA™ repository to portals, federated search tools, and other search engines that support this emerging protocol.
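To give a sense of what an SRU request looks like, the following minimal sketch builds a searchRetrieve URL from a CQL query, using the standard SRU request parameters (operation, version, query, startRecord, maximumRecords). The base URL is a placeholder, not the actual endpoint of this service, and the function name is an assumption.

```python
from urllib.parse import urlencode


def sru_search_url(base_url: str, cql_query: str,
                   max_records: int = 10, start_record: int = 1,
                   version: str = "1.1") -> str:
    """Build an SRU searchRetrieve URL for the given CQL query."""
    params = {
        "operation": "searchRetrieve",
        "version": version,
        "query": cql_query,            # CQL, e.g. dc.title = "maps"
        "startRecord": start_record,   # 1-based position of the first record
        "maximumRecords": max_records,
    }
    return f"{base_url}?{urlencode(params)}"


# Placeholder endpoint; a real deployment would publish its own base URL.
url = sru_search_url("http://repository.example.org/sru", 'dc.title = "maps"', max_records=5)
```

The server answers such a request with an XML document containing the matching records, which a portal or federated search tool can then parse.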
