Skip to primary content
Skip to secondary content

MARBLE

Museums, Archives, Rare Books & Libraries Exploration Platform

MARBLE

Main menu

  • Home
  • About
  • Resources
  • People
  • Updates

Category Archives: Metadata

Student Profiles: Liam Maher assists with exhibition history records for the Snite

Posted on November 26, 2019 by Abigail Shelton

By Liam Maher

This is the first post in a series written by students who have worked on various parts of the MARBLE project. 

Liam Maher, an alumnus of Notre Dame and current graduate student at the University of Oregon, assisted in the clean-up of exhibition history records for the Snite Museum of Art in preparation for publishing the collections online in the MARBLE platform. Liam’s process included verifying the paper exhibition files against digital records in the museum’s internal collection database while also consulting Hesburgh Libraries’ resources and publications. 

Metadata is to museums as the skeleton is to the human form

When I tell people I do metadata cleanup for an art museum, I am usually met with quizzical looks. Few people think of computer systems, let alone organizing them, as integral to the functioning of an art museum. 

A museum’s metadata, or information about its objects, is akin to the human skeleton—it gives form to the “body” of the museum and the many things contained therein. Like a misplaced or broken bone, messy data can result in lots of complications. And, inaccurate metadata, such as an incorrect artist attribution, improper dimensions, or wrong media, can spell disaster for an object and its maintenance.

Image of office table filed with files

Exhibition files staged for the summer project. Messy metadata comes in paper form too!

Nine decades of records reveal strong local, national and international connections

Over the summer, I worked with metadata that chronicles the history of exhibitions held at or organized by the Snite Museum of Art. The records span over a nearly ninety-year period, going back to the days when the Snite Museum of Art was simply known as the “Art Gallery.”

I digitized roughly 214 exhibitions from the Snite’s files. As I reviewed the files, it became abundantly clear that the Snite has been a valuable asset for the community, with connections to the Metropolitan Museum of Art, Museum of Modern Art, High Museum, Art Institute of Chicago, and the Louvre that go back as far as 1950. 

Such ties have brought priceless works by artists of international acclaim to South Bend. The Snite has and continues to serve as a cultural bridge from Notre Dame to the global art community, providing access to some of the world’s greatest treasures.

Online collections expand global access and impact

The Snite hopes to continue this tradition by increasing its reach through its online presence. 

Updated metadata for exhibitions and objects will enable the Snite to launch an online collections site. New features will allow visitors to view the collections more in-depth and immerse themselves in exhibitions from days-gone-by.

Both scholars and casual visitors to the Snite will appreciate being able to more fully understand the institution’s history and the part it has played in the global art scene. 

Cart filled with boxes next to man

Moving files at the Snite Museum…Messy metadata everywhere!

Looking back. Looking ahead.

When I first started this position, it was easy to get lost in the seemingly endless stacks of files, mysterious cabinets full of unsorted papers, and loose sheets with scribbled notes that somehow corresponded with one of the 900 exhibitions on file at the Snite. 

Keeping our team’s long-term goal in mind, however, has been a helpful means for understanding the importance of digital scholarship. 

Every digitized collection requires months of planning and development, usable and flexible metadata, and meticulous editing and updating information. The opportunity to participate in this process has given me valuable insight into how museums stay relevant in an increasingly digital age.

Student presenting in front of screen

Maher presents his work to library and museum colleagues at a July 2019 forum.

Posted in Metadata

Metadata connections

Posted on October 29, 2019 by Abigail Shelton

By Hanna Bertoldi 

What is metadata?

I am constantly thinking about metadata, or the data that describes other data. 

While most people may not often think about metadata, everyone who uses search tools benefits from clean, well-structured metadata. 

Think about a music file. Its metadata might contain the artist’s name, song title, album title, and year it was released. Spotify uses this metadata to help you find the songs you want and also to suggest music you might like. Some of these terms may be hidden from you, such as terms that describe the mood or genre of a song, but they all work in the background to create easily browsable categories. 

Screenshot of Spotify music player

Figure 1: Spotify browse page

Museums and metadata

Like Spotify, museums also create and use metadata, but, instead of music, the metadata describes pieces of art. 

As its best self, museum metadata has the potential to bridge gaps between cultural heritage institutions and provide access beyond virtual and physical barriers. This unbounded potential can only be leveraged by using accurate and standardized metadata. 

But, when I think about museum metadata, I mostly think about how messy it is. (Apparently, Spotify thinks about this too). In the screenshot below, the fields “Creation Date” and “Century” reveal some examples of data inconsistency and redundancy. For instance, you can see that at one time items without an identifiable creation date were marked with “no date” whereas others were marked with the abbreviation “n.d.” In addition, the same date is repeated in both the “Creation Date” and “Century” fields for many of these objects. These kinds of metadata errors can prevent database users from finding objects that meet their search criteria.

Screenshot of database page showing lists of dates

Figure 2: Screenshot of creation date export from EmbARK, the museum’s collections management system

Organizing the “mess” with controlled vocabularies

Controlled vocabularies are a critical element of creating consistent metadata. A controlled vocabulary is an organized arrangement of words and phrases used to index content and to retrieve content by browsing and searching.

If we wanted to build an application where you could search every artwork in the world, every arts organization or collector would have to agree to use the same vocabulary, or a “controlled vocabulary,” to describe their objects. We couldn’t have the Getty Museum using “Claude Monet” while the Louvre was using “Oscar-Claude Monet” to describe the artist that painted water lilies in the nineteenth century. They would have to agree on the same version of the artist’s name. 

Beginning in the 1980s, the Getty Research Institute started developing controlled vocabularies for cataloging visual arts and cultural heritage. The terms and associated information in the Getty Vocabularies are valued as authoritative because they are derived from published sources and represent current research and usage in art history and cultural heritage communities. 

In short, the Getty Vocabularies provide a convenient system for mapping data, without spending a lot of wasted time reinventing what already exists. 

Database screenshot showing list of keywords

Figure 3: Searching EmbARK, the museum’s collections management system, by subject terms

Applying the Getty Vocabularies to Snite Museum collections

Now, back to the messy data part. 

The Snite Museum of Art has been thinking about subject retrieval for quite a while. Perhaps as early as 2010, Snite curators began cataloging works of art with subject terms or keywords. This metadata allows curators to search for things about an object, such as who or what is depicted. Unfortunately, these terms were used without standardization. 

Building off of the curators’ previous work, my goal as Collections Database Coordinator was to transform these subject terms or keywords so that they fit within the Getty Vocabularies. On a superficial level, this process has helped to clean up things such as errors in spelling and inconsistencies with formatting and style. On a deeper level, the process created relationships among objects and increased discoverability.

Photograph of man pulling woman in a rickshaw

Figure 4: Unidentified photographer, Geisha in a Ricksha, Japan, ca. 1880-1890, albumen silver print with applied color. Snite Museum of Art, University of Notre Dame. Acquired with funds provided by Robert E. (ND ’63) and Beverly (SMC ‘63) O’Grady, 2008.054.002.

Such immediate improvements in discoverability are a result of the way Getty Vocabularies are structured: as hierarchical thesauri. 

Getty Vocabularies are hierarchical to allow similar objects to be grouped together. For example, you want to discover all of the works in your collection that depict transportation. 

  • You would search the term “vehicles” and return all objects that use the word “vehicles” 
  • You would also return any objects with more specific terms that are organized underneath the term “vehicles” such as “milk carts,” “fire engines,” “wheelbarrows,” etc.

With the hierarchical organization, you would not have to think of or know all the various subsets of vehicles.

Getty Vocabularies are also thesauri because they connect variable terms that express the same concept. It doesn’t matter if you search for “ricksha,” “rickshaws,” or the French pousse-pousse — the computer will always know what you mean. 

By leveraging the research that Getty has done, I can make the Snite collections more usable to a diverse audience.

Contributing to Getty Vocabularies to pave the way for MARBLE 

Creating a thesaurus for the entirety of human knowledge is a big task, so Getty relies on contributions from the user community to expand their vocabularies. 

I’ve come across many terms that weren’t part of the Getty Vocabularies in the Snite Museum’s database. These terms could have stayed “local,” which would have made the information only available to our staff. Our purpose, however, is to make the Museum’s collections searchable alongside the University Archives, Rare Books & Special Collections, and other cultural heritage materials on Notre Dame’s campus. 

To make sure our collections are cataloged in a way that computers can understand, I needed to add these missing terms to the Getty Vocabularies. 

Screenshot of Getty vocabulary entry for carrots

Figure 5: Record for carrots created from my submission

Using the Snite’s collections, I have contributed over 80 terms to the Getty Vocabularies within the past year. The terms are reviewed and published so that they can be used by others. Terms like these will be a core piece of the MARBLE website so that users can search across different kinds of collections through a single search portal.

MARBLE: Combining controlled vocabularies

The Getty Vocabularies are just one example of a controlled vocabulary. Other vocabularies are appropriate for cataloguing different kinds of materials, such as the Library of Congress subject terms for print materials or the National Institute for Health’s Medical Subject Headings for biomedical information. Although we don’t have many biomedical items slated for inclusion in the MARBLE site, we will need to accommodate the different controlled vocabularies used by art museum, library, and archival catalogers. 

Our Metadata Team is currently working on a solution for combining the authoritative vocabularies used by the Hesburgh Libraries and the Snite Museum. We’ve considered everything from enforcing a singular controlled vocabulary across collections to experimenting with Linked Open Data solutions that would allow us to combine different vocabularies using emerging web technologies. Cleaning up the Snite’s subject terms is just the first step towards revealing metadata’s best self. Stay tuned for future posts on our progress.

While our immediate aim is to improve discoverability, our future aim is to provide some of the same services as Spotify. Users may not be aware that controlled vocabularies are working to suggest artworks or texts that might be of interest to them. My hope is that specific metadata clean up will seamlessly improve searching and browsing on a large scale.

Posted in Metadata

Teamwork Makes the Dream Work

Posted on August 29, 2019 by Abigail Shelton

By Abigail Shelton 

At universities, there is often the desire for project partnerships, but differing organizational priorities sometimes inhibit true, long-term collaboration. The question is, how can two independent campus departments work effectively together toward sustainable outcomes?

At Notre Dame, the leadership for Hesburgh Libraries and the Snite Museum of Art started by identifying a project with shared goals that aligned with institutional priorities. 

Securing funding was the next step. The partners jointly submitted a grant proposal for a three-year project, the success of which depended on multiple stakeholders working together to develop collaborative solutions over time. In December 2017, The Andrew W. Mellon Foundation funded Notre Dame’s project to create a unified discovery and access platform for library and museum collections. 

It was clear early on that the project’s complexities demanded that technical and subject matter experts from both organizations had appropriate representation and a framework to sustain what would become a new, shared campus service. Ultimately, strong project management, persistent outreach, ongoing communication, and cross-departmental teams became important ingredients to collaboration. 

Early Planning and Outreach

Project organizers set some basic guidelines for collaboration: each unit would receive one grant-funded position; team members would regularly work in each other’s spaces; the project team would meet every quarter for an update; and staff would tour and gain familiarity with each others’ collections.

These structures worked well for those directly involved in the project’s work, but there was still a question of how to ensure the diverse community of experts from the museum and library were also involved and invested over the long-term. 

The first strategy was targeted outreach by the Outreach Specialist who met one-on-one with library and museum curators, archivists, subject specialists, metadata librarians, and other stakeholders. Project staff also held quarterly meetings open to all library and museum personnel to hear project updates, ask questions, and learn about next steps. The outreach specialist and project manager also began working towards a website and blog to be a communication resource for staff as well as the wider world.

The Content Team Forms 

As 2019 approached, the core team realized that they would also need needed investment from collections experts to make decisions about digital content. They invited curators from the Snite Museum of Art, archivists from University Archives, curators from Rare Books and Special Collections, and librarians from Hesburgh Libraries to form the Content Team. Led by a curator from Rare Books and Special Collections, this group began to meet regularly to select representative content from across collections for inclusion in the platform. 

This team organized the first batch of content around a theme that could encompass materials from the archives, library, and museum. In choosing performance as the opening topic, curators drew from the library’s rich sports collections, Irish music broadsides, and religious materials. Museum specialists selected objects from the Meso-American collections, prints depicting historic theater performances, and a portrait of one of the greatest nineteenth-century French actors, among others. Archives staff selected Notre Dame student jazz festival programs, historic commencement programs, and speeches from the University’s beloved past president, Rev. Theodore Hesburgh.

PPT slide with digitzed books, photographs, and archival materials

Sample of selected content

The team was sure to choose diverse formats — from photographs and print materials to three-dimensional objects and paintings — in order to challenge the next two teams in the chain: Workflow Team and Metadata Team.

From Content to Workflow and Metadata

After forming the Content Team, the core project team created the Workflow Team to empower those who would ultimately implement the project workflows. Once the Content Team handed off the first test batch, the Workflow Team set out to take the raw material and make it accessible online.

The first challenge was to identify the source systems for the digital assets and associated information for the pilot collections. Hesburgh Libraries and the Snite Museum of Art have been digitizing their rare materials for conservation, research, and teaching purposes over the course of several decades. As a result, digital images are scattered across a range of storage locations including file folders, hard drives, and cloud-based storage systems. 

The associated metadata is also dispersed across several locations, such as an integrated library system (ILS), proprietary museum database, ArchivesSpace (finding aids for archival materials), standalone websites, spreadsheets, our institutional repository, printed catalogs, digital exhibits, museum object files, and most importantly, the minds of our subject specialists. 

Once the Workflow Team identified the source systems, they began talking to the gatekeepers about formats and access. The team is currently working towards moving digital assets and metadata from their current locations into a directory structure, through a IIIF pipeline, into the search index, and ultimately to the user interface. 

Image of group working at desks and white board with diagrams

Workflow Team meeting and diagrams

A Metadata Team was also started to engage experts from each organization to make collaborative decisions about how to manage metadata across the platform. They work in tandem with the Workflow and Content Teams to plan for metadata remediation and mapping. 

One of their greatest challenges is devising a solution that enables a robust search across archives, museum, and library collections when each project partner uses different data standards and formats. 

The team has been working on better understanding how each partner is cataloging their items and storing their metadata. In the next phase of the project, the team plans to make recommendations for how to reconcile subject term differences and develop basic metadata profiles. They will also document how to map museum and archival metadata into a unified search index so that users can search across the repositories’ holdings. 

Screenshot of website with painting on leftside and metadata on rightside

Metadata display for painting from Snite Museum of Art

The Foundation of Collaboration

At the onset, the core team understood that the software would only as successful as the collaboration and the long-term commitment of the people behind it. Opportunities for staff in the museum, library, and archives to better understand each others’ collections, professional best practices, and workflows were intentionally designed to strengthen the partnership between the two campus departments.

Already, members of these teams have begun to learn the best practices of others. Walk into a Workflow Team meeting and you might hear a metadata librarian talking about museum software. Sit down with the Metadata Team and the museum database coordinator might tell you all about archival finding aids. 

The Content, Metadata, and Workflow Teams have limited project charters — their mandate is to tackle specific use-cases and questions within the three-year scope of the grant. However, the investment in developing shared understanding and vision through teamwork has cemented working relationships that will thrive after the grant period ends. This will be critical to ensuring the success of the platform well beyond the initial development and launch.

Posted in Metadata, Outreach, Project Management, Workflow

Categories

  • Development (2)
  • Metadata (3)
  • Outreach (2)
  • Project Management (3)
  • Uncategorized (1)
  • Workflow (2)

Archives

Proudly powered by WordPress