Repository logo
 
Loading...
Thumbnail Image
Publication

Identification of stone deterioration patterns with large multimodal models. Definitions and benchmarking

Use this identifier to reference this record.
Name:Description:Size:Format: 
1-s2.0-S1296207424002486-main.pdf3.82 MBAdobe PDF Download

Advisor(s)

Abstract(s)

The conservation of stone-based cultural heritage sites is a critical concern for preserving cultural and historical landmarks. With the advent of Large Multimodal Models, as GPT-4omni (OpenAI), Claude 3 Opus (Anthropic) and Gemini 1.5 Pro (Google), it is becoming increasingly important to define the operational capabilities of these models. In this work, we systematically evaluate the image classification capabilities of the main foundational multimodal models to recognise and categorize anomalies and deterioration patterns of stone elements that are useful in the practice of conservation and restoration of world heritage. After defining a taxonomy of the main stone deterioration patterns and anomalies, we asked the foundational models to identify a curated selection of 354 highly representative images of stone-built heritage, offering them a careful selection of labels to choose from. The result, which varies depending on the type of pattern, allowed us to identify the strengths and weaknesses of these models in the field of heritage conservation and restoration.

Description

Keywords

Cultural heritage Large multimodal models Benchmarking Stone deterioration patterns

Citation

Research Projects

Organizational Units

Journal Issue