HyperAI

LED Latin Inscription Dataset

Date

9 days ago

Organization

DeepMind

Publish URL

github.com

Download Help

LED is the largest machine-operable Latin inscription dataset to date, released by Google DeepMind in 2025. The related paper is "Contextualizing ancient texts with generative neural networks".

The dataset contains a total of 176,861 inscriptions, but most of them are partially damaged, and only 5% inscriptions can produce usable corresponding images. The data comes from the three most comprehensive Latin inscription databases: the Roman Inscription Database (EDR), the Heidelberg Inscription Database (EDH) and the Clauss-Slaby Database, which contain inscriptions from the seventh century BC to the eighth century AD, and the geographical coverage ranges from the Roman provinces of Britannia (now Britain) and Lusitania (Portugal) in the west to Egypt and Mesopotamia (Iraq) in the east.