Published December 5, 2025 | Version v1
Publication Open

CresCine's Film Industry Data Repository (FIDA)

Description

The Film Industry Data Repository (FIDA) is a lifecycle-wide, multi-source database developed by the CresCine consortium1 to address the persistent data scarcity facing Europe’s small and mid-sized film markets. Built on a scalable Databricks architecture and structured through a medallion pipeline (Bronze–Silver–Gold), FIDA integrates heterogeneous datasets covering production metadata, festival circulation, theatrical distribution (showtimes, admissions, box-office), streaming availability, television programming, and socio-economic context. Data from public and open infrastructures (TMDB, Wikidata, Lumiere, World Bank), institutional partners (Cinando, European Audiovisual Observatory), and selected commercial providers (International Showtimes, UsherU, media-press.tv) are cleaned, harmonised, and linked through an internal identifier (CresCine ID) using deterministic and fuzzy-matching techniques. The resulting star-schema repository enables cross-window, cross-territory analysis of European films with a granularity not previously available, especially for countries underrepresented in commercial analytics services. FIDA is disseminated through interactive analytical dashboards and simulation tools, supported by the release of specialised aggregated datasets that comply with licensing restrictions. Designed for long-term sustainability and interoperability, FIDA provides a durable evidence base for researchers, policymakers, and industry stakeholders seeking to understand and strengthen European film circulation, performance, and public value creation.

Files

CresCine_FIDA Data paper.pdf

Files (789.3 kB)

Name Size Download all
md5:b09c637997604067aa5aedc8ba554663
789.3 kB Preview Download

Additional details

Funding

European Commission
CresCine - CRESCINE – INCREASING THE INTERNATIONAL COMPETITIVENESS OF THE FILM INDUSTRY IN SMALL EUROPEAN MARKETS 101094988