Next Generation Archiver for WinCC OA



The project aims to modernize the archiving of data from CERN’s WinCC OA-based SCADA systems by delivering a flexible, future-proof solution capable of handling growing data volumes and evolving use cases. It improves the performance and accessibility of historical data, reduces dependence on a single database technology, and gives engineers and scientists richer tools for analysis and operational insight.

Overview


WinCC OA is the de facto standard for developing SCADA systems at CERN, supporting over 850 mission-critical systems across experiments, infrastructure, and accelerators. Growing data rates and increasing needs for data retrieval, both within WinCC OA and from external systems, make performance and functional improvements essential. The NextGen Archiver addresses these challenges as a modular and future-proof solution with pluggable backends supporting multiple databases. At CERN, it will be used with Oracle for backward compatibility and TimescaleDB for improved query performance and new use cases.
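The idea of pluggable backends can be illustrated with a minimal sketch. It is not the NextGen Archiver's actual interface: the names `Sample`, `ArchiveBackend`, `InMemoryBackend`, and `Archiver` are hypothetical, the in-memory store merely stands in for a database such as Oracle or TimescaleDB, and forwarding each batch to several backends at once is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class Sample:
    signal: str       # hypothetical signal name
    timestamp: float  # seconds since the epoch
    value: float


class ArchiveBackend(Protocol):
    """Hypothetical interface that each storage backend would implement."""

    def write(self, samples: list[Sample]) -> None: ...

    def read(self, signal: str, start: float, end: float) -> list[Sample]: ...


class InMemoryBackend:
    """Stand-in for a real database backend, for illustration only."""

    def __init__(self) -> None:
        self._rows: list[Sample] = []

    def write(self, samples: list[Sample]) -> None:
        self._rows.extend(samples)

    def read(self, signal: str, start: float, end: float) -> list[Sample]:
        # Return samples in the half-open interval [start, end), oldest first.
        matching = (
            s for s in self._rows
            if s.signal == signal and start <= s.timestamp < end
        )
        return sorted(matching, key=lambda s: s.timestamp)


class Archiver:
    """Depends only on the interface, so backends can be swapped or combined."""

    def __init__(self, backends: list[ArchiveBackend]) -> None:
        self._backends = backends

    def archive(self, samples: list[Sample]) -> None:
        # Assumption of this sketch: every batch goes to all backends.
        for backend in self._backends:
            backend.write(samples)
```

Because `Archiver` only relies on `write` and `read`, a new database could in principle be supported by adding one class, without touching the code that produces or consumes the data.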

Highlights in 2025


Significant progress on the TimescaleDB schema, backend, and test suite was achieved in 2025. The solution’s performance has been validated at the scale of large production deployments, with hundreds of simultaneous ingestion and querying processes. In all scenarios, it met or exceeded the requirements, even on moderately performant hardware.

To enable simple and efficient retrieval of historical data without requiring SQL knowledge, a dedicated API has been developed. It is complemented by Grafana extensions, initially developed as part of an openlab Summer Student project, that allow users to browse and select signals using familiar table and tree views. Together, they address a long-standing need for rich and dynamic dashboards, which will simplify the daily work of hundreds of system experts, operators, and scientists at CERN.

To unlock the full benefits of the new archiving solution, data from existing Oracle schemas must be migrated to TimescaleDB. This represents a significant challenge, not only due to the scale – nearly 200 TB distributed across more than 350 schemas – but also because of the complex validation checks required to ensure full preservation of data integrity.
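One common way to check that migrated data is complete is to compare row counts and checksums per signal and time window on both sides. The sketch below illustrates that general technique only; it is not the project's validation procedure. The row layout `(signal, timestamp in ms, value)`, the one-hour window, and the function names are all assumptions, and the rows are plain Python tuples rather than results fetched from Oracle or TimescaleDB.

```python
import hashlib
from collections import defaultdict

# Hypothetical row layout: (signal name, timestamp in ms, value)
Row = tuple[str, int, float]


def window_digests(rows, window_ms=3_600_000):
    """Return {(signal, window_start_ms): (row_count, sha256 hex digest)}."""
    buckets = defaultdict(list)
    for signal, ts_ms, value in rows:
        window_start = ts_ms - ts_ms % window_ms
        buckets[(signal, window_start)].append((ts_ms, value))

    digests = {}
    for key, items in buckets.items():
        h = hashlib.sha256()
        # Sort so that the digest does not depend on retrieval order.
        for ts_ms, value in sorted(items):
            h.update(f"{ts_ms}:{value!r};".encode())
        digests[key] = (len(items), h.hexdigest())
    return digests


def find_mismatches(source_rows, target_rows, window_ms=3_600_000):
    """List the (signal, window_start_ms) keys whose contents differ."""
    src = window_digests(source_rows, window_ms)
    dst = window_digests(target_rows, window_ms)
    return sorted(k for k in src.keys() | dst.keys() if src.get(k) != dst.get(k))


if __name__ == "__main__":
    source = [("pump.temp", 0, 20.5), ("pump.temp", 3_600_000, 21.0)]
    target = [("pump.temp", 0, 20.5), ("pump.temp", 3_600_000, 21.5)]
    print(find_mismatches(source, target))
```

Comparing per-window digests rather than individual rows keeps the amount of data exchanged between the two systems small and narrows any discrepancy down to a specific signal and time range, which matters at a scale of hundreds of schemas.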

Next Steps


In 2026, the team will focus on completing TimescaleDB support and preparing it for CERN-wide deployment in the first half of 2027. Production-grade versions of the Grafana extensions and data retrieval APIs will also be finalized based on pilot user feedback, alongside the migration of nearly 200 TB of existing data from Oracle to TimescaleDB.

Publications & Presentations


A. Kveton, R. Kulaga, M. Zemko, “Optimizing time series data storage for CERN industrial control systems using TimescaleDB and PostgreSQL”, presented at CERN PGDay 2025, Geneva, 2025. https://indico.cern.ch/event/1471762/contributions/6280212/

Technical Team


Martin Zemko, Nikita Nekhotyachshiy, Rafal Kulaga, Ewald Sperrer, Pedro Agostinho

Project Coordinator


Rafal Kulaga

Collaboration Liaisons


Christopher Stoegerer, Ewald Sperrer, Pedro Agostinho