Come l’Intelligenza Artificiale accelera il refactoring dei sistemi legacy bancari?

L’integrazione tra AI Generativa e analisi statica permette di decifrare rapidamente codebase obsoleti, riducendo i tempi di discovery da settimane a minuti. Grazie all’architettura RAG, è possibile interrogare il codice vettorializzato per comprendere flussi logici complessi e dipendenze nascoste, facilitando l’estrazione delle regole di business senza dover analizzare manualmente milioni di righe di codice.

Qual è la strategia migliore per migrare un monolite bancario a microservizi?

La tecnica raccomandata è lo Strangler Fig Pattern potenziato dall’AI. Questo approccio evita la riscrittura totale immediata, preferendo l’isolamento graduale dei contesti limitati. Gli LLM vengono utilizzati per estrarre la logica pura dal vecchio sistema e riscriverla in linguaggi moderni, mentre si creano interfacce facade che instradano il traffico verso i nuovi microservizi in modo progressivo.

Come garantire la sicurezza del codice generato dall’AI in ambito finanziario?

La sicurezza si ottiene imponendo vincoli espliciti nei prompt, come l’uso di Prepared Statements per prevenire SQL Injection secondo gli standard OWASP, e integrando controlli automatici nella pipeline CI/CD. È essenziale mantenere un approccio human-in-the-loop per la revisione del codice critico e utilizzare strumenti SAST per scansionare vulnerabilità prima che il software vada in produzione.

Come si risolve il problema della mancanza di documentazione nel codice legacy?

Si utilizza un approccio di arricchimento semantico tramite LLM specializzati nel code understanding. Questi modelli analizzano il flusso dei dati e suggeriscono rinominazioni delle variabili oscure con termini basati sul contesto reale, come visto nel caso studio CRM BOMA. Questo processo trasforma il codice black box in una struttura leggibile e manutenibile per i futuri sviluppatori.

Cosa sono le allucinazioni dell’AI nel coding e come si gestiscono?

Le allucinazioni si verificano quando l’AI inventa librerie o metodi inesistenti. Per mitigarle, si adottano strategie come la compilazione immediata del codice suggerito direttamente nell’IDE e l’uso di modelli multipli per confrontare le soluzioni (Mixture of Experts). Inoltre, la generazione automatica di unit test assicura che il nuovo codice rispetti rigorosamente le funzionalità del sistema originale.

Refactoring Legacy Banking Systems: AI and Static Analysis Guide

by Francesco Zinghinì

Published on Jan 19, 2026

Updated on Jan 19, 2026

7 minutes reading time

artificial intelligence programming security software

Legacy system refactoring workflow with AI and static code analysis

In the financial landscape of 2026, legacy system refactoring is no longer an optional choice, but an operational survival necessity. Banking institutions find themselves squeezed between the need to innovate rapidly (to compete with digital-native Fintechs) and the weight of monolithic codebases, often written in COBOL or old versions of Java, that have been handling critical transactions for decades. This technical guide explores how the integration of Generative Artificial Intelligence (GenAI) and deterministic static code analysis tools is revolutionizing the way we approach software modernization.

The “Black Box” Problem in Banking Systems

The main obstacle in refactoring banking systems is not writing new code, but understanding the old one. We are talking about millions of lines of code where business logic is intertwined with infrastructure management, and where documentation is often missing or obsolete. In this context, a manual approach is risky and unsustainably slow.

The modern solution lies in a hybrid approach: using static analysis to create a certain map of dependencies and LLMs (Large Language Models) specialized in code understanding to decipher the semantic intent of functions.

Phase 1: Mapping and Discovery with Static Analysis and AI

Refactoring Legacy Banking Systems: AI and Static Analysis Guide - Summary Infographic — Summary infographic of the article “Refactoring Legacy Banking Systems: AI and Static Analysis Guide” (Visual Hub)

Before touching a single line of code, it is necessary to illuminate the shadow zones of the monolith. Here is how to structure the discovery phase:

1. Abstract Syntax Tree (AST) Generation

Static analysis tools (like advanced SonarQube or proprietary mainframe analysis tools) must be configured to generate not just quality reports, but complete dependency graphs. The goal is to identify:

Afferent and efferent coupling: Which modules are too interconnected?
Dead Code: Code that is never executed but consumes cognitive resources.
Hardcoded Secrets: Credentials buried in the source code.

2. Semantic Code Search with RAG (Retrieval-Augmented Generation)

Once the codebase is indexed, we can use a RAG architecture. Instead of asking a generic LLM to “explain this file”, we insert the entire vectorized codebase into a database. This allows us to query the system with high-level questions:

“Show me all functions that calculate compound interest and have direct dependencies with the DB_CUSTOMER_HISTORY table.”

The AI returns not only the files but the logical flow connecting them, reducing analysis time from weeks to minutes.

Phase 2: Refactoring Strategies towards Microservices

Programmer analyzing legacy banking code with AI tools — Static analysis and AI guide the refactoring of critical banking systems.

Once the territory is mapped, the goal is legacy system refactoring towards a microservices or modular architecture. The reigning technique remains the Strangler Fig Pattern, powered by AI.

Isolation of Business Logic

This is where the experience gained in developing the BOMA CRM comes into play. During the creation of BOMA, we faced the need to migrate complex customer management logic from an old VB6 management system. The common mistake is attempting to rewrite everything from scratch (Big Bang Rewrite). Instead, we used AI to extract the “pure rules” of business, separating them from the user interface and data access code.

The applied process:

Identification: Static analysis identifies the module boundaries (Bounded Context).
Assisted Extraction: The legacy code is provided to the LLM, requesting the generation of an agnostic version of the logic in a modern language (e.g., Go or Rust), keeping inputs and outputs identical.
Facade Creation: An interface is implemented to route calls from the old system to the new microservice.

Phase 3: Security and Compliance (OWASP Top 10)

In the banking sector, security is non-negotiable. Using AI to generate code introduces new risks (e.g., insecure code or hallucinations). It is imperative to integrate security controls into the refactoring pipeline.

Prompt Engineering for Security

When asking an LLM to refactor a function, the prompt must include explicit security constraints. Example of a structured prompt:

ROLE: Senior Security Architect
TASK: Refactoring of the function 'processTransaction' from COBOL to Java Spring Boot.
CONSTRAINTS:
1. Use Prepared Statements to prevent SQL Injection (OWASP A03:2021).
2. Implement rigorous input validation.
3. Ensure logs do not contain PII (Personally Identifiable Information).
4. Add Javadoc comments explaining the preserved business logic.

Automatic Validation in CI/CD

Code generated by AI must never go into production without validation. The CI/CD pipeline must include:

SAST (Static Application Security Testing): Automatic scanning for known vulnerabilities.
Generated Unit Tests: Ask the AI to generate unit tests for the old code and ensure the new code passes the same tests (Regression Testing).

The Case Study: The Legacy of CRM BOMA

The experience with CRM BOMA was enlightening in defining this protocol. In that project, the challenge was not just technological, but semantic. The old system used obscure nomenclature (e.g., variables like var1, x_temp). Using LLMs to analyze the data flow, we managed to rename and refactor variables with speaking names based on the real usage context (e.g., customerLifetimeValue, lastInteractionDate).

This process of “semantic enrichment” during refactoring allowed us not only to update the technology stack but to make the code maintainable for future developers, reducing technical debt by 60% in the first 6 months post-migration.

Troubleshooting: Managing AI Hallucinations

Even in 2026, LLMs can “hallucinate”, inventing non-existent libraries or methods. To mitigate this risk:

Human-in-the-loop: Human Code Review remains mandatory for critical logic.
Immediate Compilation: Integrate the IDE with the AI agent to verify that the suggested code compiles in real-time.
Cross-referencing: Use two different models to generate the code and a third model to compare the solutions (“Mixture of Experts” Pattern).

In Brief (TL;DR)

Modernization of legacy banking systems becomes sustainable by combining the precision of static analysis with the interpretive capacity of Generative Artificial Intelligence.

The adoption of RAG architectures and evolutionary patterns accelerates the mapping of obsolete code, facilitating the secure transition from complex monoliths to agile microservices.

Integrating rigorous security protocols and automatic validations into the development pipeline mitigates operational risks derived from automatic code generation.

(adsbygoogle = window.adsbygoogle || []).push({});

Conclusions

disegno di un ragazzo seduto a gambe incrociate con un laptop sulle gambe che trae le conclusioni di tutto quello che si è scritto finora

Legacy system refactoring in the banking sector is open-heart surgery. The adoption of static analysis tools combined with artificial intelligence allows for reducing operational risks and accelerating time-to-market. However, technology is only an accelerator: deep understanding of banking domains and software architecture, as demonstrated in the BOMA case, remains the irreplaceable foundation for project success.

Frequently Asked Questions

disegno di un ragazzo seduto con nuvolette di testo con dentro la parola FAQ

How does Artificial Intelligence accelerate legacy banking system refactoring?

The integration of Generative AI and static analysis allows for rapidly deciphering obsolete codebases, reducing discovery times from weeks to minutes. Thanks to the RAG architecture, it is possible to query the vectorized code to understand complex logical flows and hidden dependencies, facilitating the extraction of business rules without having to manually analyze millions of lines of code.

What is the best strategy for migrating a banking monolith to microservices?

The recommended technique is the Strangler Fig Pattern powered by AI. This approach avoids immediate total rewriting, preferring the gradual isolation of bounded contexts. LLMs are used to extract pure logic from the old system and rewrite it in modern languages, while facade interfaces are created to route traffic to the new microservices progressively.

How to ensure the security of AI-generated code in the financial sector?

Security is achieved by imposing explicit constraints in prompts, such as the use of Prepared Statements to prevent SQL Injection according to OWASP standards, and by integrating automatic controls into the CI/CD pipeline. It is essential to maintain a human-in-the-loop approach for reviewing critical code and to use SAST tools to scan for vulnerabilities before the software goes into production.

How is the problem of missing documentation in legacy code solved?

A semantic enrichment approach is used via LLMs specialized in code understanding. These models analyze the data flow and suggest renaming obscure variables with terms based on the real context, as seen in the CRM BOMA case study. This process transforms the «black box» code into a readable and maintainable structure for future developers.

What are AI hallucinations in coding and how are they managed?

Hallucinations occur when the AI invents non-existent libraries or methods. To mitigate them, strategies such as immediate compilation of the suggested code directly in the IDE and the use of multiple models to compare solutions (Mixture of Experts) are adopted. Furthermore, the automatic generation of unit tests ensures that the new code strictly respects the functionalities of the original system.

Francesco Zinghinì

Electronic Engineer with a mission to simplify digital tech. Thanks to his background in Systems Theory, he analyzes software, hardware, and network infrastructures to offer practical guides on IT and telecommunications. Transforming technological complexity into accessible solutions.

Did you find this article helpful? Is there another topic you’d like to see me cover?
Write it in the comments below! I take inspiration directly from your suggestions.