Language Models are Tactical Scenario Analysers: Leveraging LLMs for Automated Post-Mortem Analysis in High-Level Software Testing
2025 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesisAlternative title
Språkmodeller är taktiska-scenario granskare (Swedish)
Abstract [en]
High-level software testing is often complex and highly situational, and there are areas where conventional methods of automation are simply not feasible. One such potential area is the post-mortem analysis of failed tests. This thesis explores the feasibility of using Large Language Models (LLMs) for automating the analysis of simulation-based test-scenarios within Tacsi, a tactical simulation environment developed by Saab AB. A system monickered ScenarioLens was implemented to summarise JSON-encoded recordings of these scenarios and to generate a conclusive briefing. Extensive experiments evaluated model size and prompting strategies, validated through a novel method of measuring accuracy tailored to the LLM-generated conclusions. Results demonstrate that LLMs are able to generalise well to this domain under certain constraints and offer promising avenues for integrating AI into high-level, domain-specific software testing workflows. Additionally, the thesis explores the use of multimodal LLMs for the same task, though this approach showed less potential compared to purely text-based methods.
Place, publisher, year, edition, pages
2025. , p. 56
Keywords [en]
AI, LLM, LLMs, Artificial Intelligence, Large Language Models, Large Language Model, language model, language modelling, language models, reasoning, gemma3, gemma 3, deepseek, gpt, attention, saab, tactical, tactical simulation, scenario, tactical scenario, tactical simulation scenario, tacsi, scenario testing, automated testing, AI testing, AI-driven testing, AI-based testing, AI automated testing
Keywords [sv]
AI, artificiell intelligens, språkmodeller, scenario, taktik, taktiska, saab, tacsi, automatiserad testning, AI testning
National Category
Artificial Intelligence
Identifiers
URN: urn:nbn:se:liu:diva-219419ISRN: LIU-IDA/LITH-EX-A--25/095--SEOAI: oai:DiVA.org:liu-219419DiVA, id: diva2:2013751
External cooperation
Saab AB
Subject / course
Computer Engineering
Presentation
2025-06-18, Alan Turing, 16:30 (Swedish)
Supervisors
Examiners
2025-11-242025-11-132025-11-24Bibliographically approved