Student IT/EE Workshop 2025

Name: Student IT/EE Workshop 2025
Start: 2025-04-24T09:00:00+02:00
End: 2025-04-24T13:10:00+02:00
Location: Stara Kotłownia

24 April 2025

Stara Kotłownia

Europe/Warsaw timezone

Comparative Analysis of Multi-Agent LLM Systems for Solving Polish Matura in Physics Exams

24 Apr 2025, 10:45

30m

SK 04/05 (Stara Kotłownia)

SK 04/05

Stara Kotłownia

Warsaw University of Technology, Main Campus

Poster Session B (Poster)

Piotr Wróbel (Politechnika Warszawska)

Large Language Models have gained widespread
recognition since OpenAI released their revolutionary model,
ChatGPT 3.5. Since then, many new approaches have emerged
to improve the capabilities and accuracy of these models for
different tasks. One such method involves using multi-agent
conversations. This article compares two multi-agent setups
designed to solve the Polish standardized high school exam in
physics. Comparative benchmarks were performed on several
real final exams published by the Polish Central Examination
Board (pl. CKE — Centralna Komisja Egzaminacyjna). The
study employed ChatGPT-4 Turbo and the AutoGen framework.
Benchmarks covered a total of 90 tasks from three Polish Matura
physics exams (editions: 2018, 2019, 2023). The simpler multiagent systems achieved an average score of 76.1%, while the
more complex systems averaged 85.6%.

Piotr Wróbel (Politechnika Warszawska)

There are no materials yet.

Student IT/EE Workshop 2025

Comparative Analysis of Multi-Agent LLM Systems for Solving Polish Matura in Physics Exams

SK 04/05

Stara Kotłownia

Speaker

Description

Author

Presentation materials

Choose timezone

Student IT/EE Workshop 2025

Speaker

Description

Author

Presentation materials