Speaker
Description
This talk will explore the Linaro Forge toolsuite's capabilities for performance monitoring in accelerated compute environments, focusing on GPU utilization. We will present a light-weight profiler that allows application developers to gather essential, high-fidelity metrics offline using Linaro Performance Reports and enables deeper insight into performance issues using Linaro map. These tools provide minimal impact on application runtime and the design makes these tool easily integrable into scheduling scripts.
While Linaro Forge tools are primarily application-developer focused, we will address the system-wide perspective by discussing how they could be used to provide visibility into job-level GPU metrics. We will also share and seek feedback on speculative future product ideas aimed at extending our tools to deliver a broader, sysadmin-focused view of GPU occupation, utilization, and energy consumption.