Robots that can talk and move may shift from being tools to being potential participants, which poses new methodological challenges, particularly for transcription. This chapter first presents best practices for transcribing multimodal robot actions, focusing on sound. Robots animate the action repertoires that their designers give them and can do so again and again, producing virtually identical sequences. The chapter discusses how to transcribe such repeated actions, balancing the general script against situated moves. Moving from transcription to analysis, it pays special attention to differences in how humans and robots demonstrate understanding of sequential actions. The chapter closes by demonstrating how transcription can reveal the dynamic character of robot participation, which is often assisted and scaffolded by humans who frame the robot's actions as relevant and accountable.