In this commentary we question (a) Clark and Fischer's characterization of the “social artifact puzzle”, which we consider less puzzling than the authors do, and (b) their account of social robots as depictions involving three physical scenes, which seems to us unnecessarily complex. We contrast the authors' model with a more parsimonious account based on attributions.