This study explores children’s multimodal behavior in response to a robot’s verbal turn during a word learning task. We conducted an experiment consisting of four sessions and measured changes in the behavior of the child reflecting the development of the interactions. Preliminary results suggest that children made use of various multimodal signals such as gestures or delay markers. Analyzes on children’s gaze behavior revealed further critical observations with respect to the use of a tablet in child-robot interactions.