In CSM, the user is expected to find the image of the sound he/she hears. The user is given 3 options and is expected to mark the object that carries the visual of the sound he/she hears. Objects are in motion from the bottom up. It is aimed at improving listening and visual perception skills. With this CSM technology, the reduced financial and time costs increase the diversity of content on EdTech platforms.