In CSM, the user matches the texts they read with the correct visuals by dragging each image onto the sentence it illustrates. The exercise is useful for language learning and aims to improve reading and visual perception skills. By reducing financial and time costs, CSM technology increases the diversity of content on EdTech platforms.
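
The matching interaction described above can be illustrated with a minimal sketch. It is not the actual CSM implementation: the class name `MatchingExercise`, its methods, and the sentence and image IDs are all hypothetical, chosen only to show how a drop of an image onto a sentence might be checked against an answer key.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class MatchingExercise:
    """Hypothetical model of a sentence-to-image matching exercise."""

    # Maps each sentence ID to the ID of the image that illustrates it.
    answer_key: Dict[str, str]
    # Records the image the learner has most recently dropped on each sentence.
    placements: Dict[str, str] = field(default_factory=dict)

    def drop(self, image_id: str, sentence_id: str) -> bool:
        """Place an image on a sentence; return True if the match is correct."""
        if sentence_id not in self.answer_key:
            raise KeyError(f"unknown sentence: {sentence_id}")
        self.placements[sentence_id] = image_id
        return self.answer_key[sentence_id] == image_id

    def score(self) -> float:
        """Fraction of all sentences that currently hold the correct image."""
        if not self.answer_key:
            return 0.0
        correct = sum(
            1
            for sentence_id, image_id in self.placements.items()
            if self.answer_key[sentence_id] == image_id
        )
        return correct / len(self.answer_key)


# Illustrative data only: two sentences, each with one correct image.
exercise = MatchingExercise(answer_key={"s1": "img_cat", "s2": "img_dog"})
first = exercise.drop("img_cat", "s1")   # image matches the sentence
second = exercise.drop("img_bird", "s2")  # image does not match
```

Keeping the answer key separate from the learner's placements means a sentence can be re-answered by dropping a different image on it, and the score always reflects the latest placement.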