Multimodal AI Explained: How AI Understands Text, Images, Audio, and Video