A client wants to use generative AI to create content that includes a combination of text, images, and videos. Which type of Gen AI model would be best suited for this client?

O language model
O diagnostic model
O multimodal model
O computer vision model
O I don't know this yet.