Multimodal Model for AI-Generated Video Quality Assessment