MAOAM: Unified Object and Material Selection with Vision-Language Models
MAOAM uses a single vision-language model to select both objects and materials in images, prompted by either text or clicks. That enables more precise, flexible photo and video editing than today’s simple masks allow. If you build creative tools, this points to where AI-powered selection is heading.
Jaden Park, Valentin Deschaintre