Ferret

Refer and ground anything anywhere at any granularity

5.0
1 review

207 followers

Ferret is a multimodal large language model (MLLM) from Apple that handles both image understanding and language processing, and is particularly strong at understanding spatial references within an image.
Free