Computer Vision
MolmoPoint: Open-Source …
- MolmoPoint is a new open-source multimodal model from the Allen Institute for AI (AI2) that adds pointing and clicking capabilities.
- It uses a novel architecture that maps visual coordinates to text tokens, allowing the model to interact with user interfaces and physical environments …
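
To make the coordinate-to-token idea concrete, here is a minimal Python sketch of one common way such a mapping can work: quantize a pixel position into a fixed number of bins per axis and emit one text token per axis. This is an illustration under stated assumptions, not MolmoPoint's actual scheme. The bin count `NUM_BINS`, the token format `<x_N>` / `<y_N>`, and both function names are hypothetical.

```python
from typing import Tuple

NUM_BINS = 1000  # hypothetical number of coordinate bins per axis


def point_to_tokens(x: float, y: float, width: int, height: int) -> Tuple[str, str]:
    """Quantize a pixel coordinate into two hypothetical text tokens."""
    # Normalize to [0, 1], scale to a bin index, and clamp so that a point
    # on the right or bottom edge still falls in the last bin.
    x_bin = min(int(x / width * NUM_BINS), NUM_BINS - 1)
    y_bin = min(int(y / height * NUM_BINS), NUM_BINS - 1)
    return f"<x_{x_bin}>", f"<y_{y_bin}>"


def tokens_to_point(x_tok: str, y_tok: str, width: int, height: int) -> Tuple[float, float]:
    """Decode the tokens back to the pixel at the center of each bin."""
    # Strip the "<x_" / "<y_" prefix and the ">" suffix to recover the bin index.
    x_bin = int(x_tok[3:-1])
    y_bin = int(y_tok[3:-1])
    return (x_bin + 0.5) / NUM_BINS * width, (y_bin + 0.5) / NUM_BINS * height


if __name__ == "__main__":
    # Encode a click target on a 1920x1080 screenshot, then decode it again.
    tokens = point_to_tokens(960, 540, 1920, 1080)
    print(tokens, tokens_to_point(*tokens, 1920, 1080))
```

In this sketch the round trip is lossy by design: the decoded point is the center of the bin, so the error is at most half a bin width on each axis. A finer grid reduces that error at the cost of a larger token vocabulary.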