Multimodal

MolmoPoint: Open-Source …

  • MolmoPoint is a new open-source multimodal model from the Allen Institute for AI (AI2) that introduces pointing and clicking capabilities.
  • It uses a novel architecture that maps visual coordinates to text tokens, allowing the model to interact with user interfaces and physical environments …
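
To make the idea of mapping visual coordinates to text tokens concrete, here is a minimal Python sketch of one generic way to do it: quantize a pixel position into a fixed number of bins per axis and emit one token per axis. This is an illustration under stated assumptions, not MolmoPoint's actual scheme. The token format (`<x_N>`, `<y_N>`), the bin count, and both function names are hypothetical.

```python
from typing import Tuple

NUM_BINS = 1000  # hypothetical number of coordinate bins per axis


def point_to_tokens(x: float, y: float, width: int, height: int,
                    num_bins: int = NUM_BINS) -> Tuple[str, str]:
    """Quantize a pixel coordinate into two illustrative text tokens."""
    if not (0 <= x < width and 0 <= y < height):
        raise ValueError("point lies outside the image")
    # Normalize to [0, 1), scale to the bin range, and clamp against
    # floating-point rounding at the upper edge.
    x_bin = min(num_bins - 1, int(x / width * num_bins))
    y_bin = min(num_bins - 1, int(y / height * num_bins))
    return f"<x_{x_bin}>", f"<y_{y_bin}>"


def tokens_to_point(x_token: str, y_token: str, width: int, height: int,
                    num_bins: int = NUM_BINS) -> Tuple[float, float]:
    """Decode the illustrative tokens back to the center of their pixel bin."""
    x_bin = int(x_token[3:-1])  # strip the "<x_" prefix and ">" suffix
    y_bin = int(y_token[3:-1])  # strip the "<y_" prefix and ">" suffix
    x = (x_bin + 0.5) / num_bins * width
    y = (y_bin + 0.5) / num_bins * height
    return x, y


if __name__ == "__main__":
    tokens = point_to_tokens(320, 240, width=640, height=480)
    print(tokens)
    print(tokens_to_point(*tokens, width=640, height=480))
```

Because the tokens are resolution-independent bins, the same token pair can be decoded against any screen or image size; the cost is a quantization error of at most half a bin per axis.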