Grok 1.5: Grok-1.5V, a first-generation multimodal model, was unveiled by Elon Musk’s AI business, xAI. Apart from its robust text processing capabilities, Grok can handle a vast array of visual data such as screenshots, documents, charts, diagrams, and photos. Read on to know more.
Grok 1.5V: What is it?
The first multimodal model from Elon Musk’s OpenAI competitor has been unveiled. It has the ability to process data from papers, charts, diagrams, screenshots, and photos in addition to text. Grok-1.5 Vision, also known as Grok-1.5V, will shortly be made available to current Grok users and early testers.
“Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs,” the company said in a blog post.
In order to demonstrate the potential of the Grok-1.5V, the company provides seven examples. These range from turning a child’s drawing of a flowchart onto Python code to creating a bedtime story, translating a table into a CSV file format, and determining whether the wood on your deck needs to be replaced due to rot.
Keep watching our YouTube Channel ‘DNP INDIA’. Also, please subscribe and follow us on FACEBOOK, INSTAGRAM, and TWITTER