Musk’s xAI reveals Grok 1.5 Vision, claims top spatial understanding
By subscribing, you agree to our Terms of Use and Policies You may unsubscribe at any time. Elon Musk’s artificial intelligence (AI) company, xAI, has unveiled its first multimodal model, Grok 1.5 Vision, as it looks to compete with OpenAI. As per the preview, in addition to understanding text, the AI model can also work with documents, charts, diagrams, screenshots, and photos. One of OpenAI’s funders, Musk advocates that AI can help humanity in unimaginable ways. However, after falling out with the vision of how OpenAI should proceed, Musk started xAI last year with a group of influential AI researchers keen on developing AI models more openly. Last November, the company rolled out the first iteration of its AI model, Grok. Further, it emphasized its push for openness by making its base model weights and network architecture open-sourced last month. The pace at which the company is working is evident, and its first multimodal AI model arrived barely a month after its architecture was made open-source. According to its website, the Grok 1.5V connects the physical and digital worlds. The company has highlighted seven examples of its capabilities to explain how the multimodal model works. A user can share a picture of a flowchart with Grok, and the AI model can translate it into Python code. By simply showing the model a nutrition label, a user can inquire how many calories one would consume by consuming certain portions of the product. While this might seem like an easy case of multiplication, the AI model can also take a child’s drawing and build an entire bedtime story using it. The model can do the converse, too. Show it a meme, and it will explain why it is funny and provide the context needed to understand it. The AI model isn’t just built for play. It can convert a table into CSV format or help you correct a piece of code that might not be working. But if you need home repair advice, just share images of the affected area, and the model is designed to help you with that as well, the company lists on its website. xAI has also released a new benchmark dubbed RealWorldQA to evaluate the spatial understanding shown by multimodal models. From examples shared by the company, Grok 1.5V can look at images and differentiate between objects that are comparatively bigger or give driving advice as well. Grok 1.5V also handsomely beats other AI models on this benchmark as well as others, according to the company’s data shared in this chart. With Elon Musk stating in a recent interview that he expects AI to be smarter than any human by the end of 2025, all eyes are on what improvements his company will bring to the AI race in the upcoming months. xAI has said that in its aim to build beneficial artificial general intelligence (AGI) that can understand the universe, the company will make significant improvements to the capabilities of its models in other areas, such as audio, voice, and video, in the coming months. Grok 1.5V will soon become available for the company’s testers and existing users, the company added in its blog.What can Grok 1.5V do?
What’s in store for the future?
相关推荐
-
Yoon approves labor minister's appointment
-
North Korea opens photo exhibition marking decade of leader's rule
-
Parasols jump in popularity amid S. Korean heatwave
-
US will impose sanctions on N. Korea, Russia when necessary: state dept.
-
Apple to start manufacturing iPhone Pro in India, report claims
-
Toothless Canada held again in Gold Cup
- 最近发表
-
- Tesla considers adding a new ‘stuck detection' feature to Cybertruck. Here’s why.
- 全市社会保险费征缴突破16亿元
- Demand for non
- Seoul turns hawkish toward Pyongyang amid pressure for Russia sanctions
- CeeDee Lamb secures record
- 界炮圣女果勇闯湾区!现场爆火,市民吃玩乐购嗨翻天
- Best Le Creuset deal of August 2023: Get 20% off
- Red Bull ready for more records
- Yoon approves labor minister's appointment
- 打造“朝夕政务”助跑团 为特定群众提供优质服务
- 随机阅读
-
- 22 Unusual Things You Can Find in the Desert
- Turkey’s Erdogan orders removal of 10 Western ambassadors, including U.S. envoy.
- Turkey’s Erdogan orders removal of 10 Western ambassadors, including U.S. envoy.
- Seoul turns hawkish toward Pyongyang amid pressure for Russia sanctions
- Swifties for Kamala raises over $100,000 in donations for Harris campaign
- North Korea opens photo exhibition marking decade of leader's rule
- Iranian jail to Wimbledon royal box
- Texas city refused requests to escort Biden bus surrounded by aggressive Trump supporters.
- 10 Big Misconceptions About Computer Hardware
- US will impose sanctions on N. Korea, Russia when necessary: state dept.
- Iranian jail to Wimbledon royal box
- Parasols jump in popularity amid S. Korean heatwave
- 评论丨农事运动会:一场农民的盛会、新农人风采展现的盛会、城乡双向奔赴的盛会
- Tesla's cheaper Model S and Model X are here, but at cost of lower range
- Iranian jail to Wimbledon royal box
- 广东百名农业经理人圆梦清华大学
- How much for Oasis tickets? Fans joke about splurging on reunion shows
- 雅安市市场监督管理局开展化妆品检查确保群众用妆安全
- Toothless Canada held again in Gold Cup
- ‘Ninja’ Djokovic eyes eighth Wimbledon title
- 搜索
-
- 友情链接
-