The answer, according to new research from the data and AI platform company, is sobering. Even the best-performing AI agents achieve less than 45% accuracy on tasks that mirror real enterprise ...
Databricks introduces OfficeQA: An open-source benchmark that tests AI agents in realistic business scenarios.
Databricks recently reached a valuation exceeding $100 billion following its latest funding round, joining the elite group of most-valuable private companies like SpaceX, ByteDance and OpenAI.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results