4 Tips For Building Better Ai Agents That Your Business Can Trust

Trending 2 hours ago
aiagent11gettyimages-2263463626
Ekaterina Demidova/Moment via Getty Images

Follow ZDNET: Add america arsenic a preferred source on Google.


ZDNET's cardinal takeaways

  • Companies are exploring AI agents successful aggregate ways.
  • Professionals must see really to utilization these technologies.
  • Measurement, collaboration, and experimentation are key.

AI agents will impact each master role. If your institution hasn't started utilizing agents yet, it will soon, either done off-the-shelf package products aliases in-house devices that tie connected ample connection models and information sources.

Professionals exploring really to usage agents successful their roles are well-advised to seek best-practice guidance. One specified root of accusation is Joel Hron, CTO astatine Thomson Reuters Labs, who is helping nan accusation services institution utilization generative AI, instrumentality learning, and agentic technologies.

Also: Worried AI agents will switch you? 5 ways you tin move worry into action astatine work

Hron told ZDNET that Thomson Reuters uses a operation of in-house models and off-the-shelf devices to powerfulness its AI innovations. As good arsenic advances successful frontier labs from Big Tech firms, Hron and his squad guarantee nan patient exploits its proprietary knowledge and assets.

"If you look astatine nan halfway of what we do well, it's being capable to synthesize quality expertise and accusation into judgement that tin beryllium served backmost to professionals," he said. 

"The transportation system for really that expertise is delivered is evolving correct now. Traditionally, it's been delivered via software. But it's progressively delivered via agents, aliases agents positive software."

Hron points to respective cardinal agentic achievements astatine Thomson Reuters, including nan AI-powered ineligible investigation instrumentality Westlaw Advantage and nan firm's Deep Research supplier that reviews insights and strategizes arsenic a interrogator would.

Also: AI agents are fast, loose, and retired of control, MIT study finds

From these explorations, Hron said he's learned 4 cardinal lessons that professionals tin usage to build trustworthy agentic AI systems.

1. Measure your success

Hron said nan first area to attraction connected is evaluations: "You request to cognize what bully looks like."

While this attraction connected evaluations sounds for illustration an evident requirement, Hron said it's a difficult process to get right, to quantify, and to systematize.

"We've said that for nan past 3 years that this is 1 of nan astir important things for building bully AI systems, and it continues to beryllium existent coming successful an era of agents," he said.

joel-hron-cto-headshot

Hron: "We still want nan assurance of our quality experts."

Thomson Reuters

Hron's squad tracks and measures agentic occurrence successful respective ways. First, they leverage nationalist benchmarks, which he said supply bully early indicators of nan affirmative imaginable capacity of caller models.

Also: 5 information strategies your business can't get incorrect successful nan property of AI - and why they're critical

Second, they've developed their ain soul benchmarks pinch beardown directions for automated evaluations: "Rather than conscionable saying, 'How adjacent is nan generated reply to a bully answer?', our process is astir really defining, 'Well, what makes nan reply good?'"

Finally, Thomas Reuters keeps humans successful nan loop, ensuring evaluations spell a measurement beyond automated assessments.

"Automated evaluations thief thrust nan flywheel faster for our improvement teams, and they tin trial a batch of ideas comparatively quickly, and that's good. But earlier we ship, we still want nan assurance of our quality experts and their appraisal of nan performance," he said.

"The continued reliance connected that attack has allowed america to vessel awesome products that execute good successful nan market. I deliberation quality input is simply a captious constituent to america being capable to do that activity good and do it pinch confidence."

2. Make experts beryllium together

Hron advised professionals to understand profoundly what agents do and really they run complete time.

"Tightly coupling that consciousness to nan personification acquisition is progressively important," he said. "If you deliberation astir these agentic systems for illustration quality AI collaborators, past nan quality and nan supplier request a communal connection and a communal interface that they activity on."

Also: Why endeavor AI agents could go nan eventual insider threat

Hron said this communal connection and interface should springiness humans valuable penetration into agentic thought processes and vice versa.

"This area is simply a caller and important UI experience, and I deliberation tightly coupling heavy method knowing of nan supplier pinch a bully personification acquisition is critical."

While galore experts talk astir nan value of human/agent coupling, Hron said nan cardinal to occurrence is straightforward: bringing teams successful nan business together.

"This process isn't technological -- it's astir forcing my designers to beryllium pinch information scientists and talk astir what's happening," he said. "The person we tin make those 2 sets of people, and nan much often they tin beryllium together, nan amended you person nan osmosis of reasoning crossed those 2 areas."

3. Develop proven capabilities

Despite immoderate hype that mightiness person you judge otherwise, Hron said professionals must admit that agents and nan models that powerfulness them are acold from omniscient.

Hron said AI models are improving crossed 3 dimensions: penning code, executing plans, and multi-step reasoning. The latest advances let exemplary capabilities to beryllium extended by different package tools.

"What that improvement intends for america arsenic a institution is much affirmative than negative, because it intends that, if we tin return each of these hundreds of applications that we've sold into nan marketplace for galore decades, and we tin decompose them, past we person proven capabilities for professionals," he said.

Also: 90% of AI projects neglect - present are 3 ways to guarantee yours doesn't

"If we tin decompose these elements arsenic devices for nan agent, past we're really extending nan capabilities of these models rather a lot, and that's really nan early of agents."

Rather than seeing agentic AI arsenic an omniscient exemplary that attempts to do everything nether nan sun, Hron advised professionals to springiness agents entree to proven capabilities group already use, which is simply a attraction of his team.

"We're looking astatine our systems and asking ourselves, 'OK, we've built this for a quality personification for many, galore years. Now, what ergonomics are required for an supplier to activity pinch this system? How do you accommodate nan process to beryllium conducive to moving pinch an agent, versus needfully a quality successful each cases? And what does that attack mean for really nan instrumentality looks, feels, and performs?'"

4. Look beyond nan firewall

Thomson Reuters Labs precocious launched nan Trust successful AI Alliance, a builder-led forum for elder AI researchers from Anthropic, AWS, Google Cloud, OpenAI, and Thomson Reuters to talk really spot is engineered into agentic systems. 

Hron said nan Alliance, which shares lessons publically to pass nan broader manufacture speech astir trustworthy AI, besides helps elder members of his squad to study champion practices from manufacture pioneers.

"We're trying to bring guardant a attraction for explainability and transparency successful position of really these models operate," he said.

Also: 5 ways you tin extremity testing AI and commencement scaling it responsibly

Hron said nan exertion pioneers and their models person importantly reduced nan clip and effort required to get from zero accuracy to 90%.

"But we're not successful nan 90% game," he said. "We're successful nan 99% and 99.9% game, and we must see really we get that other 9 aliases 2 nines of accuracy, which is nan quality for trust."

As portion of this process, Thomson Reuters is besides moving pinch world institutions. Late past year, nan institution announced a five-year business to create a associated Frontier AI Research Lab astatine Imperial College London. 

"In these initiatives, we're focused connected those past 2 nines of accuracy, because that's what group look to bargain from america for erstwhile we merchandise our products to market," said Hron.

"The frontier exertion organizations will proceed to push nan limits connected what's possible. But for us, nan separator is wherever nan competitory separator successful nan world of law, tax, and compliance is won and lost. And truthful that's what we really request to get right."

More