Generative AI (Gen AI) is revolutionizing workflows by being strategically deployed at critical moments, maximizing effectiveness while keeping operational costs under control. This approach, often referred to as "just-in-time" AI, is drawing a mix of enthusiasm and skepticism from CIOs and IT leaders.


What Is Just-in-Time AI?

Just-in-time AI borrows its concept from the manufacturing sector, particularly the Japanese Kanban system, which focused on efficiency and timely execution. In the context of AI, this approach involves activating generative AI models precisely when they are needed, avoiding unnecessary costs and ensuring relevant real-time insights.

Sastry Durvasula, Chief Information and Client Services Officer at TIAA, explains how their "Research Buddy" AI tool exemplifies this principle. Used by Nuveen, TIAA's asset management arm, Research Buddy delivers insights from public documents only when requested, enhancing efficiency and relevance.

“The timeliness is critical. You don’t want to do the work too much in advance because you want that real-time context. We activate the AI just in time,” says Durvasula.

This strategy not only optimizes workflow but also addresses the high costs associated with generative AI processing.


The Cost Factor

Generative AI can be expensive to deploy, especially when used indiscriminately. Durvasula warns, "The cost of AI can be astronomically high and not always justified in terms of business value."

However, as Forrester analyst Mike Gualtieri points out, the cost of AI should be evaluated in context. For high-stakes scenarios where significant financial outcomes are involved, the investment in Gen AI may pale in comparison to the value it generates.

"If it costs you a million dollars and saves you $10 million, then cost should not hold you back," Gualtieri argues.

The key, he suggests, lies in knowing when cost should be a factor and when it should not, particularly when leveraging pre-trained large language models (LLMs) and retrieval-augmented generation (RAG) services.


Techniques to Reduce Costs

RAG services are a powerful tool to manage costs while improving the quality and relevance of generative AI outputs. These services allow enterprises to inject relevant data into pre-trained LLMs at the moment of need, avoiding expensive model training and over-reliance on high-cost data science talent.

“Vendors are providing built-in RAG solutions so enterprises won’t have to build them themselves,” notes Gualtieri. For example, Google’s RAG service allows businesses to integrate real-time data with pre-trained models seamlessly.


Case Study: SAIC’s Tenjin GPT

One standout example of just-in-time AI in action is SAIC’s Tenjin GPT, a generative AI platform deployed across its 24,000 employees. Built on Microsoft Azure and OpenAI, the platform is used to enhance workflows strategically, including:

  • IT service incident resolution

  • Customer service inquiries

  • AI-assisted software development

  • Data preparation and visualization

SAIC’s CIO, Nathan Rogers, emphasizes that this initiative aims to empower employees with AI-enabled tools that allow for timely, data-driven decision-making. "We will ultimately have citizen developers throughout the whole company who can get to a decision-making just-in-time moment," he states.


Challenges and Considerations

While just-in-time AI has clear advantages, it comes with its own challenges, including the high computational demands of generative AI models and the difficulty of ensuring bias-free, reliable outputs without a human-in-the-loop (HITL).

Max Chan, CIO of Avnet, critiques the term "just-in-time," suggesting that it might be better described as using the "right technique in the right places" to balance costs and efficiency.

Durvasula adds that responsible AI governance must be embedded in the system to ensure ethical and effective outcomes. In TIAA’s case, Nuveen analysts validate Research Buddy’s results before they are acted upon, providing an additional layer of quality assurance.


The Just-in-Case Perspective

For some workflows, "just-in-case" AI might be more appropriate. In scenarios like investment-driven decision-making, having insights readily available — even if not immediately needed — can be invaluable. Durvasula highlights the need for real-time personalization and low latency for such high-value use cases.


Lessons from Japanese Efficiency

Generative AI’s incremental implementation mirrors the revolutionary Japanese manufacturing techniques that focused on reducing inefficiencies in small but meaningful ways. Whether deployed as just-in-time, just-in-case, or part of a hybrid approach, success with AI depends on strategic planning, thoughtful execution, and a clear understanding of business value.

By striking the right balance, organizations can unlock AI’s potential to deliver transformative outcomes, one carefully timed deployment at a time.

Recent updates
Bio-Inspired Networking: Lessons from Nature in Designing Adaptive Systems

Bio-Inspired Networking: Lessons from Nature in Designing Adaptive Systems

In a world increasingly reliant on interconnected systems, traditional networking approaches are reaching their limits.

The Evolution of Mobile Network Operators: Pioneering the Future of Connectivity

The Evolution of Mobile Network Operators: Pioneering the Future of Connectivity

Mobile Network Operators are more than just service providers; they are enablers of a connected world.

The Dawn of 6G: Unlocking the Future of Hyper-Connectivity

The Dawn of 6G: Unlocking the Future of Hyper-Connectivity

As the world begins to harness the power of 5G, the tech industry is already setting its sights on the next frontier: 6G.

The Rise of Quantum Networks: Redefining the Future of Connectivity

The Rise of Quantum Networks: Redefining the Future of Connectivity

Quantum networks represent a paradigm shift in the way we think about communication and connectivity.

Still Thinking?
Give us a try!

We embrace agility in everything we do.
Our onboarding process is both simple and meaningful.
We can't wait to welcome you on AiDOOS!