• About
  • Subscribe
  • Contact
Tuesday, March 24, 2026
    Login
  • Management Leadership
    • Growth Strategies
    • Finance
    • Operations
    • Sales and Marketing
    • Careers
  • Technology
    • Infrastructure and Platforms
    • Business Applications and Databases
    • Big Data, Analytics and Intelligence
    • Security
  • Industry Verticals
    • Finance and Insurance
    • Manufacturing
    • Logistics and Transportation
    • Retail and Wholesale
    • Hospitality and Tourism
    • Government and Public Services
    • Utilities
    • Media and Telecommunications
  • Resources
    • Whitepapers
    • PodChats
    • Videos
  • Events
No Result
View All Result
  • Management Leadership
    • Growth Strategies
    • Finance
    • Operations
    • Sales and Marketing
    • Careers
  • Technology
    • Infrastructure and Platforms
    • Business Applications and Databases
    • Big Data, Analytics and Intelligence
    • Security
  • Industry Verticals
    • Finance and Insurance
    • Manufacturing
    • Logistics and Transportation
    • Retail and Wholesale
    • Hospitality and Tourism
    • Government and Public Services
    • Utilities
    • Media and Telecommunications
  • Resources
    • Whitepapers
    • PodChats
    • Videos
  • Events
No Result
View All Result
No Result
View All Result
Home AI and Machine Learning

F5, NVIDIA to boost AI inference efficiency and token economics

by FutureCIO Editors
March 24, 2026
Photo by Matthias Zomer: https://www.pexels.com/photo/low-angle-view-of-office-building-against-sky-313736/ CLOUD

Photo by Matthias Zomer: https://www.pexels.com/photo/low-angle-view-of-office-building-against-sky-313736/

F5 has announced expanded capabilities in its ongoing collaboration with NVIDIA, aiming to help enterprises and service providers improve the efficiency and economics of AI inference as adoption accelerates.

Kunal Anand
Kunal Anand

Kunal Anand, chief product officer, F5. Said: “Together with NVIDIA, we are enabling AI factories to treat token production as a measurable business metric. BIG-IP Next for Kubernetes provides the intelligence and governance required to increase GPU yield, reduce cost per token, and scale shared AI platforms confidently.”

Boosting AI inference efficiency and token economics

The integration combines F5’s BIG-IP Next for Kubernetes with NVIDIA’s BlueField-3 data processing units (DPUs). It aims to increase token throughput, reduce latency, and improve GPU utilisation.

In AI systems, the units of generated data, such as words or symbols called “tokens”, have emerged as a key performance metric shaping user experience and determining the return on costly GPU infrastructure. As a result, companies are increasingly focused on “token economics,” including throughput, time-to-first-token, and cost per token.

F5 claims that the enhanced platform uses real-time telemetry, including NVIDIA NIM statistics and GPU signals, to route workloads more efficiently before execution. The approach helps reduce delays and improve overall system performance by matching AI tasks to the most suitable compute resources.

Testing conducted by The Tolly Group revealed that the combined solution increased token throughput by up to 40%, reduced time to first token by 61%, and reduced request latency by 34%. Additionally, it frees up GPUs for ongoing inference workloads without requiring any modifications to existing AI models.

Kevin Deierling
Kevin Deierling

“NVIDIA’s accelerated computing infrastructure, coupled with F5’s AI-aware Application Delivery and Security Platform, unlocks superior AI factory tokenomics—delivering scalable and cost-effective inference without making any changes to the models,” said Kevin Deierling, SVP, Networking, NVIDIA. “Together, F5 and NVIDIA are empowering enterprises to scale AI factory inference efficiently and economically.”

Related:  Dell Technologies speeds business transformations with AI
Tags: Artificial Intelligencedigital transformationF5Nvidiatoken economics

FutureCIO Editors

No Result
View All Result

Recent Posts

  • F5, NVIDIA to boost AI inference efficiency and token economics
  • Workday brings superintelligence to work via new AI offering
  • Only 33% of Singaporean professionals report quantified ROI from AI, report finds
  • UiPath deploys agentic AI for PLDT Group’s enterprise risk management
  • Hitachi Vantara expands Hitachi iQ Capabilities for responsible AI deployments

Live Poll

Categories

  • AI and Machine Learning
  • Artificial Intelligence
  • Big Data, Analytics & Intelligence
  • Business Applications & Databases
  • Business-IT Alignment
  • Careers
  • Case Studies
  • CHRO
  • CISO
  • CISO strategies
  • Cloud, Platforms and Ecosystems
  • Cloud, Virtualization, Operating Environments and Middleware
  • Compliance and Governance
  • Compliance and Governance|Technology
  • Computer, Storage, Networks, Connectivity
  • Corporate Social Responsibility
  • Culture and Behaviour|People
  • Customer Experience / Engagement
  • Cyber risk management
  • Cyberattacks and data breaches
  • Cybersecurity careers
  • Cybersecurity operations
  • Data Protection
  • Digital Transformation
  • Education
  • Education
  • ESG and sustainability
  • Finance
  • Finance & Insurance
  • Future Workplace
  • FutureCISO
  • General
  • Governance, Risk and Compliance
  • Governance, Risk and Compliance
  • Governance, Standards and Regulations
  • Government and Public Services
  • Growth Strategies
  • Hospitality & Tourism
  • HR, education and Training
  • Industry Verticals
  • Infrastructure & Platforms
  • Insider threats
  • IT-OT integration
  • Latest Stories
  • Logistics & Transportation
  • Management Leadership
  • Manufacturing
  • Media and Telecommunications
  • News Stories
  • Operations
  • Opinion
  • Opinions
  • People
  • Process
  • Remote work
  • Retail & Wholesale
  • Sales & Marketing
  • Security
  • Sustainability
  • Tactics and Strategies
  • Technology
  • Utilities
  • Videos
  • Vulnerabilities and threats
  • White Papers

Strategic Insights for Chief Information Officers

FutureCIO is about enabling the CIO, his team, the leadership and the enterprise through shared expertise, know-how and experience - through a community of shared interests and goals. It is also about discovering unknown best practices that will help realize new business models.

Quick Links

  • Videos
  • Resources
  • Subscribe
  • Contact

Cxociety Media Brands

  • FutureIoT
  • FutureCFO
  • FutureCIO

Categories

  • Privacy Policy
  • Terms of Use
  • Cookie Policy

Copyright © 2022 Cxociety Pte Ltd | Designed by Pixl

Login to your account below

or

[wpli_login_link]

Not a member yet? Register here

Forgotten Password?

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Management Leadership
    • Growth Strategies
    • Finance
    • Operations
    • Sales and Marketing
    • Careers
  • Technology
    • Infrastructure and Platforms
    • Business Applications and Databases
    • Big Data, Analytics and Intelligence
    • Security
  • Industry Verticals
    • Finance and Insurance
    • Manufacturing
    • Logistics and Transportation
    • Retail and Wholesale
    • Hospitality and Tourism
    • Government and Public Services
    • Utilities
    • Media and Telecommunications
  • Resources
    • Whitepapers
    • PodChats
    • Videos
  • Events
Login

Copyright © 2022 Cxociety Pte Ltd | Designed by Pixl

Subscribe