Systems | Development | Analytics | API | Testing

LLM Cost Management: How to Implement AI Showback and Chargeback

Every enterprise moving AI into production is about to face a familiar problem in an unfamiliar form: the cost explosion, but for LLMs. This is *very *similar to what happened with cloud. In the early days of cloud, teams spun up infrastructure with no visibility into who was consuming what. Finance got the bill. Engineering got the blame. No one had the data to make good decisions. It took years of hard-won FinOps discipline to fix that. LLM spend is on the same trajectory *and moving faster*.

5 Best Practices for Securing Microservices at Scale

The microservices revolution promised agility and scalability. Teams could deploy faster, scale independently, and innovate without monolithic constraints. You gain speed and flexibility, but you also multiply trust boundaries, identities, network paths, and policy decisions. Then came AI, and everything changed. In 2025, the security reality for AI-integrated microservices is stark.

From Microservices to AI Traffic: Kong's Unified Control Plane When Architecture Gets Complicated

Modern enterprise architecture faces a three-body problem. Three distinct traffic patterns pull your teams in different directions. External APIs serve mobile apps and partner integrations. Internal microservices communicate within Kubernetes clusters. AI and LLM calls flow to OpenAI, AWS Bedrock, and self-hosted models. Each pattern looks API-like on the surface. Yet many organizations handle them with separate tools. The result?

Practical Strategies to Monetize AI APIs in Production

AI APIs don't get enough credit for how much weight they're actually carrying. These AI APIs aren't merely technical connectors. They're, in fact, cost drivers and potential revenue engines. And when something goes sideways, they're ground zero. In production, they behave nothing like the traditional APIs your teams have been running for years; they introduce a whole new set of hurdles around operations, security, and governance that most organizations are still struggling to understand.

Connecting Kong and Solace: Building Smarter Event-Driven APIs

Bringing APIs and events together has always been a challenge. REST APIs give developers a familiar interface, while event brokers like Solace Broker excel at fan-out, filtering, and scalable, reliable event delivery. The tricky part? Bridging these two worlds without building a lot of custom glue. That’s exactly what the new Kong plugin for Solace upstream mediation does.

Kong Simplifies Multicloud Cloud Gateways with Managed Redis Cache

As enterprises race to deploy multicloud architectures and Agentic AI, they face a common bottleneck: "state." To govern AI token usage, manage agent-to-agent communication, or optimize performance via caching, API and AI gateways require a persistence layer to synchronize data. We’re excited to share the GA of Managed Redis cache for Kong Dedicated Cloud Gateways (DCGW).

Configuring Kong Dedicated Cloud Gateways with Managed Redis in a Multi-Cloud Environment

A persistent challenge arises as businesses adopt multicloud architectures and agentic AI: the need for state synchronization. API and AI gateways require a robust persistence layer to synchronize data, whether it's for governing AI token usage, facilitating agent-to-agent communication, or boosting performance through caching.