Bring Your Own AI

Your AI. Your keys. Your boundary.

WPLoadTester 7 uses your own Anthropic, OpenAI, Azure OpenAI, or AWS Bedrock key. Your enterprise contract already covers it, your security team already approved it, and your data stays wherever it already lives. No new vendor review. No AI markup.

Four providers, all of them yours

Paste a key. That's the integration. The AI call happens between WPLoadTester and the provider of your choice. WPI never sits in the middle, never sees your prompts, never stores your data.

Anthropic

Claude models direct via the Anthropic API. If your enterprise has an Anthropic contract, WPLoadTester just uses it.

Anthropic API ↗

OpenAI

GPT models direct via the OpenAI API. Same approval, same billing account, same audit trail you already have.

OpenAI Platform ↗

Azure OpenAI

GPT models through your own Azure OpenAI deployment. Your Azure subscription, your region, your Microsoft enterprise agreement and governance.

Azure OpenAI Service ↗

AWS Bedrock

Any model available in your AWS region: Claude on Bedrock, Llama, Titan, and more. Your AWS governance, your IAM policies, your VPC.

AWS Bedrock ↗
Configure AI Provider · WPLoadTester 7
The Configure AI Provider dialog. Bring your own key for Anthropic, OpenAI, Azure OpenAI, or AWS Bedrock.

We test the models, so you don't have to guess

Bring-your-own-AI only pays off if the model can actually do the work. We benchmark the AI Assistant against the auth and correlation patterns that break recordings in the real world: PKCE and OAuth logins, SAML federation, and JavaScript bot challenges. Then we tell you which models clear them from start to finish, so you can point your team at one that finishes the job instead of one that stalls halfway.

Claude Sonnet 4.6

Our top all-around pick. It clears every benchmark case end to end, and you can run it direct from Anthropic or through AWS Bedrock on your own AWS account.

GPT-5-class

Equally reliable on the hard cases. Use GPT-5.5 direct from OpenAI, or deploy a GPT-5-class model (the lower-cost gpt-5.4 works great) in your own Azure OpenAI resource.

Smaller or older models are fine for simple recordings and quick questions, and you are always free to use any model your provider offers. For the toughest correlation work, though, a validated model is the difference between a recording that replays clean and one that almost does.

What the AI sees once you've configured it

Once you've pasted your key, the AI Assistant lives inside WPLoadTester, analyzing your test cases, identifying auth patterns, and explaining what it's about to do before it does it.

Underneath the AI is a deterministic expert system battle-tested against thousands of real test cases. The rules for ASM correlation, dynamic-field extraction, and session-state recognition aren't being invented by the LLM each time. The AI decides which rules to apply and explains why; the expert system handles the precision work. That's why auto-configuration works on real-world test cases with hundreds of dynamic fields, instead of breaking on the third dynamic variable like a pure-LLM tool would. See the diagnostic playbook for the symptom taxonomy the AI consults when it analyzes results.

And it gets cheaper the more you use it. The rules the AI creates don't disappear when the session ends. Once it has worked out how to correlate a session token or extract a dynamic field on your application, the expert system keeps that rule. The next time you configure a test case against the same site, the expert system applies those saved rules automatically, so the AI isn't asked to solve the same configuration problems again. The system learns how your application behaves from past runs, which means less repeated AI work and lower token spend on top of the no-markup pricing.

WPLoadTester 7 AI Assistant identifying an OAuth2/OIDC + PKCE pattern in a pre-configured stock trading test case
WPLoadTester 7.0 with the bundled Stock Trades demo test case. The AI Assistant has correctly identified the OAuth2/OIDC + PKCE pattern, flagged the complexity, and broken the flow down into its six pages. This test case ships with every 7.0 install. Evaluators can reproduce the same assessment on their own machine.

When the AI hits something tricky, you see it think.

For harder configuration problems (extracting a JWT from a streaming Next.js payload, correlating dynamic variables across pages, debugging a regex that isn't matching), the AI walks through its attempts out loud instead of silently giving up. You see exactly what it tried, what it observed, and why it's refining.

AI Assistant in WPLoadTester 7 iteratively refining a regex to extract a JWT token from an escaped-JSON Next.js response, showing four successive attempts
Real AI Assistant transcript troubleshooting JWT extraction from a Next.js streaming response. Each message refines the previous attempt based on what it observed in the actual response body. The same loop a senior engineer would follow, happening in seconds.

And after the test runs, the AI interprets the results.

WPLoadTester 7's Load Test Analytics dashboard produces AI-written views of every run: a plain-English performance narrative, and a bottleneck analysis that picks out inflection points and traces root causes from the actual numbers.

WPLoadTester 7 Load Test Analytics Report tab with AI-written narrative analysis of performance degradation by page, and a Critical/Warning status table covering write-heavy, schema-loading, and config/token pages at baseline versus capacity
Load Test Analytics Report tab. The AI groups pages by behavior (write-heavy, schema-loading, config/token), flags Critical vs Warning based on actual degradation at your test's capacity, and explains why, in plain English, using the specific numbers from the run.
WPLoadTester 7 Load Test Analytics Analysis tab showing top 10 slowest transactions as a line chart across user levels from 3 to 303, with AI commentary in the right panel identifying a sudden cliff at 252 users and correlating UPSERT failures to failed disclosure_id variable extraction
Load Test Analytics Analysis tab. The AI picks out the inflection point (here, a 4× degradation cliff at 252 users) and traces the root cause: cascading 400 errors from failed variable extraction, not a server-side capacity limit. Actionable recommendations instead of a dashboard you have to interpret yourself.

See the full AI-generated report

The screenshots above are excerpts. Here's the complete report generated by the AI from a real e-commerce load test: narrative analysis, bottleneck breakdown, and per-page recommendations end-to-end.

Download sample report (PDF)

Why security and procurement will like it

Every objection a security review raises against "another SaaS AI vendor" evaporates when the AI is one you've already approved.

No new vendor review

Your security team has already vetted Anthropic, OpenAI, Microsoft Azure, or AWS. The path of an AI request is identical to what they approved.

Existing contracts apply

Enterprise pricing, volume commits, SLA, usage limits, cost-center tagging. Whatever you negotiated with your AI vendor carries straight through.

Data stays within your boundary

Prompts and responses travel between WPLoadTester and the provider you chose. WPI isn't in the request path and never sees the payload.

No AI markup

You pay what the provider charges. Token pricing is transparent. WPLoadTester doesn't add a margin on AI usage.

Bring your own agentic toolkit via MCP

WPLoadTester 7 also exposes its test cases, results, and AI workflow as an MCP (Model Context Protocol) server. Point any MCP-compatible agent at it (Claude Code, Cursor, your own CLI tool, anything you build) and drive the whole product from your terminal. The AI layer isn't a black box; it's extensible end-to-end.

This is the senior-engineer / agentic-CLI surface. If you'd rather work from a chat panel inside the app, see the in-app AI Assistant described above.

Any agent, any client

If it speaks MCP, it can drive WPLoadTester. Use the agentic workflow you already use for everything else instead of learning a new UI.

Multiple MCP servers, collaborating

Install other MCP servers alongside (a better image generator for prettier charts, your company's internal tools, whatever your agent needs) and let them work together on a single load-test workflow.

Your prompts, your standards

Write prompts from your corporate standards for what a performance report should include. The AI produces reports in your voice, with the metrics your stakeholders care about.

Customize every step

Test case configuration, data extraction, report format, chart style: all open-ended. Swap any component, pipe the output anywhere, script the whole thing.

Read the full MCP server feature page →

The Azure angle: Microsoft-first enterprises

If your company already runs on Microsoft (Entra ID, Azure subscriptions, an enterprise agreement), Azure OpenAI is the provider your team will approve fastest. The GPT models run in your own Azure OpenAI resource, in the region you choose, inside the subscription your security team already governs.

Microsoft's commitment is explicit: your prompts and completions are not used to train its models, and are not shared with OpenAI. Private networking, customer-managed encryption keys, and your existing compliance certifications (SOC 2, HIPAA, ISO 27001) carry over because the resource is yours, not a new vendor's. It's the same data-residency story as Bedrock, told in Azure terms.

The Bedrock angle: regulated and air-gapped buyers

If your organization is regulated (financial services, healthcare, federal) or operates in isolated AWS environments (GovCloud, dedicated regions), AWS Bedrock support is the wedge.

Bedrock runs inside your AWS account, subject to your IAM policies, within your compliance boundary. If Bedrock is already approved for your workloads, WPLoadTester using it for AI auto-configuration is the same data-residency story, not a new one. GovCloud regions and FedRAMP-authorized environments work identically.

If Bedrock isn't an option (fully air-gapped, no AWS), self-hosted AI is on the roadmap. Ollama, vLLM, and any OpenAI-compatible endpoint will plug into the same provider-paste workflow, just pointed at your own model server. Talk to us if that's blocking your evaluation.

Select Bedrock Model · Your AWS Account
  • ✓ anthropic.claude-opus-4-8
  •     anthropic.claude-sonnet-4-6
  •     anthropic.claude-haiku-4-5
  •     meta.llama4-maverick-17b-instruct-v1:0
  •     amazon.nova-pro-v1:0

Region: us-east-1 · IAM: arn:aws:iam::••••••:role/WPLoadTester

Selecting a Bedrock model. The list reflects whatever's enabled in your AWS account and region.
Coming in WPLoadTester 7.1: Real-browser load testing returns in the 7.1 release. WPLoadTester 7.0 focuses on AI-powered auto-configuration and HTTP-protocol virtual users.

Run AI-configured load tests on your own terms

Download the free single-machine edition. Includes a bundled AI sample so you can try auto-configuration before committing a production AI key.

Software

×

(1) 919-845-7601 9AM-5PM EST

Complete this form and we will get back to you as soon as possible. Please note: Technical support questions should be posted to our online support system.

About You
How Many Concurrent Users