Skip to main content

AI-Models

The AI-Models page is where an administrator configures the chat models that msg.ZenTestAI can use. Multiple models can be configured per tenant — typically a cheap one for high-volume tasks (element identification, simple decisions), a vision-capable one for assertions, and a high-end reasoning model for the AI Assistant.

Models are configured per tenant (product). There is no global model registry.

Where to configure

Configure AI models in Administration → Select your Tenant → AI tab. The page lists every configured model and lets you add new ones with the + action.

After saving, configured models become selectable in:

  • The default-model fields on Tenant Settings (assertions, element identification, step identification, agents).
  • The per-step AI model override on the Steps tab.

Supported vendors

msg.ZenTestAI ships connectors for seven vendor platforms. Pick a vendor first; the form then adapts to show only the fields relevant to that vendor.

VendorUse for
OpenAIOpenAI's public API (api.openai.com or a compatible reverse proxy).
Azure OpenAIOpenAI models hosted in your Azure subscription as deployments.
AnthropicAnthropic's public API (api.anthropic.com or compatible relay).
AWS BedrockAnthropic Claude (and others) hosted via AWS Bedrock.
Google GeminiGoogle's public Gemini API (generativelanguage.googleapis.com).
Google Vertex AIGemini hosted on Google Cloud Vertex AI.
Azure AI StudioLlama-family and other models served from Azure AI Studio deployments.

Supported models

Each vendor exposes a curated list of base models. The list below is the current set at the time of writing — the actual dropdown in your tenant reflects what the platform ships.

OpenAI / Azure OpenAI

Base modelNotes
gpt-5, gpt-5.2Latest GPT-5 generation. Reasoning-capable.
gpt-5-mini, gpt-5-nanoLower-cost GPT-5 variants. Reasoning-capable.
gpt-4.1Mid-tier reasoning model.
gpt-4.1-mini, gpt-4.1-nanoLower-cost 4.1 variants.
gpt-4oGeneral-purpose, vision-capable.
gpt-4o-miniLow-cost, vision-capable.
gpt-o3, gpt-o3-mini, gpt-o4-miniDedicated reasoning models.
openai-generic / openai-reasoning-generic"BYO model name" — use this when you want to point at an OpenAI-compatible model that is not in the curated list.
azure-openai-generic / azure-openai-reasoning-genericSame idea, but for Azure-hosted deployments.

Anthropic / AWS Bedrock

Base modelNotes
claude-4-6-opusCurrent flagship Claude. Reasoning-capable. Recommended for the AI Assistant.
claude-4-6-sonnetCurrent high-quality Claude. Reasoning-capable.
claude-4-5-sonnetPrevious-generation Sonnet. Reasoning-capable.
claude-4-5-haikuCurrent low-cost Claude. Reasoning-capable.
claude-3-7-sonnetOlder Sonnet (still supported).
claude-3-5-sonnetVision-capable, widely deployed.
claude-3-haikuLow-cost, no reasoning.
bedrock-generic / bedrock-reasoning-generic"BYO model id" — use this to point at any Bedrock-hosted model by its provider id.

The same Claude models are reachable via Anthropic (direct API) and AWS Bedrock (managed).

Google Gemini / Vertex AI

Base modelNotes
gemini-3.1-proCurrent flagship Gemini. Reasoning-capable.
gemini-3.1-flash-liteCurrent low-cost Gemini. Reasoning-capable.
gemini-3-pro, gemini-3-flashGemini 3.0 family.
gemini-2.5-proHigh-quality, reasoning-capable.
gemini-2.5-flash, gemini-2.5-flash-liteFast, low-cost variants.
gemini-2.0-pro, gemini-2.0-flashEarlier 2.x generation.
gemini-1.5-proLegacy, still supported.
vertex-generic / vertex-reasoning-generic"BYO model id" for Vertex deployments not in the curated list.

Azure AI Studio

Base modelNotes
llama-3-2-90b-vision-instructorVision-capable Llama for assertions.
azure-ai-studio-generic / azure-ai-studio-reasoning-generic"BYO model id" for any Azure AI Studio deployment.
tip

The *-generic and *-reasoning-generic entries are the supported way to use a model that isn't in the curated list — for instance a brand-new vendor release that has not yet been added by name. Pick the generic entry of the right vendor and put the actual provider model id into the Deployment Name field. Use the -reasoning- variant if the underlying model supports reasoning / extended thinking.

Model capabilities

Some capabilities are flagged on the model:

  • Reasoning — the model supports extended thinking. Required for the AI Assistant; recommended for step identification on complex applications. All gpt-5*, gpt-o*, Claude 4.x, Gemini 2.5+, and the *-reasoning-generic entries are reasoning-capable.
  • Vision — the model can interpret a screenshot. Required for the Default AI Model for Assertions in Tenant Settings. GPT-4o / 4o-mini, Claude 3.5+, and the Llama-3.2-90B-Vision-Instructor are vision-capable; all current Gemini models are vision-capable too.
  • Low-cost — used to flag cheap models (currently gpt-4o-mini, claude-3-haiku, the various Gemini -flash-lite variants). The platform doesn't enforce anything based on this flag; it's a hint for admins picking sensible defaults.

Configuration form

When you add or edit a model, the form is split into four sections.

Basic info

FieldDescription
IDA unique name for this model entry inside msg.ZenTestAI. Used everywhere a model is selected; pick something descriptive (e.g. claude-4-6-opus-eu, gpt-5-cheap). Required.
AI-Model VendorThe vendor connector to use. Required. Determines which fields appear below.
Base AI ModelThe base model from the curated list (see above). Required.

Once a model is saved, a Test the connection button is shown next to the ID. Pressing it calls the configured vendor with the stored credentials and reports success or the exact error returned by the vendor.

Authentication

The authentication section adapts to the vendor:

VendorFieldNotes
OpenAIAPI-KeyYour OpenAI API key.
Azure OpenAIAPI-KeyThe key of your Azure OpenAI resource.
AnthropicAPI-KeyYour Anthropic API key.
Google GeminiAPI-KeyA Google AI Studio API key.
Azure AI StudioAPI-KeyThe key from the Azure AI Studio deployment.
AWS BedrockCredentialsJSON containing AWS region and access keys for the IAM principal that may call Bedrock.
Google Vertex AICredentialsThe JSON of a Google Cloud service account that has Vertex AI permissions.

API keys and credentials are stored encrypted and never displayed back in plain text.

Configuration

The configuration section is the most vendor-specific part of the form:

FieldRequired forWhat it is
Deployment NameAll vendorsThe provider-side name of the deployment / model. For OpenAI this is typically the model name (gpt-5); for Azure it is the deployment name you created in your Azure portal; for AWS Bedrock the Bedrock model id (e.g. anthropic.claude-3-5-sonnet-20241022-v2:0).
API-VersionAzure OpenAI, Azure AI Studio (and optional for Vertex AI / Gemini)The API version string of the vendor (e.g. 2024-08-01-preview for Azure OpenAI).
API-Instance-NameAzure OpenAIThe Azure OpenAI resource name (the one that goes into the URL {instance}.openai.azure.com).
API-Base-PathOptional for most vendorsCustom endpoint URL — useful when you front the vendor with a reverse proxy or use a regional endpoint.

The form hides fields that don't apply to the selected vendor, so you only see what you need to fill in.

Advanced settings

The advanced section is collapsed by default. It contains:

  • Fallback Model — pick another configured model to use when this one fails (connection error, rate limit, vendor outage). Useful when you have a low-cost primary model and a higher-quality backup. Leave empty if you don't want a fallback.
  • Reasoning Effort — shown only when the selected base model is reasoning-capable. Lets you set how much reasoning the model spends per call: none, low, medium, high, or auto. Higher effort improves quality on complex steps but costs more tokens and runs slower.
  • Input Cost Per Million Tokens / Output Cost Per Million Tokens — optional. Fill these in so that test executions and AI Assistant sessions can show a meaningful cost estimate. The numbers are in USD per million tokens, matching the format vendors publish on their pricing pages.

Lifecycle

Saving

Saving validates the form and stores the encrypted credentials. The vendor connection itself is not validated on save — use Test the connection after saving to verify that the credentials and configuration actually work end-to-end.

Deleting

Models can be deleted from the AI tab at any time. There is no safeguard preventing deletion of a model that is currently referenced (as a tenant default, as a per-step override, as the default model for agents). If you delete a model that's in use, the tests that reference it will fail at execution time with a "model not found" error. To be safe, swap references to another model first and then delete the entry.

Reordering

The list of models can be reordered via drag and drop. This only changes the display order — it does not change which model is the default for anything.

Picking the right model for each task

The platform uses several model defaults that are configured separately in Tenant Settings. The recommendations below are starting points:

Task / settingRecommended model class
Default model for element identificationA cheap model with a large context window (gpt-4o-mini, claude-3-haiku, a Gemini flash variant).
Default model for assertionsA vision-capable model. gpt-4o, claude-3-5-sonnet, or a high-quality Gemini.
Default model for step identificationA high-quality reasoning model — gpt-5, claude-4-6-sonnet, gemini-3-pro.
Default model for the AI AssistantA top-tier reasoning model — Claude Opus or GPT-5+ ("Copilot"). See AI Assistant → Recommended models.
tip

If you set a low-cost model as the default for element identification, the runner will automatically escalate to a higher-quality model for steps that need it. You don't have to oversize the default just to handle the occasional difficult step.

Migration notes

  • The platform retains older model entries (Claude 3.x, Gemini 1.5, GPT-4o) for backward compatibility. Existing test configurations continue to work, but for new tests prefer the current-generation models listed above.
  • When a vendor introduces a new model that isn't in the curated dropdown yet, use the corresponding *-generic entry and provide the new model id in Deployment Name.
  • If you have any other vendor or model not covered above, contact us at hello@zentest.ai.