PromptLayer fits into your stack at three levels of sophistication:

  1. promptlayer_client.run – zero-setup SDK sugar
  2. Webhook-driven caching – maintain local cache of prompt templates
  3. Managed Agents – let PromptLayer orchestrate everything server-side


Use promptlayer_client.run (quickest path)

When developer time is at a premium, call promptlayer_client.run() directly from your application code.

  1. Fetch latest prompt – The SDK pulls the template (by version or release label) from PromptLayer.
  2. Execute – The SDK sends the populated prompt to OpenAI, Anthropic, Gemini, etc.
  3. Log – The raw request/response pair is saved back to PromptLayer.

# Fetch, render, execute, and log the "order-summary" prompt in one call
from promptlayer import PromptLayer

pl_client = PromptLayer(api_key="...")

response = pl_client.run(
    prompt_name="order-summary",           # prompt template managed in PromptLayer
    input_variables={"cart": cart_items},  # fills the template's variables
    prompt_release_label="prod"            # or pin a specific version instead
)

Under the hood

  1. SDK pulls the latest prompt (or the version/label you specify).
  2. Your client calls the model provider (OpenAI, Anthropic, Gemini, …).
  3. SDK writes the log back to PromptLayer.

💡 Tip – If latency is critical, enqueue the log to a background worker and let your request return immediately.


Cache prompts with Webhooks

Eliminate the extra round‑trip by replicating prompts into your own cache or database.

PromptLayer keeps that cache fresh through webhook events—no polling required.

Step‑by‑step

  1. Subscribe to webhooks in the UI

Read more about webhooks here.

  2. Maintain a local cache (a fuller runnable sketch follows this step list)
# Pseudocode – `db` is whatever datastore backs your prompt cache
def handle_pl_webhook(event):
    prompt = event["data"]
    # Upsert by template name so newer versions overwrite older ones
    db.prompts.upsert(prompt["prompt_template_name"], prompt)
  3. Serve traffic

prompt = db.prompts.get("order-summary")            # read from the local cache – no PromptLayer round-trip
llm_response = openai.chat.completions.create(...)  # call the provider with the cached template
queue.enqueue(track_to_promptlayer, llm_response)   # log asynchronously (see the tip below)
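
As referenced in step 2, a minimal runnable version of that handler might look like the sketch below. It assumes a Flask endpoint and a local SQLite cache; the route path, table schema, and payload shape are illustrative assumptions, not PromptLayer specifics.

# Webhook receiver sketch – assumptions: Flask, SQLite, payload shaped like the pseudocode above
import json
import sqlite3

from flask import Flask, request

app = Flask(__name__)
db = sqlite3.connect("prompt_cache.db", check_same_thread=False)
db.execute("CREATE TABLE IF NOT EXISTS prompts (name TEXT PRIMARY KEY, body TEXT)")

@app.route("/promptlayer/webhook", methods=["POST"])  # hypothetical route – register whatever URL you choose in the UI
def handle_pl_webhook():
    event = request.get_json()
    prompt = event["data"]  # assumed payload shape – check the webhook docs for the exact schema
    db.execute(
        "INSERT INTO prompts (name, body) VALUES (?, ?) "
        "ON CONFLICT(name) DO UPDATE SET body = excluded.body",
        (prompt["prompt_template_name"], json.dumps(prompt)),
    )
    db.commit()
    return "", 204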

Tip: Most teams push the track_to_promptlayer call onto a Redis or SQS queue so the request never blocks on logging.
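
If you go the queue route, the wiring can be as small as the sketch below; it assumes a local Redis instance and the rq package, and track_to_promptlayer is your own worker function (the exact PromptLayer logging call is left to the logging docs). The request path then enqueues it exactly as shown in step 3.

# Queue wiring sketch – assumptions: local Redis, the `rq` package installed
from redis import Redis
from rq import Queue

queue = Queue("promptlayer-logs", connection=Redis())

def track_to_promptlayer(llm_response):
    # Worker job: forward the raw request/response pair to PromptLayer.
    # The exact logging call and payload are omitted here – see the PromptLayer logging docs.
    ...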

Read the full guide: PromptLayer Webhooks ↗


Run fully-managed Agents

For complex workflows requiring orchestration, use PromptLayer’s managed agent infrastructure.

How it works

  1. Define multi-step workflows in PromptLayer’s Agent Builder
  2. Trigger agent execution via API
  3. Monitor execution on PromptLayer servers
  4. Receive results via webhook or polling (a receiver sketch follows the implementation snippet below)

PromptLayer handles all orchestration, parallelization, and model provider communication.

Implementation

from promptlayer import PromptLayer

promptlayer_client = PromptLayer(api_key="…")

execution = promptlayer_client.run_workflow(  # SDK method
    workflow_name="customer_support_agent",   # <- this is an Agent
    workflow_label_name="prod",
    input_variables={"ticket_id": 123}
)

Because execution is server-side, you inherit centralized tracing, cost analytics, and secure sandboxed tool-nodes without extra ops.
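
If you choose webhook delivery for results (step 4 above), the receiving side can be a small endpoint like the sketch below; the route and payload handling are illustrative assumptions, not PromptLayer specifics.

# Agent-result webhook receiver sketch – assumptions: Flask; inspect a real delivery for the exact payload
import json

from flask import Flask, request

app = Flask(__name__)

@app.route("/promptlayer/agent-results", methods=["POST"])  # hypothetical route you register for result delivery
def handle_agent_result():
    event = request.get_json()
    # Hand off to your own business logic here; for now just log the delivery.
    print(json.dumps(event, indent=2))
    return "", 204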

Learn more: Agents documentation ↗


Which pattern should I pick?

Requirement                | promptlayer_client.run | Webhook Cache | Managed Agent
---------------------------|------------------------|---------------|--------------
⏱️ extreme latency reqs     |                        | ✓             |
🛠 Single LLM call          | ✓                      | ✓             |
🌩 Complex plans / tools    |                        |               | ✓
👥 Non-eng prompt editors   | ✓                      | ✓             | ✓
🧰 Zero ops overhead        | ✓                      |               | ✓

Further reading 📚

  - PromptLayer Webhooks ↗
  - Agents documentation ↗

✉️ Need a hand? Ping us in Discord or email hello@promptlayer.com—happy to chat architecture!