Streaming with `promptlayer_client.run` is easy. Streaming allows the API to return responses incrementally, delivering partial outputs as they're generated rather than waiting for the complete response. This can significantly improve the perceived responsiveness of your application.
Each streamed chunk includes the updated `prompt_blueprint`, allowing you to track how the response is constructed in real time. The `request_id` is only included in the final chunk, indicating completion of the streaming response.
Learn more about OpenAI streams.
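A runnable sketch of consuming such a stream is below. The chunk dictionaries are simulated stand-ins for what `promptlayer_client.run(..., stream=True)` would yield, and the contents of each `prompt_blueprint` are illustrative; only the `prompt_blueprint`/`request_id` keys follow the description above.

```python
def consume_stream(chunks):
    """Collect each intermediate prompt_blueprint and the final request_id."""
    blueprints = []
    request_id = None
    for chunk in chunks:
        # Every chunk carries the blueprint as constructed so far.
        blueprints.append(chunk["prompt_blueprint"])
        # request_id is only present on the final chunk.
        if chunk.get("request_id") is not None:
            request_id = chunk["request_id"]
    return blueprints, request_id

# Simulated chunks; in real code these come from:
#   promptlayer_client.run(prompt_name="...", input_variables={...}, stream=True)
chunks = [
    {"prompt_blueprint": {"partial_text": "Hel"}, "request_id": None},
    {"prompt_blueprint": {"partial_text": "Hello"}, "request_id": None},
    {"prompt_blueprint": {"partial_text": "Hello!"}, "request_id": 101},
]

blueprints, request_id = consume_stream(chunks)
print(len(blueprints), request_id)  # 3 101
```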
While you can use `response["raw_response"]` to access the LLM response (as done in earlier code snippets), we recommend using the standardized `response["prompt_blueprint"]` instead.
Using it looks something like this:
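The `response` dictionary below is a mocked stand-in for a real `promptlayer_client.run` result, so the snippet runs without an API key; the exact message/content nesting shown is illustrative.

```python
# Mocked response; in real code this comes from promptlayer_client.run(...).
response = {
    "raw_response": None,  # the provider-specific payload lives here
    "prompt_blueprint": {
        "prompt_template": {
            "messages": [
                {"role": "user",
                 "content": [{"type": "text", "text": "Write a haiku."}]},
                {"role": "assistant",
                 "content": [{"type": "text", "text": "Waves fold into foam"}]},
            ]
        }
    },
}

# The last message in the blueprint is the model's reply, in the same
# standardized shape regardless of which provider served the request.
last_message = response["prompt_blueprint"]["prompt_template"]["messages"][-1]
print(last_message["content"][-1]["text"])  # Waves fold into foam
```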
The returned `prompt_blueprint` follows the same schema as the `prompt_template` return type of get-prompt-template.
If you're using a new model, make sure to add the new key to your `.env` file. Install `python-dotenv` and load the environment variables in `app.py`.
PromptLayer also lets you fine-tune a `gpt-3.5-turbo` model on more expensive `gpt-4` historical request data.
Be warned, fine-tuning is hard to get right. We wrote a blog post on why most teams should not rely on fine-tuning.