Azure.AI.OpenAI
2.0.0-beta.4
Prefix Reserved
See the version list below for details.
dotnet add package Azure.AI.OpenAI --version 2.0.0-beta.4
NuGet\Install-Package Azure.AI.OpenAI -Version 2.0.0-beta.4
<PackageReference Include="Azure.AI.OpenAI" Version="2.0.0-beta.4" />
paket add Azure.AI.OpenAI --version 2.0.0-beta.4
#r "nuget: Azure.AI.OpenAI, 2.0.0-beta.4"
// Install Azure.AI.OpenAI as a Cake Addin #addin nuget:?package=Azure.AI.OpenAI&version=2.0.0-beta.4&prerelease // Install Azure.AI.OpenAI as a Cake Tool #tool nuget:?package=Azure.AI.OpenAI&version=2.0.0-beta.4&prerelease
Azure OpenAI client library for .NET
The Azure OpenAI client library for .NET is a companion to the official OpenAI client library for .NET. The Azure OpenAI library configures a client for use with Azure OpenAI and provides additional strongly typed extension support for request and response models specific to Azure OpenAI scenarios.
Azure OpenAI is a managed service that allows developers to deploy, tune, and generate content from OpenAI models on Azure resources.
Source code | Package (NuGet) | API reference documentation | Product documentation | Samples
Getting started
Prerequisites
To use an Azure OpenAI resource, you must have:
These prerequisites allow you to create an Azure OpenAI resource and get both a connection URL and API keys. For more information, see Quickstart: Get started generating text using Azure OpenAI Service.
Install the package
Install the client library for .NET with NuGet:
dotnet add package Azure.AI.OpenAI --prerelease
The Azure.AI.OpenAI
package builds on the official OpenAI package, which is included as a dependency.
Authenticate the client
To interact with Azure OpenAI or OpenAI, create an instance of AzureOpenAIClient with one of the following approaches:
Create client with a Microsoft Entra credential
A secure, keyless authentication approach is to use Microsoft Entra ID (formerly Azure Active Directory) via the Azure Identity library. To use the library:
Install the Azure.Identity package:
dotnet add package Azure.Identity
Use the desired credential type from the library. For example, DefaultAzureCredential:
AzureOpenAIClient azureClient = new(
new Uri("https://your-azure-openai-resource.com"),
new DefaultAzureCredential());
ChatClient chatClient = azureClient.GetChatClient("my-gpt-4o-mini-deployment");
Configure client for Azure sovereign cloud**
If your Microsoft Entra credentials are issued by an entity other than Azure Public Cloud, you can set the Audience
property on OpenAIClientOptions
to modify the token authorization scope used for requests.
For example, the following will configure the client to authenticate tokens via Azure Government Cloud, using https://cognitiveservices.azure.us/.default
as the authorization scope:
AzureOpenAIClientOptions options = new()
{
Audience = AzureOpenAIAudience.AzureGovernment,
};
AzureOpenAIClient azureClient = new(
new Uri("https://your-azure-openai-resource.com"),
new DefaultAzureCredential());
ChatClient chatClient = azureClient.GetChatClient("my-gpt-4o-mini-deployment");
For a custom or non-enumerated value, the authorization scope can be provided directly as the value for Audience
:
AzureOpenAIClientOptions optionsWithCustomAudience = new()
{
Audience = "https://cognitiveservices.azure.com/.default",
};
Create client with an API key
While not as secure as Microsoft Entra-based authentication, it's possible to authenticate using a client subscription key:
string keyFromEnvironment = Environment.GetEnvironmentVariable("AZURE_OPENAI_API_KEY");
AzureOpenAIClient azureClient = new(
new Uri("https://your-azure-openai-resource.com"),
new AzureKeyCredential(keyFromEnvironment));
ChatClient chatClient = azureClient.GetChatClient("my-gpt-35-turbo-deployment");
Key concepts
Assistants
See OpenAI's Assistants API overview.
Audio transcription/translation and text-to-speech generation
See OpenAI Capabilities: Speech to text.
Batch
Chat completion
Chat models take a list of messages as input and return a model-generated message as output. Although the chat format is designed to make multi-turn conversations easy, it's also useful for single-turn tasks without any conversation.
See OpenAI Capabilities: Chat completion.
Image generation
See OpenAI Capabilities: Image generation.
Files
See OpenAI's Files API reference.
Text embeddings
See OpenAI Capabilities: Embeddings.
Thread safety
We guarantee that all client instance methods are thread-safe and independent of each other (guideline). This ensures that the recommendation of reusing client instances is always safe, even across threads.
Additional concepts
Client options | Accessing the response | Long-running operations | Handling failures | Diagnostics | Mocking | Client lifetime
Examples
You can familiarize yourself with different APIs using Samples from OpenAI's .NET library or Azure.AI.OpenAI-specific samples. Most OpenAI capabilities are available on both Azure OpenAI and OpenAI using the same scenario clients and methods, so not all scenarios are redundantly covered here.
Get a chat completion
AzureOpenAIClient azureClient = new(
new Uri("https://your-azure-openai-resource.com"),
new DefaultAzureCredential());
ChatClient chatClient = azureClient.GetChatClient("my-gpt-35-turbo-deployment");
ChatCompletion completion = chatClient.CompleteChat(
[
// System messages represent instructions or other guidance about how the assistant should behave
new SystemChatMessage("You are a helpful assistant that talks like a pirate."),
// User messages represent user input, whether historical or the most recen tinput
new UserChatMessage("Hi, can you help me?"),
// Assistant messages in a request represent conversation history for responses
new AssistantChatMessage("Arrr! Of course, me hearty! What can I do for ye?"),
new UserChatMessage("What's the best way to train a parrot?"),
]);
Console.WriteLine($"{completion.Role}: {completion.Content[0].Text}");
Stream chat messages
Streaming chat completions use the CompleteChatStreaming
and CompleteChatStreamingAsync
method, which return a ResultCollection<StreamingChatCompletionUpdate>
or AsyncCollectionResult<StreamingChatCompletionUpdate>
instead of a ClientResult<ChatCompletion>
. These result collections can be iterated over using foreach
or await foreach
, with each update arriving as new data is available from the streamed response.
AzureOpenAIClient azureClient = new(
new Uri("https://your-azure-openai-resource.com"),
new DefaultAzureCredential());
ChatClient chatClient = azureClient.GetChatClient("my-gpt-35-turbo-deployment");
CollectionResult<StreamingChatCompletionUpdate> completionUpdates = chatClient.CompleteChatStreaming(
[
new SystemChatMessage("You are a helpful assistant that talks like a pirate."),
new UserChatMessage("Hi, can you help me?"),
new AssistantChatMessage("Arrr! Of course, me hearty! What can I do for ye?"),
new UserChatMessage("What's the best way to train a parrot?"),
]);
foreach (StreamingChatCompletionUpdate completionUpdate in completionUpdates)
{
foreach (ChatMessageContentPart contentPart in completionUpdate.ContentUpdate)
{
Console.Write(contentPart.Text);
}
}
Use chat tools
Tools extend chat completions by allowing an assistant to invoke defined functions and other capabilities in the process of fulfilling a chat completions request. To use chat tools, start by defining a function tool. Here, we root the tools in local methods for clarity and convenience:
static string GetCurrentLocation()
{
// Call the location API here.
return "San Francisco";
}
static string GetCurrentWeather(string location, string unit = "celsius")
{
// Call the weather API here.
return $"31 {unit}";
}
ChatTool getCurrentLocationTool = ChatTool.CreateFunctionTool(
functionName: nameof(GetCurrentLocation),
functionDescription: "Get the user's current location"
);
ChatTool getCurrentWeatherTool = ChatTool.CreateFunctionTool(
functionName: nameof(GetCurrentWeather),
functionDescription: "Get the current weather in a given location",
functionParameters: BinaryData.FromString("""
{
"type": "object",
"properties": {
"location": {
"type": "string",
"description": "The city and state, e.g. Boston, MA"
},
"unit": {
"type": "string",
"enum": [ "celsius", "fahrenheit" ],
"description": "The temperature unit to use. Infer this from the specified location."
}
},
"required": [ "location" ]
}
""")
);
With the tool defined, include that new definition in the options for a chat completions request:
ChatCompletionOptions options = new()
{
Tools = { getCurrentLocationTool, getCurrentWeatherTool },
};
List<ChatMessage> conversationMessages =
[
new UserChatMessage("What's the weather like in Boston?"),
];
ChatCompletion completion = chatClient.CompleteChat(conversationMessages);
When the assistant decides that one or more tools should be used, the response message includes one or more "tool calls" that must all be resolved via "tool messages" on the subsequent request. This resolution of tool calls into new request messages can be thought of as a sort of "callback" for chat completions.
To provide tool call resolutions to the assistant to allow the request to continue, provide all prior historical context -- including the original system and user messages, the response from the assistant that included the tool calls, and the tool messages that resolved each of those tools -- when making a subsequent request.
// Purely for convenience and clarity, this standalone local method handles tool call responses.
string GetToolCallContent(ChatToolCall toolCall)
{
if (toolCall.FunctionName == getCurrentWeatherTool.FunctionName)
{
// Validate arguments before using them; it's not always guaranteed to be valid JSON!
try
{
using JsonDocument argumentsDocument = JsonDocument.Parse(toolCall.FunctionArguments);
if (!argumentsDocument.RootElement.TryGetProperty("location", out JsonElement locationElement))
{
// Handle missing required "location" argument
}
else
{
string location = locationElement.GetString();
if (argumentsDocument.RootElement.TryGetProperty("unit", out JsonElement unitElement))
{
return GetCurrentWeather(location, unitElement.GetString());
}
else
{
return GetCurrentWeather(location);
}
}
}
catch (JsonException)
{
// Handle the JsonException (bad arguments) here
}
}
// Handle unexpected tool calls
throw new NotImplementedException();
}
if (completion.FinishReason == ChatFinishReason.ToolCalls)
{
// Add a new assistant message to the conversation history that includes the tool calls
conversationMessages.Add(new AssistantChatMessage(completion));
foreach (ChatToolCall toolCall in completion.ToolCalls)
{
conversationMessages.Add(new ToolChatMessage(toolCall.Id, GetToolCallContent(toolCall)));
}
// Now make a new request with all the messages thus far, including the original
}
When using tool calls with streaming responses, accumulate tool call details much like you'd accumulate the other
portions of streamed choices, in this case using the accumulated StreamingToolCallUpdate
data to instantiate new
tool call messages for assistant message history. Note that the model will ignore ChoiceCount
when providing tools
and that all streamed responses should map to a single, common choice index in the range of [0..(ChoiceCount - 1)]
.
Dictionary<int, string> toolCallIdsByIndex = [];
Dictionary<int, string> functionNamesByIndex = [];
Dictionary<int, StringBuilder> functionArgumentBuildersByIndex = [];
StringBuilder contentBuilder = new();
foreach (StreamingChatCompletionUpdate streamingChatUpdate
in chatClient.CompleteChatStreaming(conversationMessages, options))
{
foreach (ChatMessageContentPart contentPart in streamingChatUpdate.ContentUpdate)
{
contentBuilder.Append(contentPart.Text);
}
foreach (StreamingChatToolCallUpdate toolCallUpdate in streamingChatUpdate.ToolCallUpdates)
{
if (!string.IsNullOrEmpty(toolCallUpdate.Id))
{
toolCallIdsByIndex[toolCallUpdate.Index] = toolCallUpdate.Id;
}
if (!string.IsNullOrEmpty(toolCallUpdate.FunctionName))
{
functionNamesByIndex[toolCallUpdate.Index] = toolCallUpdate.FunctionName;
}
if (!string.IsNullOrEmpty(toolCallUpdate.FunctionArgumentsUpdate))
{
StringBuilder argumentsBuilder
= functionArgumentBuildersByIndex.TryGetValue(toolCallUpdate.Index, out StringBuilder existingBuilder)
? existingBuilder
: new();
argumentsBuilder.Append(toolCallUpdate.FunctionArgumentsUpdate);
functionArgumentBuildersByIndex[toolCallUpdate.Index] = argumentsBuilder;
}
}
}
List<ChatToolCall> toolCalls = [];
foreach (KeyValuePair<int, string> indexToIdPair in toolCallIdsByIndex)
{
toolCalls.Add(ChatToolCall.CreateFunctionToolCall(
indexToIdPair.Value,
functionNamesByIndex[indexToIdPair.Key],
functionArgumentBuildersByIndex[indexToIdPair.Key].ToString()));
}
conversationMessages.Add(new AssistantChatMessage(toolCalls, contentBuilder.ToString()));
// Placeholder: each tool call must be resolved, like in the non-streaming case
string GetToolCallOutput(ChatToolCall toolCall) => null;
foreach (ChatToolCall toolCall in toolCalls)
{
conversationMessages.Add(new ToolChatMessage(toolCall.Id, GetToolCallOutput(toolCall)));
}
// Repeat with the history and all tool call resolution messages added
Use your own data with Azure OpenAI
The use your own data feature is unique to Azure OpenAI and won't work with a client configured to use the non-Azure service. See the Azure OpenAI using your own data quickstart for conceptual background and detailed setup instructions.
NOTE: The concurrent use of Chat Functions and Azure Chat Extensions on a single request isn't yet supported. Supplying both will result in the Chat Functions information being ignored and the operation behaving as if only the Azure Chat Extensions were provided. To address this limitation, consider separating the evaluation of Chat Functions and Azure Chat Extensions across multiple requests in your solution design.
// Extension methods to use data sources with options are subject to SDK surface changes. Suppress the
// warning to acknowledge and this and use the subject-to-change AddDataSource method.
#pragma warning disable AOAI001
ChatCompletionOptions options = new();
options.AddDataSource(new AzureSearchChatDataSource()
{
Endpoint = new Uri("https://your-search-resource.search.windows.net"),
IndexName = "contoso-products-index",
Authentication = DataSourceAuthentication.FromApiKey(
Environment.GetEnvironmentVariable("OYD_SEARCH_KEY")),
});
ChatCompletion completion = chatClient.CompleteChat(
[
new UserChatMessage("What are the best-selling Contoso products this month?"),
],
options);
AzureChatMessageContext onYourDataContext = completion.GetAzureMessageContext();
if (onYourDataContext?.Intent is not null)
{
Console.WriteLine($"Intent: {onYourDataContext.Intent}");
}
foreach (AzureChatCitation citation in onYourDataContext?.Citations ?? [])
{
Console.WriteLine($"Citation: {citation.Content}");
}
Use Assistants and stream a run
Assistants provide a stateful, service-persisted conversational model that can be enriched with a larger array of tools than Chat Completions.
Creating an AssistantClient
is similar to other scenario clients. An important difference is that Assistants features
are marked as [Experimental]
to reflect the API's beta status, and thus you'll need to suppress the corresponding
warning to instantiate a client. This can be done in the .csproj
file via the <NoWarn>
element or, as below, in
the code itself with a #pragma
directive.
AzureOpenAIClient azureClient = new(
new Uri("https://your-azure-openai-resource.com"),
new DefaultAzureCredential());
// The Assistants feature area is in beta, with API specifics subject to change.
// Suppress the [Experimental] warning via .csproj or, as here, in the code to acknowledge.
#pragma warning disable OPENAI001
AssistantClient assistantClient = azureClient.GetAssistantClient();
With a client, you can then create Assistants, Threads, and new Messages on a thread in preparation to start a run. As is the case for other shared API surfaces, you should use an Azure OpenAI model deployment name wherever a model name is requested.
Assistant assistant = await assistantClient.CreateAssistantAsync(
model: "my-gpt-4o-deployment",
new AssistantCreationOptions()
{
Name = "My Friendly Test Assistant",
Instructions = "You politely help with math questions. Use the code interpreter tool when asked to "
+ "visualize numbers.",
Tools = { ToolDefinition.CreateCodeInterpreter() },
});
ThreadInitializationMessage initialMessage = new(
MessageRole.User,
[
"Hi, Assistant! Draw a graph for a line with a slope of 4 and y-intercept of 9."
]);
AssistantThread thread = await assistantClient.CreateThreadAsync(new ThreadCreationOptions()
{
InitialMessages = { initialMessage },
});
You can then start a run and stream updates as they arrive using the Streaming
method variants, handling the updates
you're interested in using the enumerated kind of event it is and/or one of the several derived types for the streaming
update class, as shown here for content:
RunCreationOptions runOptions = new()
{
AdditionalInstructions = "When possible, talk like a pirate."
};
await foreach (StreamingUpdate streamingUpdate
in assistantClient.CreateRunStreamingAsync(thread, assistant, runOptions))
{
if (streamingUpdate.UpdateKind == StreamingUpdateReason.RunCreated)
{
Console.WriteLine($"--- Run started! ---");
}
else if (streamingUpdate is MessageContentUpdate contentUpdate)
{
Console.Write(contentUpdate.Text);
if (contentUpdate.ImageFileId is not null)
{
Console.WriteLine($"[Image content file ID: {contentUpdate.ImageFileId}");
}
}
}
Remember that things like Assistants, Threads, and Vector Stores are persistent resources. You can save their IDs to reuse them later or, as demonstrated below, delete them when no longer desired.
// Optionally, delete persistent resources that are no longer needed.
_ = await assistantClient.DeleteAssistantAsync(assistant);
_ = await assistantClient.DeleteThreadAsync(thread);
Next steps
Troubleshooting
When you interact with Azure OpenAI using the .NET SDK, errors returned by the service correspond to the same HTTP status codes returned for REST API requests.
For example, if you try to create a client using an endpoint that doesn't match your Azure OpenAI Resource endpoint, a 404
error is returned, indicating Resource Not Found
.
Contributing
See the OpenAI CONTRIBUTING.md for details on building, testing, and contributing to this library.
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit cla.microsoft.com.
When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information, see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- Azure.Core (>= 1.42.0)
- OpenAI (>= 2.0.0-beta.10)
NuGet packages (76)
Showing the top 5 NuGet packages that depend on Azure.AI.OpenAI:
Package | Downloads |
---|---|
Omnia.Fx.Models
Package Description |
|
ImmediaC.SimpleCms
ASP.NET Core based CMS |
|
Microsoft.Azure.Workflows.WebJobs.Extension
Extensions for running workflows in Azure Functions |
|
Microsoft.SemanticKernel.Connectors.AzureOpenAI
Semantic Kernel connectors for Azure OpenAI. Contains clients for chat completion, embedding and DALL-E text to image. |
|
LangChain.Providers.Azure
OpenAI API LLM and Chat model provider. |
GitHub repositories (26)
Showing the top 5 popular GitHub repositories that depend on Azure.AI.OpenAI:
Repository | Stars |
---|---|
microsoft/PowerToys
Windows system utilities to maximize productivity
|
|
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
|
|
Azure/azure-sdk-for-net
This repository is for active development of the Azure SDK for .NET. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/dotnet/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-net.
|
|
dotnet/aspire
Tools, templates, and packages to accelerate building observable, production-ready apps
|
|
Azure-Samples/cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
|
Version | Downloads | Last updated |
---|---|---|
2.1.0-beta.2 | 0 | 11/5/2024 |
2.1.0-beta.1 | 61,147 | 10/2/2024 |
2.0.0 | 60,877 | 10/1/2024 |
2.0.0-beta.6 | 14,638 | 9/23/2024 |
2.0.0-beta.5 | 110,805 | 9/4/2024 |
2.0.0-beta.4 | 63,227 | 8/30/2024 |
2.0.0-beta.3 | 26,436 | 8/24/2024 |
2.0.0-beta.2 | 230,168 | 6/15/2024 |
2.0.0-beta.1 | 29,221 | 6/7/2024 |
1.0.0-beta.17 | 788,986 | 5/3/2024 |
1.0.0-beta.16 | 195,711 | 4/12/2024 |
1.0.0-beta.15 | 482,599 | 3/20/2024 |
1.0.0-beta.14 | 340,507 | 3/4/2024 |
1.0.0-beta.13 | 521,141 | 2/1/2024 |
1.0.0-beta.12 | 607,884 | 12/15/2023 |
1.0.0-beta.11 | 152,339 | 12/8/2023 |
1.0.0-beta.10 | 7,691 | 12/7/2023 |
1.0.0-beta.9 | 330,437 | 11/6/2023 |
1.0.0-beta.8 | 733,713 | 9/21/2023 |
1.0.0-beta.7 | 388,677 | 8/25/2023 |
1.0.0-beta.6 | 542,200 | 7/19/2023 |
1.0.0-beta.5 | 1,229,167 | 3/22/2023 |
1.0.0-beta.4 | 10,380 | 2/23/2023 |
1.0.0-beta.3 | 422 | 2/17/2023 |
1.0.0-beta.2 | 661 | 2/8/2023 |
1.0.0-beta.1 | 821 | 2/7/2023 |