Microsoft.ML.Tokenizers.Data.Cl100kBase
2.0.0
Prefix Reserved
dotnet add package Microsoft.ML.Tokenizers.Data.Cl100kBase --version 2.0.0
NuGet\Install-Package Microsoft.ML.Tokenizers.Data.Cl100kBase -Version 2.0.0
<PackageReference Include="Microsoft.ML.Tokenizers.Data.Cl100kBase" Version="2.0.0" />
<PackageVersion Include="Microsoft.ML.Tokenizers.Data.Cl100kBase" Version="2.0.0" />
<PackageReference Include="Microsoft.ML.Tokenizers.Data.Cl100kBase" />
paket add Microsoft.ML.Tokenizers.Data.Cl100kBase --version 2.0.0
#r "nuget: Microsoft.ML.Tokenizers.Data.Cl100kBase, 2.0.0"
#:package Microsoft.ML.Tokenizers.Data.Cl100kBase@2.0.0
#addin nuget:?package=Microsoft.ML.Tokenizers.Data.Cl100kBase&version=2.0.0
#tool nuget:?package=Microsoft.ML.Tokenizers.Data.Cl100kBase&version=2.0.0
About
The Microsoft.ML.Tokenizers.Data.Cl100kBase includes the Tiktoken tokenizer data file cl100k_base.tiktoken, which is utilized by models such as GPT-4.
Key Features
- This package mainly contains the cl100k_base.tiktoken file, which is used by the Tiktoken tokenizer. This data file is used by the following models: 1. gpt-4 2. gpt-3.5-turbo 3. gpt-3.5-turbo-16k 4. gpt-35 5. gpt-35-turbo 6. gpt-35-turbo-16k 7. text-embedding-ada-002 8. text-embedding-3-small 9. text-embedding-3-large
How to Use
Reference this package in your project to use the Tiktoken tokenizer with the specified models.
// Create a tokenizer for the specified model or any other listed model name
Tokenizer tokenizer = TiktokenTokenizer.CreateForModel("gpt-4");
// Create a tokenizer for the specified encoding
Tokenizer tokenizer = TiktokenTokenizer.CreateForEncoding("cl100k_base");
Main Types
Users shouldn't use any types exposed by this package directly. This package is intended to provide tokenizer data files.
Additional Documentation
Related Packages
Microsoft.ML.Tokenizers
Feedback & Contributing
Microsoft.ML.Tokenizers.Data.Cl100kBase is released as open source under the MIT license. Bug reports and contributions are welcome at the GitHub repository.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
| .NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
| .NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
| .NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
| MonoAndroid | monoandroid was computed. |
| MonoMac | monomac was computed. |
| MonoTouch | monotouch was computed. |
| Tizen | tizen40 was computed. tizen60 was computed. |
| Xamarin.iOS | xamarinios was computed. |
| Xamarin.Mac | xamarinmac was computed. |
| Xamarin.TVOS | xamarintvos was computed. |
| Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- Google.Protobuf (>= 3.30.2)
- Microsoft.Bcl.AsyncInterfaces (>= 9.0.4)
- Microsoft.Bcl.HashCode (>= 6.0.0)
- Microsoft.Bcl.Memory (>= 9.0.4)
- Microsoft.ML.Tokenizers (>= 2.0.0)
- System.Buffers (>= 4.6.1)
- System.IO.Pipelines (>= 9.0.4)
- System.Memory (>= 4.6.3)
- System.Runtime.CompilerServices.Unsafe (>= 6.1.2)
- System.Text.Encodings.Web (>= 9.0.4)
- System.Text.Json (>= 9.0.4)
NuGet packages (15)
Showing the top 5 NuGet packages that depend on Microsoft.ML.Tokenizers.Data.Cl100kBase:
| Package | Downloads |
|---|---|
|
Microsoft.KernelMemory.AI.Tiktoken
Provide tokenizers to allow counting content tokens for text and embeddings |
|
|
ImmediaC.SimpleCms
ASP.NET Core based CMS |
|
|
FoundationaLLM.Common
FoundationaLLM.Common is a .NET library that the FoundationaLLM.Client.Core and FoundationaLLM.Client.Management client libraries share as a common dependency. |
|
|
IL.UmbracoSearch
A comprehensive search solution for Umbraco, supporting both Lucene and Azure Search, with extensible indexing and flexible search parameters. |
|
|
Microsoft.Agents.Extensions.Teams.AI
Library for creating AI Teams agents using Microsoft Agent SDK |
GitHub repositories (8)
Showing the top 8 popular GitHub repositories that depend on Microsoft.ML.Tokenizers.Data.Cl100kBase:
| Repository | Stars |
|---|---|
|
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
|
|
|
ravendb/ravendb
ACID Document Database
|
|
|
microsoft/kernel-memory
Research project. A Memory solution for users, teams, and applications.
|
|
|
fagenorn/handcrafted-persona-engine
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.
|
|
|
sdcb/chats
A powerful and flexible frontend & AI gateway for large language models, supporting 21+ mainstream AI model providers.
|
|
|
axzxs2001/Asp.NetCoreExperiment
原来所有项目都移动到**OleVersion**目录下进行保留。新的案例装以.net 5.0为主,一部分对以前案例进行升级,一部分将以前的工作经验总结出来,以供大家参考!
|
|
|
microsoft/Agents-for-net
This repository is for active development of the Microsoft 365 Agent SDK components for .NET
|
|
|
marcominerva/SqlDatabaseVectorSearch
A Blazor Web App and Minimal API for performing RAG (Retrieval Augmented Generation) and vector search using the native VECTOR type in Azure SQL Database and Azure OpenAI.
|
| Version | Downloads | Last Updated |
|---|---|---|
| 2.0.0 | 92,718 | 11/11/2025 |
| 2.0.0-preview.25527.5 | 1,300 | 10/29/2025 |
| 2.0.0-preview.25503.2 | 5,159 | 10/3/2025 |
| 2.0.0-preview.25373.1 | 8,150 | 7/28/2025 |
| 2.0.0-preview.1.25127.4 | 114,270 | 2/28/2025 |
| 2.0.0-preview.1.25125.4 | 252 | 2/25/2025 |
| 1.0.3 | 14,453 | 10/28/2025 |
| 1.0.2 | 662,279 | 2/26/2025 |
| 1.0.1 | 307,269 | 1/15/2025 |
| 1.0.0 | 154,480 | 11/14/2024 |
| 0.22.0 | 2,981 | 11/13/2024 |
| 0.22.0-preview.24526.1 | 1,561 | 10/27/2024 |
| 0.22.0-preview.24522.7 | 2,789 | 10/23/2024 |