Microsoft.ML.Tokenizers.Data.R50kBase
0.22.0-preview.24526.1
A newer version of this package is available. See the version list below for details.
dotnet add package Microsoft.ML.Tokenizers.Data.R50kBase --version 0.22.0-preview.24526.1
NuGet\Install-Package Microsoft.ML.Tokenizers.Data.R50kBase -Version 0.22.0-preview.24526.1
<PackageReference Include="Microsoft.ML.Tokenizers.Data.R50kBase" Version="0.22.0-preview.24526.1" />
paket add Microsoft.ML.Tokenizers.Data.R50kBase --version 0.22.0-preview.24526.1
#r "nuget: Microsoft.ML.Tokenizers.Data.R50kBase, 0.22.0-preview.24526.1"
// Install Microsoft.ML.Tokenizers.Data.R50kBase as a Cake Addin
#addin nuget:?package=Microsoft.ML.Tokenizers.Data.R50kBase&version=0.22.0-preview.24526.1&prerelease
// Install Microsoft.ML.Tokenizers.Data.R50kBase as a Cake Tool
#tool nuget:?package=Microsoft.ML.Tokenizers.Data.R50kBase&version=0.22.0-preview.24526.1&prerelease
About
The Microsoft.ML.Tokenizers.Data.R50kBase package includes the Tiktoken tokenizer data file r50k_base.tiktoken, which is used by models such as text-davinci-001.
Key Features
- This package mainly contains the r50k_base.tiktoken file, which is used by the Tiktoken tokenizer. This data file is used by the following models:
  1. text-davinci-001
  2. text-curie-001
  3. text-babbage-001
  4. text-ada-001
  5. davinci
  6. curie
  7. babbage
  8. ada
  9. text-similarity-davinci-001
  10. text-similarity-curie-001
  11. text-similarity-babbage-001
  12. text-similarity-ada-001
  13. text-search-davinci-doc-001
  14. text-search-curie-doc-001
  15. text-search-babbage-doc-001
  16. text-search-ada-doc-001
  17. code-search-babbage-code-001
  18. code-search-ada-code-001
How to Use
Reference this package in your project to use the Tiktoken tokenizer with the specified models.
using Microsoft.ML.Tokenizers;
// Create a tokenizer for the specified model or any other listed model name
Tokenizer modelTokenizer = TiktokenTokenizer.CreateForModel("text-davinci-001");
// Create a tokenizer for the specified encoding
Tokenizer encodingTokenizer = TiktokenTokenizer.CreateForEncoding("r50k_base");
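Once a tokenizer has been created, a typical next step is to encode text, count tokens, and decode the IDs back. The following is a minimal sketch, assuming the project references both Microsoft.ML.Tokenizers and this data package, and that the `EncodeToIds`, `CountTokens`, and `Decode` methods of the Microsoft.ML.Tokenizers `Tokenizer` class are available in the referenced version. The sample text is hypothetical.

```csharp
using System;
using Microsoft.ML.Tokenizers;

// Assumption: Microsoft.ML.Tokenizers and Microsoft.ML.Tokenizers.Data.R50kBase
// are both referenced, so the r50k_base data file can be resolved at run time.
Tokenizer tokenizer = TiktokenTokenizer.CreateForEncoding("r50k_base");

// Hypothetical sample input, used only for illustration.
string text = "Hello, tokenizer!";

// Encode the text into a read-only list of token IDs.
var ids = tokenizer.EncodeToIds(text);

// Count the tokens directly when the IDs themselves are not needed.
int count = tokenizer.CountTokens(text);

// Decode the IDs back into text.
var decoded = tokenizer.Decode(ids);

Console.WriteLine($"Token count: {count}");
Console.WriteLine($"Token IDs: {string.Join(", ", ids)}");
Console.WriteLine($"Decoded text: {decoded}");
```

`CountTokens` is convenient for checking a prompt against a model's token limit, since it avoids building the ID list when only the length matters.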
Main Types
Users shouldn't use any types exposed by this package directly. This package is intended to provide tokenizer data files.
Additional Documentation
Related Packages
Microsoft.ML.Tokenizers
Feedback & Contributing
Microsoft.ML.Tokenizers.Data.R50kBase is released as open source under the MIT license. Bug reports and contributions are welcome at the GitHub repository.
Product | Versions (compatible and additional computed target framework versions) |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
- .NETStandard 2.0
  - Microsoft.ML.Tokenizers (>= 0.22.0-preview.24526.1)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
1.0.0 | 358 | 11/14/2024 |
0.22.0 | 110 | 11/13/2024 |
0.22.0-preview.24526.1 | 117 | 10/27/2024 |
0.22.0-preview.24522.7 | 174 | 10/23/2024 |