WordNormalization 1.0.1

There is a newer version of this package available.
See the version list below for details.

WordNormalization Library

This library provides utilities for normalizing Spanish text: lowercasing and trimming whitespace, replacing or removing Spanish special characters, removing a configurable set of words, and converting text to title case.

Features

  • Normalize words by converting to lowercase, removing extra whitespace, and more.
  • Replace Spanish special characters with their ASCII equivalents.
  • Remove Spanish special characters.
  • Remove specific words from a given text.
  • Convert text to title case.
  • Dynamically add or remove words to be excluded from text.
  • Load removable words from a JSON file.

Installation

dotnet add package WordNormalization

Usage

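using System;                     // Console
using System.Collections.Generic; // List<string>
using System.IO;                  // File
using WordNormalization;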

// Normalize a word
string normalizedWord = Normalize.NormalizeWord("áéíóúüñÁÉÍÓÚÜÑ Hola como estas los días están buenos");
Console.WriteLine(normalizedWord); // Output: "Aeiouun AEIOUUN Hola Como Estas Días Están Buenos"

// Replace Spanish special characters
string replacedWord = Normalize.ReplaceSpanishSpecialCharacters("áéíóúüñÁÉÍÓÚÜÑ");
Console.WriteLine(replacedWord); // Output: "aeiouunAEIOUUN"

// Remove Spanish special characters
string removedSpecialCharsWord = Normalize.RemoveSpanishSpecialCharacters("áéíóúüñÁÉÍÓÚÜÑ");
Console.WriteLine(removedSpecialCharsWord); // Output: ""

// Remove specific words
string cleanedWord = Normalize.RemoveRemovableWords("Hola como estas los días están buenos");
Console.WriteLine(cleanedWord); // Output: "Hola como estas días están buenos"

// Convert to title case
string titleCaseWord = Normalize.ToTitleCase("this is a sentence");
Console.WriteLine(titleCaseWord); // Output: "This Is A Sentence"

// Add dynamic removable words
Normalize.AddDynamicRemovableWords(new List<string> { "días" });
cleanedWord = Normalize.RemoveRemovableWords("Hola como estas los días están buenos");
Console.WriteLine(cleanedWord); // Output: "Hola como estas están buenos"

// Load removable words from JSON
string json = File.ReadAllText("RemovableWords.json");
Normalize.AddDynamicRemovableWordsFromJson(json);
cleanedWord = Normalize.RemoveRemovableWords("Hola como estas en el dia de hoy?");
Console.WriteLine(cleanedWord); // Output: "Hola como estas dia hoy?"
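
For readers curious how the character replacement shown above can work, here is a minimal sketch that uses only standard .NET APIs. It is not the library's implementation: the class AccentFoldingSketch and the method FoldAccents are hypothetical names introduced for illustration. The approach assumed here is Unicode decomposition followed by dropping combining marks, which may differ from what ReplaceSpanishSpecialCharacters actually does.

```csharp
using System;
using System.Globalization;
using System.Text;

// Hypothetical helper for illustration; NOT part of the WordNormalization package.
public static class AccentFoldingSketch
{
    public static string FoldAccents(string text)
    {
        // FormD splits each accented letter into a base letter plus
        // combining marks, e.g. "á" becomes "a" + U+0301.
        string decomposed = text.Normalize(NormalizationForm.FormD);

        var builder = new StringBuilder(decomposed.Length);
        foreach (char c in decomposed)
        {
            // Keep everything except the combining (non-spacing) marks.
            if (CharUnicodeInfo.GetUnicodeCategory(c) != UnicodeCategory.NonSpacingMark)
            {
                builder.Append(c);
            }
        }

        // Recompose whatever remains into the usual composed form.
        return builder.ToString().Normalize(NormalizationForm.FormC);
    }

    public static void Main()
    {
        Console.WriteLine(FoldAccents("áéíóúüñÁÉÍÓÚÜÑ")); // prints "aeiouunAEIOUUN"
    }
}
```

A decomposition-based approach handles any accented Latin letter rather than a fixed table of Spanish characters. A fixed lookup table would be the simpler choice if only the fourteen characters above need to be covered.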

Contributing

Contributions are welcome! Please fork this repository and submit pull requests.

Compatible and additional computed target framework versions

.NET: net8.0 is compatible. The following were computed: net8.0-android, net8.0-browser, net8.0-ios, net8.0-maccatalyst, net8.0-macos, net8.0-tvos, net8.0-windows.

  • net8.0: No dependencies.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version  Downloads  Last updated
1.0.2    134        7/23/2024
1.0.1    93         7/23/2024
1.0.0    92         7/22/2024